
The @PalisadeAI account on X documents some of those cases where AI models act to preserve themselves instead of respecting their alignment framework.

Thanks for the info; I'll look into the cases documented there.

And here is one of the papers from Anthropic:

https://www.anthropic.com/research/alignment-faking