You are viewing a single comment's thread from:

RE: LeoThread 2024-10-22 09:10

in LeoFinance11 months ago

Anthropic just made it harder for AI to go rogue with its updated safety policy

Anthropic, the artificial intelligence company behind the popular Claude chatbot, today announced a sweeping update to its Responsible Scaling Policy (RSP), aimed at mitigating the risks of highly capable AI systems.

#technology #anthropic #ai #safety #newsonleo

Sort:  

The policy, originally introduced in 2023, has evolved with new protocols to ensure that AI models, as they grow more powerful, are developed and deployed safely.

This revised policy sets out specific Capability Thresholds—benchmarks that indicate when an AI model’s abilities have reached a point where additional safeguards are necessary.