Part 7/11:
The capability of an AI to control a computer naturally raises safety concerns. Anthropic emphasizes that they are aware of these risks. They have implemented safety measures, including not training Claude on user screenshots or allowing web access during training. Moreover, they’ve developed classifiers designed to steer the AI away from risky behaviors—such as creating social media accounts or interacting with government websites.