Part 2/13:
Anthropic’s latest update to Claude, version 4.1, quietly made its debut but carries significant improvements, especially in coding and complex reasoning tasks. Building upon the previous 4.0 iteration, Claude 4.1 achieved a new benchmark score of 74.5% on SWBench—a test designed to evaluate an AI’s ability to fix real-world code rather than toy examples. This performance surpasses many current models, demonstrating Anthropic’s focus on making Claude not just conversationally smart but practically capable.