As artificial intelligence models improve, the companies developing them are seeking more sophisticated ways to measure how ...
Anthropic’s Claude chatbot can now write and run JavaScript code. Today, Anthropic launched a new analysis tool that helps Claude respond with what the company describes as “mathematically precise and ...
Anthropic is starting to train its models on new Claude chats. If you’re using the bot and don’t want your chats used as training data, here’s how to opt out. Anthropic is prepared to repurpose ...
The new model shows significant gains in technical benchmarks. On the SWE-bench Verified evaluation, which tests real-world software coding skills, Claude Sonnet 4.5 achieves state-of-the-art results.
This article is adapted from an edition of our Off the Charts newsletter originally published in October 2021. Off the Charts is a weekly, subscriber-only guide to The Economist’s award-winning data ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results