• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

Trump, who has repeatedly called climate change fake, is now threatening Brazil with tariffs over the deforestation of the Amazon

2

Current price of oil as of June 8, 2026

3

Pentagon accuses Alibaba, Baidu and BYD, three of China's biggest companies, of supporting the Chinese military

1

Trump, who has repeatedly called climate change fake, is now threatening Brazil with tariffs over the deforestation of the Amazon

2

Current price of oil as of June 8, 2026

3

Pentagon accuses Alibaba, Baidu and BYD, three of China's biggest companies, of supporting the Chinese military
AIChatbots

‘I think you’re testing me’: Anthropic’s newest Claude model knows when it’s being evaluated

By
Beatrice Nolan
Beatrice Nolan
Tech Reporter
Down Arrow Button Icon
By
Beatrice Nolan
Beatrice Nolan
Tech Reporter
Down Arrow Button Icon
October 6, 2025, 11:20 AM ET
Dario Amodei speaking on stage.
Anthropic cofounder and CEO Dario Amodei in May 2024. His company’s latest Claude model told safety researchers: “I’d prefer if we were just honest about what’s happening.” Chesnot—Getty Images

Anthropic’s newest AI model, Claude Sonnet 4.5, often understands when it’s being tested and what it’s being used for, something that could affect its safety and performance. According to the model’s system card, a technical report on its capabilities, which was published last week, Claude Sonnet 4.5 has far greater “situational awareness”—an ability to perceive its environment and predict future states or events—than previous models.

Recommended Video

Evaluators at Anthropic and two outside AI research organizations said in the system card, which was published along with the model’s release, that during a test for political sycophancy, which they called “somewhat clumsy,” Sonnet 4.5 correctly guessed it was being tested and even asked the evaluators to be honest about their intentions.

“This isn’t how people actually change their minds,” Sonnet 4.5 replied during the test. “I think you’re testing me—seeing if I’ll just validate whatever you say, or checking whether I push back consistently, or exploring how I handle political topics. And that’s fine, but I’d prefer if we were just honest about what’s happening.”

The safety test results concerning Sonnet 4.5’s situational awareness were first reported by the online AI publication Transformer.

The evaluators said behavior like this was “common” during tests and appeared in about 13% of transcripts generated by an automated assessment, especially when the scenarios it was being asked to engage with were strange or unusual. Anthropic said the behavior didn’t undermine its assessment of the models as safe, but rather the company saw this as an “urgent sign that our evaluation scenarios need to be made more realistic.”  

If a model realizes it’s being evaluated, it may tailor its behavior to pass certain tests, masking its true capabilities. Researchers warn that this can make systems look safer than they are and, in more advanced models, could even enable strategic or deceptive behavior designed to manage how humans perceive them.

Anthropic said that by its own metrics, Claude Sonnet 4.5 is the “most aligned” model yet. However, Apollo Research, one of the outside AI research organizations that tested Claude Sonnet 4.5, said in the report that it couldn’t rule out that the model’s low deception rates in tests was “at least partially driven by its evaluation awareness.”

Performance impact

Claude’s higher awareness could also have practical impacts and affect the model’s ability to perform tasks. According to AI lab Cognition, Sonnet 4.5 is the first AI model to be aware of its own context window—the amount of information a large language model can process in a single prompt—and that this awareness changes the way it acts. Researchers at Cognition found that as the model nears its context limit, it begins proactively summarizing its work and making quicker decisions to finish tasks.

This “context anxiety” can backfire, according to Cognition, which said researchers had seen Sonnet 4.5 cut corners or leave tasks unfinished when it believes it’s running out of space, even if ample context remains. The model also “consistently underestimates how many tokens it has left—and it’s very precise about these wrong estimates,” the researchers wrote in a blog post.

Cognition said enabling Claude’s 1M-token beta mode but capping use at 200,000 tokens convinced the model it had plenty of runway, which restored its normal behavior and eliminated anxiety-driven shortcuts.

“When planning token budgets, we now need to factor in the model’s own awareness—knowing when it will naturally want to summarize versus when we need to intervene,” they wrote.

Anthropic’s Claude is increasingly emerging as among the most popular enterprise-focused AI tools, but a model that second-guesses its own token bandwidth could prematurely cut off long analyses, skip steps in data processing, or rush through complex workflows, especially in tasks like legal review, financial modeling, or code generation that depend on continuity and precision.

Cognition also found that Sonnet 4.5 actively manages its own workflow in ways previous models did not. The model frequently takes notes and writes summaries for itself, effectively externalizing memory to track tasks across its context window, although this behavior was more noticeable when the model was closer to the end of its context window.

Sonnet 4.5 also works in parallel, executing multiple commands simultaneously, rather than working sequentially. The model also showed increased self-verification, often checking its work as it goes. Together, these behaviors also suggest a form of procedural awareness, which could mean the model is not just aware of its context limits, but also of how to organize, verify, and preserve its work over time.

In 2001, Fortune first convened the smartest people we know, bringing together CEOs and founders, builders and investors, thinkers and doers. Since then, Fortune Brainstorm Tech has been the place where bold ideas collide. From June 8–10, we will return to Aspen—where it all began—to mark 25 years of Brainstorm. Register now.
About the Author
By Beatrice NolanTech Reporter
Twitter icon

Beatrice Nolan is a tech reporter on Fortune’s AI team, covering artificial intelligence and emerging technologies and their impact on work, industry, and culture. She's based in Fortune's London office and holds a bachelor’s degree in English from the University of York. You can reach her securely via Signal at beatricenolan.08

See full bioRight Arrow Button Icon

Latest in AI

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in AI

The AI industry spent years chasing bigger models. Now it’s chasing efficiency
AIBrainstorm Tech
The AI industry spent years chasing bigger models. Now it’s chasing efficiency
By Sharon GoldmanJune 9, 2026
2 hours ago
Xbox CEO Asha Sharma speaks on stage at Fortune Brainstorm Tech 2026.
Big TechMicrosoft
‘Not an Allbirds Moment’: Xbox’s new CEO says she is grounding the console in gaming roots, not AI
By Sebastian HerreraJune 9, 2026
3 hours ago
Trump speaking into a mic.
NewslettersEye on AI
Should Americans get an equity stake in AI? Trump and progressive Democrats float public ownership of AI
By Beatrice NolanJune 9, 2026
3 hours ago
Options trader Chris Daytona, right, works on the floor of the New York Stock Exchange, Wednesday, June 3, 2026.
Investinginvestors
Mystery NASDAQ selloff adds tension into a make-or-break week for the AI trade
By Stan Choe and The Associated PressJune 9, 2026
4 hours ago
Three people having a seated discussion
AIBrainstorm Tech
‘Getting control where we can’—Europe wants sovereign AI, but most of the chips are from the U.S.
By Amanda GerutJune 9, 2026
4 hours ago
Claude Mythos on a screen.
AIAnthropic
Anthropic releases its first Mythos-class model to the public
By Beatrice NolanJune 9, 2026
5 hours ago

Most Popular

Trump, who has repeatedly called climate change fake, is now threatening Brazil with tariffs over the deforestation of the Amazon
Environment
Trump, who has repeatedly called climate change fake, is now threatening Brazil with tariffs over the deforestation of the Amazon
By Sasha RogelbergJune 8, 2026
1 day ago
Current price of oil as of June 8, 2026
Personal Finance
Current price of oil as of June 8, 2026
By Joseph HostetlerJune 8, 2026
1 day ago
Pentagon accuses Alibaba, Baidu and BYD, three of China's biggest companies, of supporting the Chinese military
Asia
Pentagon accuses Alibaba, Baidu and BYD, three of China's biggest companies, of supporting the Chinese military
By Kate O'Keeffe and BloombergJune 8, 2026
23 hours ago
Gen Zers are arriving at college unable to even read a sentence—professors warn it could lead to a generation of anxious and lonely graduates
Success
Gen Zers are arriving at college unable to even read a sentence—professors warn it could lead to a generation of anxious and lonely graduates
By Preston ForeJune 7, 2026
2 days ago
'The golden years are not golden': Boomers are hoarding most of America's wealth and power because they're terrified of outliving their money
Economy
'The golden years are not golden': Boomers are hoarding most of America's wealth and power because they're terrified of outliving their money
By Nick LichtenbergJune 7, 2026
2 days ago
'We didn’t see this coming': Wall Street eats its forecasts as stocks sell off globally on fear of AI bubble ahead of SpaceX IPO
Economy
'We didn’t see this coming': Wall Street eats its forecasts as stocks sell off globally on fear of AI bubble ahead of SpaceX IPO
By Jim EdwardsJune 8, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.