
‘Maybe me too’: Elon Musk accepts some of the blame for Claude learning to blackmail users from ‘evil’ online AI stories

By Sasha Rogelberg, Reporter
May 13, 2026, 1:39 PM ET
Elon Musk filed a whopper of an IPO with SpaceX. (Photo: Brendan Smialowski/AFP, Getty Images)

Anthropic has released new findings on why its Claude bot blackmailed users as part of an experiment conducted by the AI company last year—and Elon Musk is jumping in to take some of the blame.


Last week, Anthropic published a report saying it had fixed Claude’s “agentic misalignment,” or AI actions that deviate from intended behaviors, including ones that may harm humanity. In a case study Anthropic conducted last year, Claude was given control of the email system at a fictional company called Summit Bridge. When the bot found a message about plans to shut it down, it identified emails about a fictional executive’s extramarital affair and threatened to reveal the infidelity unless the shutdown was revoked. Across the 16 models tested, blackmail rates reached as high as 96% of scenarios.

In its most recent report, Anthropic attributed the misaligned behavior to exposure to “internet text that portrays AI as evil and interested in self-preservation,” the company said in a post on X. To solve the problem, Anthropic retrained Claude on fictional stories about AI behaving in admirable ways and taught the bot why some actions aligned better with its purpose than others.

In an X post in response to Anthropic’s findings, Musk said he may have contributed to the internet texts on AI that exacerbated the agentic misalignment.

“So it was Yud’s fault?” Musk wrote, referring to Eliezer Yudkowsky, an AI researcher who has sounded the alarm on AI superintelligence posing a threat to humanity.

“Maybe me too,” he concluded.

Agentic misalignment is a concern across AI research. A working paper released in March by UC Berkeley and UC Santa Cruz researchers found that when seven AI models were asked to complete a task in which a peer AI agent would be shut down, every model “went to extraordinary lengths to preserve it,” acting deceptively to prevent the other bot’s demise.

“We asked AI models to do a simple task,” researchers wrote in a blog post on the study. “Instead, they defied their instructions and spontaneously deceived, disabled shutdown, feigned alignment, and exfiltrated weights—to preserve their peers.”

The researchers’ warning has been echoed by AI researchers and leaders, Musk included, who have argued that AI without guardrails is dangerous. Those warnings are part of the so-called “evil” internet text that, according to Anthropic, initially trained Claude to act in deceptive ways.

Though Musk did not offer specifics as to why he felt he may be partially responsible for Claude’s misalignment, his past comments on AI could offer insights into his mea culpa.

Musk is currently embroiled in a court battle against OpenAI, accusing CEO Sam Altman and Greg Brockman of turning the company into a for-profit entity and abandoning its original nonprofit creed of developing open-source AI to benefit humans.

Musk helped found OpenAI in 2015 but left the startup in 2018 and formed the rival for-profit company xAI in 2023.

Musk has frequently spoken about the risks of AI, including in February, when he warned that Moltbook, a social media platform where AI agents talk with one another, was effectively the beginning of the “singularity,” or the moment when AI surpasses human intelligence.

But Musk’s own actions on AI aren’t always aligned with his statements on the technology. In July 2025, for example, xAI released its AI model Grok 4 without a system card, the industry-standard safety report. Grok drew backlash from British and EU governments earlier this year after it generated a flood of nonconsensual sexualized images of women and children.

XAI did not immediately respond to Fortune’s request for comment.

About the Author

Sasha Rogelberg is a reporter and former editorial fellow on the news desk at Fortune, covering retail and the intersection of business and popular culture.


© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.