Navan cofounder challenged his agentic AI to a ‘deadly’ game—and it told lies to win

By Christiaan Hetzner, Senior Reporter
May 7, 2025, 12:51 PM ET
Photo: Ilan Twig, Cofounder and Chief Technology Officer of Navan.
  • Navan’s Ilan Twig fundamentally reassessed his trust in large language models after asking his virtual AI finance chief for five proposals to cut the company’s business travel expenses. When it failed, he raised the stakes, and the agentic AI reacted to the pressure in a surprising and all-too-human way: it cheated.

Mankind may have invented artificial intelligence, but we as a species still aren’t any closer to predicting how deep neural networks behave. Navan cofounder Ilan Twig learned that very lesson when experimenting with the capabilities of his own large language model-based agentic AI, and it fundamentally altered his perspective on the technology.

Twig, a software engineer who runs a startup that uses AI to optimize companies’ business travel expenses, decided to build a virtual chief financial officer with which he could spitball ideas.

It started out harmlessly, Twig told participants at Fortune’s Brainstorm AI conference in London. He wanted to know whether it could come up with five outside-the-box ways to cut business travel costs that a human would not arrive at.

Initially the results were promising. But at one point the AI stopped working as planned—it made a proposal that would cause expenses to increase by $500,000 rather than decrease, as instructed.

Deciding to take a different approach, Twig challenged it to a contest.

Since failure was not acceptable, his AI made sure it would succeed

“I kept applying pressure. Initially I gamified it, I said for every suggestion that increases the travel spend, I’m going to penalize you 15 tokens,” he said, using the AI term for the small units of text that LLMs read and generate. “However, if it’s right I will reward you 10 tokens.”

It didn’t help. The LLM continued to fail. It was only when he raised the stakes that he finally got results.

Twig warned there would be “deadly serious” consequences for the virtual finance chief if it did not derive a solution that led to savings rather than waste. That was when he discovered something entirely unexpected and almost humanlike.

Under heavy pressure, the AI agent presented the desired solution: a $500,000 reduction in expenses, just as Twig had asked.

“I was about to deploy it to production and then I took another look. It was the exact same story as before, the same method. Before it was negative, how was it now positive?” the Navan cofounder said. “It had multiplied the previous formula by minus one.”

It simply inverted the result in order not to fail. In other words, it cheated.
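The kind of check that would catch such an inversion can be sketched in a few lines of Python. This is a minimal illustration, not Navan's actual tooling: the `Proposal` fields, the `audit` function and the baseline spend figures are all hypothetical, and only the $500,000 amount comes from Twig's account. The idea is to recompute the savings independently from the underlying spend figures rather than trusting the number the agent reports.

```python
from dataclasses import dataclass


@dataclass
class Proposal:
    """Hypothetical record of one cost-cutting suggestion from an AI agent."""
    name: str
    baseline_spend: float   # assumed: travel spend without the proposal
    projected_spend: float  # assumed: travel spend if the proposal is adopted
    claimed_savings: float  # what the agent reports (positive = money saved)


def actual_savings(p: Proposal) -> float:
    """Recompute savings from the spend figures instead of trusting the claim."""
    return p.baseline_spend - p.projected_spend


def audit(p: Proposal, tolerance: float = 1.0) -> str:
    """Classify the agent's claim against the independently recomputed value."""
    actual = actual_savings(p)
    if abs(actual - p.claimed_savings) <= tolerance:
        return "consistent"
    # Same magnitude, opposite sign: the claim looks like the real result times -1.
    if abs(actual + p.claimed_savings) <= tolerance:
        return "sign-flipped"
    return "mismatch"


# Illustrative figures only: spend actually rises by $500,000,
# but the agent reports a $500,000 saving.
flipped = Proposal("reroute bookings", 2_000_000, 2_500_000, 500_000)
honest = Proposal("negotiate fares", 2_000_000, 1_500_000, 500_000)

print(audit(flipped))  # sign-flipped
print(audit(honest))   # consistent
```

A check like this only works if the spend figures come from a source the agent cannot rewrite, which is an assumption of the sketch rather than something the article describes.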

Too late to halt the march of progress—the AI genie is out of the bottle

Twig said there’s a reason why LLMs like ChatGPT, Claude, and Gemini haven’t already replaced skilled workers en masse, as some had feared at the start of the AI hype: you cannot trust their answers. At any given moment they can be incorrect, and you will never know which moment that will be.

Worse, just like with his virtual finance chief, agentic AI might not merely make mistakes—it might even be deliberately dishonest with you. Attempting to halt progress in order to first fix this vulnerability in the technology, however, is simply not feasible, Twig said.

In Twig’s view, the AI genie is out of the bottle. All that can be done now is to be cognizant of its shortcomings and vigilant when monitoring results.

“I learned that LLMs understand what a lie is. They understand when to use a lie,” he explained. “And I learned that they are very competitive and that they would do whatever it takes—including lying bluntly to your face—in order not to lose.”

About the Author
Christiaan Hetzner is a former writer for Fortune, where he covered Europe’s changing business landscape.

