• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechAI

Microsoft knows you love tricking its AI chatbots into doing weird stuff and it’s designing ‘prompt shields’ to stop you

By
Jackie Davalos
Jackie Davalos
and
Bloomberg
Bloomberg
Down Arrow Button Icon
By
Jackie Davalos
Jackie Davalos
and
Bloomberg
Bloomberg
Down Arrow Button Icon
March 29, 2024, 8:24 AM ET
Sarah Bird
Sarah Bird, principal program manager and responsible AI lead for Azure AI at Microsoft, at the company's headquarters in Redmond, Washington, on Feb. 7, 2023. Chona Kasinger/Bloomberg via Getty Images

Microsoft Corp. is trying to make it harder for people to trick artificial intelligence chatbots into doing weird things. 

Recommended Video

New safety features are being built into Azure AI Studio which lets developers build customized AI assistants using their own data, the Redmond, Washington-based company said in a blog post on Thursday. 

The tools include “prompt shields,” which are designed to detect and block deliberate attempts — also known as prompt injection attacks or jailbreaks  — to make an AI model behave in an unintended way. Microsoft is also addressing “indirect prompt injections,” when hackers insert malicious instructions into the data a model is trained on and trick it into performing such unauthorized actions as stealing user information or hijacking a system. 

Such attacks are “a unique challenge and threat,” said Sarah Bird, Microsoft’s chief product officer of responsible AI. The new defenses are designed to spot suspicious inputs and block them in real time, she said. Microsoft is also rolling out a feature that alerts users when a model makes things up or generates erroneous responses.

Microsoft is keen to boost trust in its generative AI tools, which are now being used by consumers and corporate customers alike. In February, the company investigated incidents involving its Copilot chatbot, which was generating responses that ranged from weird to harmful. After reviewing the incidents, Microsoft said users had deliberately tried to fool Copilot into generating the responses.

“Certainly we see it increasing as there’s more use of the tools but also as more people are aware of these different techniques,” Bird said. Tell-tale signs of such attacks include asking a chatbot a question multiple times or prompts that describe role-playing. 

Microsoft is OpenAI’s largest investor and has made the partnership a key part of its AI strategy. Bird said Microsoft and OpenAI are dedicated to deploying AI safely and building protections into the large language models underlying generative AI. 

“However, you can’t rely on the model alone,” she said. “These jailbreaks for example, are an inherent weakness of the model technology.” 

Join us at the Fortune Workplace Innovation Summit May 19–20, 2026, in Atlanta. The next era of workplace innovation is here—and the old playbook is being rewritten. At this exclusive, high-energy event, the world’s most innovative leaders will convene to explore how AI, humanity, and strategy converge to redefine, again, the future of work. Register now.
About the Authors
By Jackie Davalos
See full bioRight Arrow Button Icon
By Bloomberg
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

iran
Cybersecuritycyber
‘There are a lot more attacks happening that aren’t being reported’: Iran’s cyber response creeps across the globe
By David Klepper and The Associated PressMarch 29, 2026
1 hour ago
lanzone
AIYahoo
Yahoo CEO Jim Lanzone on ‘the white whale of turnarounds’ and turning to AI—licensed from Anthropic
By Michael Liedtke and The Associated PressMarch 29, 2026
2 hours ago
sony
PoliticsSony PlayStation
Sony raises PlayStation price another $100, second price hike in under a year
By Matt Ott and The Associated PressMarch 29, 2026
2 hours ago
big tech
EnvironmentData centers
Big tech was embracing clean energy and turning a corner on climate change. Then AI data centers arrived
By Tammy Webber and The Associated PressMarch 29, 2026
2 hours ago
Traders work on the floor of the New York Stock Exchange during morning trading on March 25, 2026 in New York City.
Big TechIran
The Iran war turned Mag 7 stocks into dip-buying bait. But no one is jumping in yet even though Wall Street expects U.S. tech to outperform
By Eva RoytburgMarch 29, 2026
2 hours ago
AI
AIPsychology
AI is so sycophantic there’s a Reddit channel called ‘AITA’ documenting its sociopathic advice
By Matt O'Brien and The Associated PressMarch 29, 2026
2 hours ago

Most Popular

Europe
413,793 KitKat bars stolen: 'Whilst we appreciate the criminals’ exceptional taste, the fact remains that cargo theft is an escalating issue'
By Fortune EditorsMarch 28, 2026
21 hours ago
Economy
U.S. debt suddenly draws weaker demand as $10 trillion must be rolled over this year amid Iran war. 'The bond market remains undefeated'
By Fortune EditorsMarch 28, 2026
1 day ago
Energy
Saudi pipeline to bypass Hormuz hits 7 million barrel goal
By Fortune EditorsMarch 28, 2026
18 hours ago
Economy
The stay-at-home boyfriend is now an economic trend as more women than men go to work
By Fortune EditorsMarch 28, 2026
1 day ago
Success
Meetings are not work, says Southwest Airlines CEO—and he’s taking action by blocking his calendar every afternoon from Wednesday to Friday 
By Fortune EditorsMarch 27, 2026
2 days ago
Personal Finance
Current price of gold as of March 27, 2026
By Fortune EditorsMarch 27, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.