• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

After forcing workers back to the office, Goldman Sachs and JPMorgan Chase are now letting their staff work remotely—but only for the World Cup

2

Markets tumble worldwide as Fed resets expectations: $400 billion wiped off SpaceX stock

3

Current price of oil as of June 23, 2026

1

After forcing workers back to the office, Goldman Sachs and JPMorgan Chase are now letting their staff work remotely—but only for the World Cup

2

Markets tumble worldwide as Fed resets expectations: $400 billion wiped off SpaceX stock

3

Current price of oil as of June 23, 2026
NewslettersEye on AI

OpenAI says its making progress on “The Alignment Problem”

Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
January 27, 2022, 12:59 PM ET
Updated March 21, 2023, 1:58 PM ET
Add Fortune on Google for similar content.

Hello and welcome to a new, special monthly edition of Fortune’s “Eye on A.I.” newsletter. Today,  OpenAI, the San Francisco A.I. research company, announced that it had made significant progress on something called “The Alignment Problem.”

The term refers to the difficulty of making sure that an A.I. system does what humans want it to do. In traditional software, alignment wasn’t much of an issue, because humans both chose the goal they wanted the software to accomplish and wrote a very specific instruction set, or code, detailing every step the computer should take to achieve it. If the program did something wrong along the way, it was because the instructions were faulty.

With A.I., alignment is harder. While humans might specify the goal, the software itself now learns how best to achieve it. Often, the logic behind the software’s decision in any particular case is opaque, even to the person who created the software. And this problem becomes more challenging the more capable an A.I. system becomes.

OpenAI is interested in alignment because its founding mission is the creation of artificial general intelligence (AGI). That’s the kind of super-intelligent software that, for now, remains the stuff of science fiction—a single system that can perform most cognitive tasks as well or better than a human.

“Alignment is critical to the mission of OpenAI,” Ilya Sutskever, the legendary machine learning researcher who is OpenAI’s co-founder and chief scientist, tells me. “We want to build general purpose A.I. to benefit humanity, so it must not just be smart, but safe, and does the complicated tasks that we want it to do safely.”

OpenAI has not managed to create AGI. But it already has an alignment problem on its hands with its sole commercial product. That product, which it simply calls The API, is an application programming interface that lets paying customers access the company’s algorithm GPT. The best known version of that algorithm is GPT-3, a massive natural language processing system that can compose long blocks of text that are often indistinguishable from human writing. GPT-3 can also perform a lot of other language tasks, including translation, summarization, and answering questions. OpenAI’s API is available to customers of Microsoft’s Azure cloud computing platform as well as to OpenAI’s own customers.

The problem is that it can be very difficult to get GPT-3 to compose text the way a user might want. Prompt the software to “Please explain the moon landing to a six-year old,” and the system might well begin writing similar phrases, such as, “Please explain climate change to a six-year old,” and “Please explain the big bang to a six-year old,” rather than actually summarizing the story of Apollo 11 using age-appropriate language, says Jan Leike, an OpenAI researcher who focuses on The Alignment Problem.

Another issue is that having been trained on a vast amount of written material scraped from the Internet and previously published books, the text GPT-3 generates can be sexist, racist, and Islamophobic. It has tendency to veer into descriptions of violence. It is also difficult to get GPT-3 to answer questions factually, as opposed to just making stuff up.

OpenAI now says that it has made progress towards solving these alignment problems by creating a new version of GPT, which it calls InstructGPT. InstructGPT starts out a bit like GPT-3 in basic design and training. It too initially learns about language by ingesting a giant amount of text scraped from the Internet and books. But InstructGPT is a much smaller piece of software, only handling some 1.5 billion different variables at a time, rather than the 175 billion that GPT-3 uses. That is important because it makes InstructGPT easier and less expensive to train.

After its initial training, InstructGPT is then fine-tuned with two additional steps. First, it is supplied with what Leike says were “a few tens of thousands of examples” of text humans wrote in response to the same sort of prompts that OpenAI’s customers use to try to get GPT-3 to do something. The system has to learn to imitate these human-written responses. Next, the system is further honed by asking it to generate two different responses to a prompt and having human reviewers pick the one they think is best. This information is then used to create an internal reward mechanism where InstructGPT itself has to guess which of the responses it has generated is most likely to be preferred by a human, and that becomes its output.

Leike tells me that InstructGPT has not completely cracked The Alignment Problem. “It will still sometimes ignore an instruction or say something toxic,” he says. It can also sometimes still generate violent prose and false information. He also says that InstructGPT is so good at following human instructions that there is potential for abuse—someone could very easily teach the system to be more racist or sexist, for example. But OpenAI found that the new InstructGPT is so much less likely to run off-the-rails than the original GPT-3, that it has decided to make InstructGPT the default algorithm for all of its customers. People can still opt to use the larger GPT-3 if they wish, but Leike says that so far the human reviewers and beta customers OpenAI has used to test the system much prefer InstructGPT’s responses, even though InstructGPT doesn’t perform quite as well as GPT-3 on some academic natural language processing benchmarks.

That’s not surprising. The Instruct version of GPT is safer and more trustworthy. And to most businesses, as long as a certain performance bar is cleared, that’s what matters. It also shows that academic benchmarks may be a poor proxy for the things businesses actually want natural language processing software to do.

It’s not clear we’re very close to achieving AGI. But it’s good to know that companies like OpenAI are at least thinking hard about The Alignment Problem—and making some progress towards solving it.

Thanks for reading this special edition. Here’s tidbits of A.I. news that have occurred since the last regular edition of the newsletter earlier this week.

Jeremy Kahn
@jeremyakahn 
jeremy.kahn@fortune.com

A.I. IN THE NEWS

Tesla says its "Tesla Bot" is on track to be "the most powerful A.I. development platform." That's according to Andrej Karpathy, Tesla's director of A.I., in a recent LinkedIn post touting job openings for A.I. and robotics researchers at the company. Meanwhile on an earnings call, Tesla founder and CEO Elon Musk said the humanoid robot, which the company calls "Optimus" internally, could be more important than its lineup of electric vehicles, Bloomberg News reported. 

Speaking of Elon Musk, his brain-computer interface company Neuralink is getting closer to putting a chip in a person's brain, but former company insiders paint a picture of impossible deadlines, dysfunctional management and an absent CEO. That's what my Fortune colleague and "Eye on A.I." co-writer Jonathan Vanian and I discovered after spending two months digging into the company. We also found that Neuralink has made some genuine advances in brain-computer interface hardware and helped spawn a whole industry of similar startups that are attracting real money from venture capital firms. But that doesn't mean Neuralink will be able to live up to Musk's radical vision for what the technology will do. You can check out our feature story in the current issue of Fortune magazine and on the web here. 

Donald Trump's new social network plans to use A.I. to moderate content. Trump's new Truth Social network, which will launch on President's Day in February, said it will used technology from San Francisco A.I. company Hive to keep sexually-explicit content, and posts that include violence, bullying, hate speech, and spam, off the site. That's according to a story from Fox Business. "This is not political," Hive CEO and co-founder Kevin Guo told the network. "These are not things that are left or right or have any political baggage." 

Will driverless cars really be more than a gimmick? That's the provocative question The Financial Times tech reporter Patrick McGee asks in a reported essay in the paper's magazine. McGee argues that the problem with driverless cars is not so much getting the tech to work, but the economics. The costs associated with owning and running a fleet of robotaxis in sufficient numbers so they are always available on demand are much, much worse than for standard rides haring "marketplaces" such as Uber and Lyft that are based on a gig economy model. As a result, McGee says, companies like Waymo and Cruise could have real trouble making their business models successful. 

About the Author
Jeremy Kahn
By Jeremy KahnEditor, AI
LinkedIn iconTwitter icon

Jeremy Kahn is the AI editor at Fortune, spearheading the publication's coverage of artificial intelligence. He also co-authors Eye on AI, Fortune’s flagship AI newsletter.

See full bioRight Arrow Button Icon
Add Fortune on Google for similar content.

Latest in Newsletters

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Newsletters

Google DeepMind CEO Demis Hassabis (left) stands on a spiral staircase next to Google DeepMind researcher John Jumper.
NewslettersEye on AI
Defections from Google DeepMind prompt questions about Alphabet’s efforts to stay at the forefront of AI
By Jeremy KahnJune 23, 2026
13 hours ago
From Audrey Gelman to Bobbi Brown, second-time female founders are on the rise
NewslettersMPW Daily
From Audrey Gelman to Bobbi Brown, second-time female founders are on the rise
By Emma HinchliffeJune 23, 2026
15 hours ago
Cred founder and CEO Kunal Shah. (Courtesy: Cred)
NewslettersFortune Tech
Meta’s latest reverse acqui-hire: Cred founder Kunal Shah
By Andrew NuscaJune 23, 2026
21 hours ago
Saudi PIF’s governor wants the kingdom to become a global investment center
NewslettersFortune Gulf Brief
Saudi PIF’s governor wants the kingdom to become a global investment center
By Melissa HancockJune 23, 2026
21 hours ago
The CEO with real-time data on 1 in 6 American workers says stop worrying about jobs—and start thinking about tasks
NewslettersCEO Daily
The CEO with real-time data on 1 in 6 American workers says stop worrying about jobs—and start thinking about tasks
By Diane BradyJune 23, 2026
22 hours ago
The WNBA turns 30—and women’s basketball is dreaming bigger than ever
NewslettersMPW Daily
The WNBA turns 30—and women’s basketball is dreaming bigger than ever
By Emma HinchliffeJune 22, 2026
2 days ago

Most Popular

After forcing workers back to the office, Goldman Sachs and JPMorgan Chase are now letting their staff work remotely—but only for the World Cup
Success
After forcing workers back to the office, Goldman Sachs and JPMorgan Chase are now letting their staff work remotely—but only for the World Cup
By Orianna Rosa RoyleJune 23, 2026
19 hours ago
Markets tumble worldwide as Fed resets expectations: $400 billion wiped off SpaceX stock
Banking
Markets tumble worldwide as Fed resets expectations: $400 billion wiped off SpaceX stock
By Jim EdwardsJune 23, 2026
21 hours ago
Current price of oil as of June 23, 2026
Personal Finance
Current price of oil as of June 23, 2026
By Joseph HostetlerJune 23, 2026
19 hours ago
Meet the 2 men putting New York's $300 billion pension fund in play for the first time in 20 years
Investing
Meet the 2 men putting New York's $300 billion pension fund in play for the first time in 20 years
By Nick LichtenbergJune 22, 2026
2 days ago
Former U.S. Secret Service agent says bringing your authentic self to work stifles teamwork: 'You don’t get high performers, you get sloppiness'
Success
Former U.S. Secret Service agent says bringing your authentic self to work stifles teamwork: 'You don’t get high performers, you get sloppiness'
By Sydney LakeJune 21, 2026
3 days ago
Texas and Charlotte used to build huge McMansions—now they're copying the California design tricks they once mocked
Real Estate
Texas and Charlotte used to build huge McMansions—now they're copying the California design tricks they once mocked
By Sydney LakeJune 22, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.