• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
Tech

Google’s DeepMind Claims Massive Progress in Synthesized Speech

By
David Meyer
David Meyer
Down Arrow Button Icon
By
David Meyer
David Meyer
Down Arrow Button Icon
September 9, 2016, 4:26 AM ET
Photograph by Getty Images

Researchers at Google’s DeepMind artificial intelligence division claim to have come up with a way of producing much more natural-sounding synthesized speech, compared with the techniques that are currently in use.

Existing text-to-speech (TTS) systems tend to use a system called concatenative TTS, where the audio is generated by recombining fragments of recorded speech. There’s also a technique called parametric TTS that generates speech by passing information through a vocoder, but that sounds even less natural.

So DeepMind has come up with a new technique called WaveNet that learns from the audio it’s fed, and produces raw audio sample-by-sample. To give an idea of how detailed that is, we’re talking at least 16,000 samples per second.

Get Data Sheet, Fortune’s technology newsletter.

A WaveNet is a “neural network”—essentially an artificial brain—that is trained on real waveforms and then uses statistics to choose which samples of that audio to use when “speaking,” piece by piece.

“Building up samples one step at a time like this is computationally expensive, but we have found it essential for generating complex, realistic-sounding audio,” DeepMind’s researchers said in a post about their findings.

That post is well worth checking out, as it includes several clips of the same pieces of text, read out by different speech synthesis techniques. For both U.S. English and Mandarin Chinese, the WaveNet-generated audio is noticeably more realistic than that produced by concatenative TTS.

DeepMind claimed that blind tests with human subjects showed the WaveNet audio to be at least 50% closer to real human speech—though of course such tests are subjective.

For more on DeepMind, watch our video.

DeepMind’s researchers said they would be able to add emotions and accents as inputs, to make the speech sound even more realistic.

Fascinatingly, WaveNets can generate speech without text—or at least, what the neural networks think speech should sound like. As the clips show, these are word-like sounds that mean nothing, and they’re rather creepy.

The same techniques can also be used to create non-speech audio. The post includes clips of the “music” generated by WaveNets that were trained on classical music—again, a good approximation of actual music that might get away with it if you’re not listening too closely.

Again, all this requires an awful lot of computational power and isn’t being used in any real-world applications just yet. But as is always the case with such things, it’s really just a matter of time before computers will be able to sound extremely human.

About the Author
By David Meyer
LinkedIn icon
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

roger
AIMedia
Roger Bennett’s message to A-Rod is one for the country: Soccer has already overtaken baseball in America
By Nick LichtenbergMay 11, 2026
55 minutes ago
Ciridae co-founders Jack Soslow and Jack Weissenberger pose for a picture
Startups & VentureAndreessen Horowitz
Apple and Andreessen Horowitz alums raise $20 million to bring AI to ‘real economy’ businesses
By Jack KubinecMay 11, 2026
55 minutes ago
This upstart stablecoin bank just won a rare OCC charter and raised $40 million. Its CEO is only 25
Bankingstablecoins
This upstart stablecoin bank just won a rare OCC charter and raised $40 million. Its CEO is only 25
By Jeff John RobertsMay 11, 2026
4 hours ago
Sharon Shmueli, left, and Tal Shlomo, right, sit and smile
NewslettersTerm Sheet
Exclusive: Index Ventures backs Frame’s $50 million bet that employees are still cybersecurity’s weakest link
By Lily Mae LazarusMay 11, 2026
4 hours ago
Apple presents its new user interface, "Liquid Glass," at its Worldwide Developers Conference on June 9, 2025 in Cupertino, California. (Photo: Andrej Sokolow/dpa/Getty Images)
NewslettersFortune Tech
Apple just might address those Liquid Glass issues
By Andrew NuscaMay 11, 2026
5 hours ago
voters in line
EconomyElections
Forget the Rust Belt or the Sun Belt. The ‘Wired Belt’ may be the next frontier of American political power
By Jake AngeloMay 11, 2026
8 hours ago

Most Popular

‘This is the way’: Elon Musk endorses Warren Buffett’s famed 5-minute plan to fix the national debt
Economy
‘This is the way’: Elon Musk endorses Warren Buffett’s famed 5-minute plan to fix the national debt
By Jacqueline MunisMay 10, 2026
1 day ago
OpenAI CEO Sam Altman says Gen Z and millennials are using ChatGPT like a 'life advisor'—but college students might be one step ahead
Tech
OpenAI CEO Sam Altman says Gen Z and millennials are using ChatGPT like a 'life advisor'—but college students might be one step ahead
By Sydney LakeMay 10, 2026
1 day ago
'Employers are increasingly turning to degree and GPA' in hiring: Recruiters retreat from ‘talent is everywhere,’ double down on top colleges
Future of Work
'Employers are increasingly turning to degree and GPA' in hiring: Recruiters retreat from ‘talent is everywhere,’ double down on top colleges
By Jake AngeloMay 9, 2026
2 days ago
Red flag test: former CEO explains why he rejects job candidates who say they can start right away
Success
Red flag test: former CEO explains why he rejects job candidates who say they can start right away
By Orianna Rosa RoyleMay 9, 2026
2 days ago
Trump thinks he's flying to Beijing with leverage. China spent 6 years making sure he doesn't have any
Commentary
Trump thinks he's flying to Beijing with leverage. China spent 6 years making sure he doesn't have any
By Steve H. HankeMay 10, 2026
1 day ago
Ted Cruz says the quiet part out loud: Trump accounts are Social Security personal accounts as GOP senator reveals 'dirty little secret'
Politics
Ted Cruz says the quiet part out loud: Trump accounts are Social Security personal accounts as GOP senator reveals 'dirty little secret'
By Jason MaMay 9, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.