Amazon Polly

Deploy high-quality, natural-sounding human voices in dozens of languages

Introducing Amazon Polly

Amazon Polly uses deep learning technologies to synthesize natural-sounding human speech, so you can convert articles to speech. With dozens of lifelike voices across a broad set of languages, use Amazon Polly to build speech-activated applications.

Customizable speech output

Customize and control speech output that supports lexicons and Speech Synthesis Markup Language (SSML) tags.

Speech redistribution

Store and redistribute speech in standard formats like MP3 and OGG.

Deliver lifelike voices

Quickly deliver lifelike voices and conversational user experiences in consistently fast response times.

Use cases

Add speech to applications with a global audience, such as RSS feeds, websites, or videos.

Learn more about speech generation

Store and replay Amazon Polly speech output to prompt callers through interactive or automated voice response systems.

Learn more about neural text-to-speech (TTS)

Use SSML, a W3C standard XML-based markup language for speech synthesis applications, to support common SSML tags for phrasing, emphasis, and intonation.

Learn more about SSML


Explore more of AWS