AWS Amazon Polly – Text to Speech Converter

November 28, 2024

Description:

Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk and to build entirely new categories of speech-enabled products. Polly’s Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural-sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries.

In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a Newscaster reading style that is tailored to news narration use cases, and a Conversational speaking style that is ideal for two-way communication like telephony applications.
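As a rough illustration of the speaking styles described above, the sketch below builds the parameters for a Polly `SynthesizeSpeech` request that uses the Neural engine and, optionally, the Newscaster or Conversational style. It assumes the AWS SDK for Python (boto3) and the voice `Joanna`; the helper names `build_request` and `synthesize_to_file` are hypothetical and are not part of this product.

```python
from xml.sax.saxutils import escape

# Maps the style names used in the description to values of the
# SSML <amazon:domain> tag.
STYLES = {"newscaster": "news", "conversational": "conversational"}


def build_request(text, voice_id="Joanna", style=None, output_format="mp3"):
    """Return keyword arguments for Polly's SynthesizeSpeech operation."""
    body = escape(text)  # escape &, <, > so the text is valid inside SSML
    if style is not None:
        body = '<amazon:domain name="{}">{}</amazon:domain>'.format(
            STYLES[style], body)
    return {
        "Engine": "neural",
        "VoiceId": voice_id,  # assumed voice; pick one that supports the style
        "OutputFormat": output_format,
        "TextType": "ssml",
        "Text": "<speak>{}</speak>".format(body),
    }


def synthesize_to_file(text, path, **options):
    """Call Polly and save the audio (needs boto3, AWS credentials, a region)."""
    import boto3  # imported lazily so build_request works without the SDK

    polly = boto3.client("polly")
    response = polly.synthesize_speech(**build_request(text, **options))
    with open(path, "wb") as audio_file:
        audio_file.write(response["AudioStream"].read())
```

A call such as `synthesize_to_file("Good evening.", "news.mp3", style="newscaster")` would then write an MP3 file. Not every Neural voice supports both styles, so check the voice list for your region first.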

Features of Amazon Polly

Support for 33+ Languages and Dialects
Support for 87+ Different Voices and Accents
Powered By:
- Natural-sounding voices (Neural Voices)
- Advanced Deep Learning Technology
Various combinations of SSML effects for all Voices
Mix up to 20 voices in a single synthesis task
Synthesize up to 60K characters of mixed-voice text in just a few clicks
Powerful Sound Studio that supports 2 audio formats
Add Background Music to your text
Merge synthesis results that share the same format
Multiple Audio Output Formats:
Store & redistribute speech easily via social media
Near real-time text synthesis
Customize & control speech output
Optimize Your Streaming Audio
Adjust Speech Rate, Pitch, and Loudness
Adjust Speaking Emphasis
Pronounce digits/dates/words/abbreviations properly
Add word/phrase replacement effect
Mute/Beep Out any part of text/sentence
Store results in:
- Local Server
- Amazon S3
- Wasabi Storage
Conveniently share or download synthesis results
Fully Responsive Interface
Closely Monitor Estimated Spending for Cloud TTS Services
One Click Auto Update Option
Developed with PHP 7.4.x and Laravel 8.4.x
Detailed and Comprehensive Documentation
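
The speech controls in the feature list (rate, pitch, loudness, emphasis, pronunciation of dates and digits, word replacement, beeping out, pauses) correspond to standard SSML tags that Amazon Polly accepts. The sketch below shows how such markup could be assembled. It is only an illustration: the function names are hypothetical, and it makes no claim about how this product generates its SSML internally.

```python
from xml.sax.saxutils import escape, quoteattr


def prosody(text, rate="medium", pitch="medium", volume="medium"):
    """Adjust speech rate, pitch, and loudness with <prosody>."""
    return "<prosody rate={} pitch={} volume={}>{}</prosody>".format(
        quoteattr(rate), quoteattr(pitch), quoteattr(volume), escape(text))


def emphasis(text, level="moderate"):
    """Adjust speaking emphasis (levels: strong, moderate, reduced)."""
    return "<emphasis level={}>{}</emphasis>".format(
        quoteattr(level), escape(text))


def say_as(text, interpret_as, date_format=None):
    """Control how digits, dates, and similar tokens are pronounced."""
    fmt = " format={}".format(quoteattr(date_format)) if date_format else ""
    return "<say-as interpret-as={}{}>{}</say-as>".format(
        quoteattr(interpret_as), fmt, escape(text))


def substitute(text, alias):
    """Speak `alias` in place of the written word or phrase."""
    return "<sub alias={}>{}</sub>".format(quoteattr(alias), escape(text))


def beep(text):
    """Beep out a word or phrase using the 'expletive' interpretation."""
    return say_as(text, "expletive")


def pause(seconds):
    """Insert a silent pause of the given length."""
    return '<break time="{}s"/>'.format(seconds)


def speak(*parts):
    """Join SSML fragments into one <speak> document."""
    return "<speak>{}</speak>".format(" ".join(parts))
```

For example, `speak(prosody("Hello", rate="slow"), say_as("12/25/2024", "date", "mdy"))` yields a document that can be sent to Polly with the text type set to SSML. Tag support differs between engines (for instance, `<emphasis>` and the pitch attribute are documented as Standard-voice features), so check the supported SSML tags for the engine you use.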

Cloud Vendor Text to Speech Prices

Notes

Please note: for the script to work correctly, you need a valid AWS account.

Latest Changes

22.04.2022 - 2.0
     - New: Full redesign with Laravel Framework
     - New: Powerful integrated Sound Studio
     - New: Mixing up to 20 voices in a single synthesis task

16.05.2020 - 1.5
     - Update: Standard Voices character limit increase
     - Update: Neural Voices character limit increase
     - Update: Direct Keys include simplified
     - Update: Documentation

17.03.2020 - 1.4
     - Update: Support for raw PCM audio stream formats added for Large Text
     - Fix: JS bug fixes

30.01.2020 - 1.3
     - Update: Support for Neural TTS added, providing high-quality, lifelike voices
     - Update: Support for Large Text added, output results are directly sent to Amazon S3
     - Update: Additional voice effects are added for Neural TTS
     - Update: Additional voice effects are added for Standard TTS
     - Fix: Minor bug fixes

16.11.2019 - 1.2
     - Update: AWS PHP SDK v3 is now included with the package
     - Update: App can now run directly with only IAM Access and Secret Access Keys
     - Update: Additional voice effects are added

11.11.2019 - 1.1
     - Fix: Audio Player play/pause fix during direct play
     - Fix: Additional Settings dropdown sign fix

29.10.2019 - 1.0
     - Initial Release
