Description:
Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk and to build entirely new categories of speech-enabled products. Polly’s Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural-sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries.
In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a Newscaster reading style that is tailored to news narration use cases, and a Conversational speaking style that is ideal for two-way communication like telephony applications.
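As a rough illustration of how the two speaking styles are requested, the Python sketch below wraps text in the SSML `amazon:domain` tag that Polly uses for Newscaster (`news`) and Conversational (`conversational`) delivery. The function name `styled_ssml` and the style keys are hypothetical and are not taken from this script. The sketch assumes the resulting string is sent to Polly as SSML with a Neural voice that supports the chosen style.

```python
from xml.sax.saxutils import escape

# Map of friendly style names (hypothetical keys) to Polly SSML domain names.
STYLE_DOMAINS = {
    "newscaster": "news",
    "conversational": "conversational",
}


def styled_ssml(text: str, style: str) -> str:
    """Return an SSML document asking for the given Neural speaking style."""
    if style not in STYLE_DOMAINS:
        raise ValueError(f"unknown speaking style: {style!r}")
    domain = STYLE_DOMAINS[style]
    # escape() protects characters such as & and < that would break the XML.
    return (
        f'<speak><amazon:domain name="{domain}">'
        f"{escape(text)}"
        f"</amazon:domain></speak>"
    )


if __name__ == "__main__":
    print(styled_ssml("Markets closed higher today.", "newscaster"))
```

Not every Neural voice supports both styles, so the voice and style combination should be checked against the Polly documentation before use.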
Online Demo
Features of Amazon Polly
- Support for 33+ Languages and Dialects
- Support for 87+ Different Voices and Accents
- Powered By:
  - Natural sounding voices (Neural Voices)
  - Advanced Deep Learning Technology
- Various Combinations of SSML effects for all Voices
- Mix up to 20 voices in a single synthesis task
- Synthesize up to 60K characters of mixed-voice text in just a few clicks
- Powerful Sound Studio that supports 2 audio formats
- Add Background Music to your text
- Merge synthesis results with similar formats
- Multiple Audio Output Formats
- Store & redistribute speech easily via social media
- Near Real-time text synthesis
- Customize & control speech output
- Optimize Your Streaming Audio
- Adjust Speech Rate, Pitch, and Loudness
- Adjust Speaking Emphasis
- Pronounce digits/dates/words/abbreviations properly
- Add word/phrase replacement effect
- Mute/Beep Out any part of text/sentence
- Store results in:
  - Local Server
  - Amazon S3
  - Wasabi Storage
- Conveniently Share or Download synthesis results
- Fully Responsive Interface
- Closely Monitor Estimated Spending for Cloud TTS Services
- One Click Auto Update Option
- Developed with PHP 7.4.x and Laravel 8.4.x
- Detailed and Comprehensive Documentation
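Several of the speech-control features listed above (rate, pitch and loudness, emphasis, pronunciation of digits and dates, word replacement, muting) correspond to standard SSML tags that Polly accepts. The Python sketch below shows one way such markup could be assembled. All helper names are hypothetical and do not describe how this script builds its requests. Muting is approximated here by a timed pause, and the beep effect is not covered.

```python
from xml.sax.saxutils import escape, quoteattr


def prosody(text, rate="medium", pitch="medium", volume="medium"):
    """Adjust speech rate, pitch, and loudness for a span of text."""
    return (
        f"<prosody rate={quoteattr(rate)} pitch={quoteattr(pitch)} "
        f"volume={quoteattr(volume)}>{escape(text)}</prosody>"
    )


def emphasis(text, level="moderate"):
    """Adjust speaking emphasis (strong, moderate, or reduced)."""
    return f"<emphasis level={quoteattr(level)}>{escape(text)}</emphasis>"


def say_as(text, interpret_as):
    """Pronounce digits, dates, and similar tokens in a specific way."""
    return f"<say-as interpret-as={quoteattr(interpret_as)}>{escape(text)}</say-as>"


def substitute(text, alias):
    """Speak `alias` in place of the written `text` (word/phrase replacement)."""
    return f"<sub alias={quoteattr(alias)}>{escape(text)}</sub>"


def mute(duration_ms):
    """Insert silence of the given length in milliseconds."""
    return f'<break time="{int(duration_ms)}ms"/>'


def speak(*fragments):
    """Join SSML fragments into one document."""
    return "<speak>" + " ".join(fragments) + "</speak>"


if __name__ == "__main__":
    print(speak(
        prosody("Welcome back.", rate="slow", volume="loud"),
        "Your code is",
        say_as("1234", "digits"),
        mute(500),
        substitute("W3C", "World Wide Web Consortium"),
    ))
```

Support for individual tags varies between Standard and Neural voices, so the combination in use should be verified against the Polly SSML documentation.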
Cloud Vendor Text to Speech Prices
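Cloud TTS services are generally billed per character synthesized, so an estimated spend can be derived from the character count and a per-million-character rate. The Python sketch below shows that arithmetic only. The function name and the rates in `EXAMPLE_RATES` are illustrative placeholders, not figures from this listing, and should be replaced with the vendor's current published prices.

```python
from decimal import Decimal

# Placeholder rates in USD per one million characters (assumed values,
# shown only to make the example concrete; check current vendor pricing).
EXAMPLE_RATES = {
    "standard": Decimal("4.00"),
    "neural": Decimal("16.00"),
}


def estimate_cost(char_count: int, usd_per_million_chars: Decimal) -> Decimal:
    """Estimate the cost in USD of synthesizing `char_count` characters."""
    if char_count < 0:
        raise ValueError("char_count must be non-negative")
    cost = Decimal(char_count) * usd_per_million_chars / Decimal(1_000_000)
    # Round to four decimal places for display.
    return cost.quantize(Decimal("0.0001"))


if __name__ == "__main__":
    for voice_type, rate in EXAMPLE_RATES.items():
        print(voice_type, estimate_cost(60_000, rate))
```

Decimal is used instead of float so that small per-request amounts accumulate without binary rounding drift.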
Notes
Please note: for the script to work correctly, you need a valid AWS account.
Latest Changes
22.04.2022 - 2.0
- New: Full redesign with Laravel Framework
- New: Powerful integrated Sound Studio
- New: Mixing up to 20 voices in a single synthesize task
16.05.2020 - 1.5
- Update: Standard Voices character limit increase
- Update: Neural Voices character limit increase
- Update: Direct Keys include simplified
- Update: Documentation
17.03.2020 - 1.4
- Update: Support for raw PCM audio stream formats added for Large Text
- Fix: JS bug fixes
30.01.2020 - 1.3
- Update: Support for Neural TTS added, provides high quality life like voices
- Update: Support for Large Text added, output results are directly sent to Amazon S3
- Update: Additional voice effects are added for Neural TTS
- Update: Additional voice effects are added for Standard TTS
- Fix: Minor bug fixes
16.11.2019 - 1.2
- Update: AWS PHP SDK v3 is now included with the package
- Update: App can now run directly with only IAM Access and Secret Access Keys
- Update: Additional voice effects are added
11.11.2019 - 1.1
- Fix: Audio Player play/pause fix during direct play
- Fix: Additional Settings dropdown sign fix
29.10.2019 - 1.0
- Initial Release