You pay as you go. There is a trial period in which you can generate 100 000 characters of text for free. After that, you will have to pay 15 USD for each 100 000 additional characters. This is about 33 standard pages of text or 10 minutes of audio.
Voxabot provides the following production services:
Please contact us for more information about our production and post-production TTS services.
Also if the traditional voice over works better, we have competitive rates for professional human voice over.
Sure, you can find it here:
Yes you can!
Although the SSML standard covers virtually all aspects of Text-To-Speech, there are some areas which the SSML standard does not specify, and each TTS engine and voice can have their one custom SSML tags. This is particularly true with the newest neural voices which include additional SSML elements for controlling the voice output. For example, Microsoft Azure has some voices which support tags for expressing emotions like cheerfulness, empathy, and calm, or optimize the voice for different scenarios like customer service, and voice assistant.
To add a custom tag do the following:
Yes, you can!
To import SSML code into Voxabot editor follow these steps:
To export SSML code from the Voxabot editor follow these steps:
SSML stands for Speech Synthesis Markup Language Specification. SSML is the standard markup language for generating synthetic speech. The TTS engines make their default interpretation of the written texts, and you can use SSML to customize a series of elements to control aspects of speech such as pronunciation, volume, pitch, rate, etc.
The Voxabot SSML Editor helps you to insert the SSML codes automatically in the text script and hear the generated audio immediately.
For more information read the Wikipedia article: https://en.wikipedia.org/wiki/Speech_Synthesis_Markup_Language
Contact us here:
We have connections to Google, AWS Polly, and Microsoft Azure. Click on the links below for information from that cloud provider. Note that they are constantly evolving their offerings and we are constantly adding their new features to our service.
Unfortunately, Amazon Polly limits the use of NTTS to specific voices, regions, and languages as well as limiting the allowed tags--this prevents us from implementing it faithfully. Azure and Google’s Neural TTS are available instead.
Generally speaking, standard voices have been created using speech synthesis technology available before 2016 which included multiple stages, such as a text analysis frontend, an acoustic model, and an audio synthesis module. Neural voices have been created using similar technologies but make use of neural networks and deep learning technologies (Wavenet, Tacotron, VoiceLoop) which are faster to produce and deliver a more human-like sound.
These are the steps you should follow for creating an mp3 audio file from written text using the TTS engine default values:
You can see how it works here:App.Voxabot Introduction - YouTube
Text-To-Speech, abbreviated TTS, is the artificial production of human speech from written texts. Currently Voxabot Editor generates human-like voices using the TTS engines from Amazon, Microsoft and Google. This means that you have a great choice of standard voices and state-of-the-art neural voices to suit your needs in many languages.