Text To Speech

Convert text into natural-sounding speech using an API powered by the best of AI technologies.

The Text To Speech (TTS) API is a powerful tool designed to convert text-based input into natural-sounding speech. This API enables developers to integrate speech synthesis capabilities into their applications, providing users with an auditory experience for various use cases.

Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data like MP3.

Pricing

Requests made to the Text To Speech (TTS) API are billed. Prices are based on the number of characters sent to the service to be synthesized into audio.

The pricing for API requests is as follows:

  • Per Request Cost: 3 units base cost per request.

POST https://api.autogon.ai/api/v1/services/text-to-speech/

Headers

NameTypeDescription

Content-Type*

String

application/json

Request Body

NameTypeDescription

text*

String

Text to be processed and converted to audio

language_code

String

Specifies the language used for speech synthesis, defaults to "en" which is English

{
    "success": true,
    "data": {
        "audio": "http://cloud.autogonai.s3.amazonaws.com/tts-f45a5db2-481b-442f-bb92-d16d0b3d7f72.wav"
    }
}

Last updated