Speech To Text
Convert speech into text using an API powered by the best of AI technologies.
Last updated
Convert speech into text using an API powered by the best of AI technologies.
Last updated
The Speech To Text (STT) API is a robust tool designed to convert spoken language into written text. This API empowers developers to integrate speech recognition capabilities into their applications, enabling users to interact with spoken language for various purposes.
Requests made to the Speech To Text (STT) API are billed. Prices are based on the number of characters sent to the service to be synthesized into audio.
The pricing for API requests is as follows:
Per Request Cost: 3 units base cost per request.
POST
https://api.autogon.ai/api/v1/services/speech-to-text/
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Content-Type*
String
application/json
audio*
File
Audio to be processed and converted to text
language_code
String
Specifies the language spoken in the audio, defaults to "en"