POST
/
api
/
ai
/
speech-text
curl --request POST \
  --url https://api.worqhat.com/api/ai/speech-text \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multi-part/form-data' \
  --data '{
  "audio": "/path/to/audio.mp3"
}'
{
  "status": "success",
  "data": {
    "text": "This is a sample text to be converted to speech.",
    "speaker_labels": [
      {
        "speaker": "A",
        "text": "This is a sample text to be converted to speech."
      }
    ]
  },
  "processing_time": 888,
  "processing_id": "4b139bfe-1f71-4345-b539-fa6c1700f166",
  "processing_count": 12
}

Authorizations

Authorization
string
headerrequired

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multi-part/form-data
audio
string

The audio file to be sent as input. The audio file can be of any format.

Response

200 - application/json
status
string

Status indicating the success of the Audio Conversion Process. In case you have an error for multipart/form-data requests, try removing the Content-Type header.

data
object
processing_time
integer

The time taken to process the request, in milliseconds.

processing_id
string

A universally unique identifier for the request. This can be used to track the request in the logs.

processing_count
integer

The number of times the request has been processed. This is what is considered in the Billing Process. This is either the number of times the image is processed or the number of words that the server processes.