1 of 1

Speech To Text API

All the available APIs needed to generate transcript from audio files.

Speech to Text

This endpoint is called for transcribing an audio file.

POST https://api.convai.com/stt/

The user can send the audio file they want to transcribe to this endpoint and get the transcript in the response.

This endpoint also has an option for enabling time stamps, which will provide the timestamp along with the transcript.

Headers

Request Body

{ 
	"result" : "<the complete transcription of the audio file>",
	"details": [
		{
			"id": "<sub-trsancript order number>",
			"start-time" : "<starting time, accurate up to milliseconds>",
			"end-time" : "<ending time, accurate up to milliseconds>",
			"text": "<sub-transcript for the time interval>"
		},
		.
		.

{ 
	"result" : "<the complete transcription of the audio file>"
}

{
    "ERROR": "<Error related to API key>"
}

{
    "ERROR": "<The corresponding error caused by incorrect request data>"
}

Here some ample codeto demonstrate the request format for th endpoint -->

import requests

url = "https://api.convai.com/stt/"

payload={
	"enableTimestamps": "<True or False>"	# Dont need to set if False (default).
}
files=[
  ('file',('audio.wav',open('<path to audio file>','rb'),'audio/wav'))
]
headers = {
  'CONVAI-API-KEY': '<your api key>'
}

response = requests.request("POST", url, headers=headers, data=payload, files=files)

print(response.text)

curl --location --request POST 'https://api.convai.com/stt/' \
--header 'CONVAI-API-KEY: <your api key>' \
--form 'file=@"<path to your audio file>"' \
--form 'enableTimestamps="<True or False>"'

Please note currently the API only supports and format for audio files. Sending audio files of other formats such as aac, flac, etc will result in a rror.

Note: The audio should have a bit depth of at least 16 bits or higher.

Add Words

Adding specific words to be focused upon during the Speech-To-Text processing

POST https://api.convai.com/stt/add-words

This API is called to add new words that the user wants to focus on during the Speech to Text processing. Users can use this endpoint to add uncommon words, that they expect in their audio files and want the Speech-to-Text system to correctly recognize them.

Headers

Request Body

{
    "ERROR": "<Error related to API key>"
}

{
    "ERROR": "<The corresponding error caused by incorrect request data>"
}

{
    "STATUS" : 0
}

Here are some sample codes to demonstrate the request format for the endpoint -->

import requests
import json

url = "https://api.convai.com/stt/add-words"

payload = json.dumps({
  "word": "<new word>"
})
headers = {
  'CONVAI-API-KEY': '<your api key>',
}

response = requests.request("POST", url, headers=headers, data=payload)

print(response.text)

curl --location --request POST 'https://api.convai.com/stt/add-words' \
--header 'CONVAI-API-KEY: <your api key>' \
--data-raw '{
    "word": "<new word>"
}'

Speech To Text API

All the available APIs needed to generate transcript from audio files.

Speech to Text

This endpoint is called for transcribing an audio file.

POST https://api.convai.com/stt/

The user can send the audio file they want to transcribe to this endpoint and get the transcript in the response.

This endpoint also has an option for enabling time stamps, which will provide the timestamp along with the transcript.

Headers

Name

Type

Description

Request Body

Name

Type

Description

{ 
	"result" : "<the complete transcription of the audio file>",
	"details": [
		{
			"id": "<sub-trsancript order number>",
			"start-time" : "<starting time, accurate up to milliseconds>",
			"end-time" : "<ending time, accurate up to milliseconds>",
			"text": "<sub-transcript for the time interval>"
		},
		.
		.

{ 
	"result" : "<the complete transcription of the audio file>"
}

{
    "ERROR": "<Error related to API key>"
}

{
    "ERROR": "<The corresponding error caused by incorrect request data>"
}

Here some ample codeto demonstrate the request format for th endpoint -->

import requests

url = "https://api.convai.com/stt/"

payload={
	"enableTimestamps": "<True or False>"	# Dont need to set if False (default).
}
files=[
  ('file',('audio.wav',open('<path to audio file>','rb'),'audio/wav'))
]
headers = {
  'CONVAI-API-KEY': '<your api key>'
}

response = requests.request("POST", url, headers=headers, data=payload, files=files)

print(response.text)

curl --location --request POST 'https://api.convai.com/stt/' \
--header 'CONVAI-API-KEY: <your api key>' \
--form 'file=@"<path to your audio file>"' \
--form 'enableTimestamps="<True or False>"'

Please note currently the API only supports and format for audio files. Sending audio files of other formats such as aac, flac, etc will result in a rror.

Note: The audio should have a bit depth of at least 16 bits or higher.

Add Words

Adding specific words to be focused upon during the Speech-To-Text processing

POST https://api.convai.com/stt/add-words

Headers

Name

Type

Description

Request Body

Name

Type

Description

{
    "ERROR": "<Error related to API key>"
}

{
    "ERROR": "<The corresponding error caused by incorrect request data>"
}

{
    "STATUS" : 0
}

Here are some sample codes to demonstrate the request format for the endpoint -->

import requests
import json

url = "https://api.convai.com/stt/add-words"

payload = json.dumps({
  "word": "<new word>"
})
headers = {
  'CONVAI-API-KEY': '<your api key>',
}

response = requests.request("POST", url, headers=headers, data=payload)

print(response.text)

curl --location --request POST 'https://api.convai.com/stt/add-words' \
--header 'CONVAI-API-KEY: <your api key>' \
--data-raw '{
    "word": "<new word>"
}'