Streaming Transcription API
Stream audio to Convai’s ASR (automatic speech recognition) engine over a WebSocket connection and receive transcriptions in real time. The API is designed for low-latency voice interactions with your AI characters.
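Streaming audio over a WebSocket usually means sending it as a sequence of small binary frames rather than one large upload. The sketch below shows only that client-side chunking step. The audio format (16 kHz, 16-bit mono PCM), the 100 ms chunk duration, and the helper name `chunk_pcm` are illustrative assumptions, not values taken from this API's specification. Check the Streaming Audio Requirements section for the actual parameters.

```python
from typing import Iterator

# Assumed audio format for illustration only (not confirmed by this document):
# 16 kHz sample rate, 16-bit (2-byte) samples, mono.
SAMPLE_RATE_HZ = 16_000
BYTES_PER_SAMPLE = 2
CHUNK_MS = 100  # assumed chunk duration in milliseconds


def chunk_pcm(audio: bytes, chunk_ms: int = CHUNK_MS) -> Iterator[bytes]:
    """Yield successive fixed-duration slices of raw PCM audio.

    Hypothetical helper: the final slice may be shorter than the others
    when the audio length is not a multiple of the chunk size.
    """
    chunk_bytes = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * chunk_ms // 1000
    for start in range(0, len(audio), chunk_bytes):
        yield audio[start:start + chunk_bytes]


# One second of silence under the assumed format.
silence = bytes(SAMPLE_RATE_HZ * BYTES_PER_SAMPLE)
chunks = list(chunk_pcm(silence))
```

Each element of `chunks` would then be sent as one binary WebSocket message. Smaller chunks tend to reduce latency at the cost of more messages per second.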
Overview
Authentication
Headers
Name | Type | Description
Alternative (Query Parameter)
Connect Session
Description
Session Start Example
Session Close Example
Response (Server Event) Body
Name | Type | Description
WebSocket Event Reference
Event Type | Description | Example Payload
Common Data Fields
Field | Type | Description
Streaming Audio Requirements
Parameter | Specification
Control Messages
Command | Example | Description
Error Handling
Example Error Message
Status Codes
Code | Description
Troubleshooting
Issue | Possible Cause | Resolution
Example Progression (Single Utterance)
Example (End-to-End Streaming Client)
End-to-End Streaming Client - Python
Running Steps
Output Example
cURL - Quick Connectivity Check
Conclusion