IBM Watson Speech to Text offers a variety of options for transcribing audio in various languages and formats:
WebSockets – establish a persistent connection over the WebSocket protocol for continuous transcription
Sessionless – transcribe audio without the overhead of establishing and maintaining a session
Sessions – create long multi-turn exchanges with the service or establish multiple parallel conversations with a particular instance of the service
Asynchronous – provides a non-blocking HTTP interface for transcribing audio. You can register a callback URL to be notified of job status and results, or you can poll the service to learn job status and retrieve results manually.
See the Getting Started topic to learn how to get started with Speech to Text and other Watson services. For more Speech to Text details and examples, see the API reference and the documentation.