ibm-watson-cognitive Speech to Text


IBM Watson Speech to Text transcribes audio in multiple languages and formats through several interfaces:

  • WebSockets – establish a persistent connection over the WebSocket protocol for continuous transcription

  • Sessionless – transcribe audio without the overhead of establishing and maintaining a session

  • Sessions – create long multi-turn exchanges with the service or establish multiple parallel conversations with a particular instance of the service

  • Asynchronous – transcribe audio through a non-blocking HTTP interface. Register a callback URL to be notified of job status and results, or poll the service to check job status and retrieve results yourself.
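
To make two of the options above concrete, here is a minimal Python sketch that uses only the standard library. It rests on several assumptions that are not part of this topic: SERVICE_URL is a placeholder to replace with the URL from your own service credentials, HTTP basic authentication is assumed, the sessionless endpoint is assumed to live at /v1/recognize, and the status strings "completed" and "failed" are assumed to mark the end of an asynchronous job. The helper names build_recognize_request and poll_until_done are hypothetical. Check the API reference before relying on any of these details.

```python
import base64
import time
import urllib.request

# Assumption: placeholder URL; use the one from your own service credentials.
SERVICE_URL = "https://example.invalid/speech-to-text/api"


def build_recognize_request(audio, username, password, content_type="audio/wav"):
    """Build (but do not send) a sessionless transcription request.

    The whole audio payload travels in the body of a single POST,
    so no session has to be created or cleaned up.
    """
    credentials = f"{username}:{password}".encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    return urllib.request.Request(
        url=f"{SERVICE_URL}/v1/recognize",  # assumed endpoint path
        data=audio,
        method="POST",
        headers={
            "Content-Type": content_type,
            "Authorization": f"Basic {token}",
        },
    )


def poll_until_done(fetch_status, interval_seconds=1.0, max_attempts=10):
    """Poll an asynchronous job until it reaches a terminal state.

    fetch_status is any callable that returns the job as a dict with a
    "status" key, for example a function that issues a GET for the job
    and decodes the JSON response.
    """
    for attempt in range(max_attempts):
        job = fetch_status()
        # Assumed terminal states; other values mean the job is still running.
        if job["status"] in ("completed", "failed"):
            return job
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)
    raise TimeoutError(f"job not finished after {max_attempts} attempts")
```

To send the sessionless request, pass the returned object to urllib.request.urlopen and decode the JSON response body. Polling is the simpler of the two asynchronous patterns; registering a callback URL avoids the repeated requests but requires a publicly reachable endpoint on your side.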

See the Getting Started topic to learn how to set up Speech to Text and other Watson services. For more Speech to Text details and examples, see the API reference and the documentation.