Speech-to-Text
Overview
Speech-to-Text APIs: REST, real-time streaming, and batch jobs.
import (
"github.com/Shreehari-Acharya/sarvamai-go/stt"
"github.com/Shreehari-Acharya/sarvamai-go/sttjob"
)
- Call
Initialize to create a job and set options.
- Call
GetUploadLinks to get presigned upload URLs.
- Upload audio files to those URLs.
- Call
Start to trigger processing.
- Poll with
GetStatus until terminal state.
- Call
GetDownloadLinks to fetch output URLs.
| Rule | Result |
|---|
ModelSaaras + Mode* | Valid |
ModelSaarika + Mode* | Validation error |
model omitted + Mode* | Valid in SDK validation (uses saaras:v3 model spec for mode checks) |
| Model context | Allowed language set |
|---|
ModelSaarika | languages.SaarikaLanguages |
ModelSaaras | languages.SaarasLanguages |
model omitted | languages.SaarasLanguages (validation default) |
| Combination | Result |
|---|
WithNumSpeakers(n) + WithDiarization(true) | Valid |
WithNumSpeakers(n) without diarization enabled | Validation error |