audio
Classes
TTSConfigForm
Bases: BaseModel
Configuration for Text-to-Speech (TTS).
Returned by:
- Code Reference > models > audio > Classes > AudioConfigUpdateForm > Attributes > tts
Attributes
OPENAI_PARAMS
Additional parameters for OpenAI TTS requests.
Dict Fields
- model (str, optional): ID of the model to use. Defaults to tts-1 or tts-1-hd.
- voice (str, optional): The voice to use for speech. Options: alloy, echo, fable, nova, onyx, shimmer.
- response_format (str, optional): Format of the returned audio. Options: mp3, opus, aac, flac, wav, pcm.
- speed (float, optional): The speed of the generated audio. Must be between 0.25 and 4.0. Default is 1.0.
- Any other parameters supported by the OpenAI TTS API can be included.
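To illustrate the fields listed for OPENAI_PARAMS, the sketch below builds such a dictionary and checks it against the documented options. The helper `check_openai_params` and the constants `VOICES` and `RESPONSE_FORMATS` are hypothetical and not part of the audio module; only the field names, option lists, and the speed range are taken from the description.

```python
# Hypothetical helper for illustration; not part of the audio module.
VOICES = {"alloy", "echo", "fable", "nova", "onyx", "shimmer"}
RESPONSE_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}


def check_openai_params(params: dict) -> list[str]:
    """Return a list of problems found in an OPENAI_PARAMS dict (empty if none)."""
    problems = []
    voice = params.get("voice")
    if voice is not None and voice not in VOICES:
        problems.append(f"unknown voice: {voice!r}")
    fmt = params.get("response_format")
    if fmt is not None and fmt not in RESPONSE_FORMATS:
        problems.append(f"unknown response_format: {fmt!r}")
    speed = params.get("speed")
    if speed is not None and not 0.25 <= speed <= 4.0:
        problems.append(f"speed out of range: {speed!r}")
    # Other keys are passed through unchecked, since any other parameter
    # supported by the OpenAI TTS API may be included.
    return problems


openai_params = {
    "model": "tts-1-hd",
    "voice": "nova",
    "response_format": "mp3",
    "speed": 1.25,
}
```

Calling `check_openai_params(openai_params)` on the dictionary above returns an empty list, because every documented field holds a listed value.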
STTConfigForm
Bases: BaseModel
Configuration for Speech-to-Text (STT).
Returned by:
- Code Reference > models > audio > Classes > AudioConfigUpdateForm > Attributes > stt
Attributes
ENGINE
The STT engine to use (e.g. 'openai', 'deepgram', 'azure', 'mistral', or empty for local Whisper).
SUPPORTED_CONTENT_TYPES
List of supported content types (MIME types) for uploads.
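The two STT attributes can be pictured with a small sketch of how an upload's MIME type might be checked against SUPPORTED_CONTENT_TYPES. The function `is_supported` is hypothetical. Wildcard entries such as `audio/*` and the use of an empty string for the "empty" engine value are assumptions made for this example, not behaviour confirmed by this page.

```python
from fnmatch import fnmatch


def is_supported(content_type: str, supported: list[str]) -> bool:
    """Return True if content_type matches any entry in supported."""
    # Drop parameters such as "; codecs=opus" before matching.
    base = content_type.split(";", 1)[0].strip().lower()
    # Assumption: entries may use shell-style wildcards like "audio/*".
    return any(fnmatch(base, pattern.lower()) for pattern in supported)


stt_settings = {
    "ENGINE": "",  # assumed empty string, selecting local Whisper
    "SUPPORTED_CONTENT_TYPES": ["audio/*", "video/webm"],
}
```

With these settings, `audio/wav` and `video/webm; codecs=vp9` would be accepted, while `video/mp4` would not.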
AudioConfigUpdateForm
Bases: BaseModel
Form for updating audio configuration (TTS and STT).
Used by:
- Code Reference > routers > audio > Classes > AudioClient > Functions > update_config
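As a rough picture of the data this form carries, the sketch below assembles a payload with a `tts` and an `stt` section and serializes it to JSON. Only the attributes documented on this page are shown. The real TTSConfigForm and STTConfigForm may define, and require, additional fields, and the name `update_payload` is hypothetical.

```python
import json

# Illustrative payload only: the real forms may require more fields
# than the attributes documented on this page.
update_payload = {
    "tts": {"OPENAI_PARAMS": {"voice": "alloy", "speed": 1.0}},
    "stt": {"ENGINE": "openai", "SUPPORTED_CONTENT_TYPES": ["audio/wav"]},
}

# Serialize the payload, for example as a request body.
body = json.dumps(update_payload)
```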