◆ Input
The text to generate audio for. The maximum length is 4096 characters.
◆ Model
FString FSpeech::Model {"tts-1"} |
◆ Response_Format
FString FSpeech::Response_Format {"mp3"} |
The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.
◆ Speed
float FSpeech::Speed {1.0f} |
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
◆ Voice
FString FSpeech::Voice {"alloy"} |
The voice to use when generating the audio. Supported voices are alloy, echo, fable, onyx, nova, and shimmer.
The documentation for this struct was generated from the following file: