◆ Input
The text to generate audio for. The maximum length is 4096 characters.
◆ Instructions
FString FSpeech::Instructions {} |
Control the voice of your generated audio with additional instructions. Does not work with tts-1 or tts-1-hd.
◆ Model
FString FSpeech::Model {"tts-1"} |
◆ Response_Format
FString FSpeech::Response_Format {"mp3"} |
The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.
◆ Speed
float FSpeech::Speed {1.0f} |
The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
◆ Voice
FString FSpeech::Voice {"alloy"} |
The voice to use when generating the audio. Supported voices are alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, and verse.
The documentation for this struct was generated from the following file: