◆ Input
The text to generate audio for. The maximum length is 4096 characters.
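A caller may want to check the 4096-character limit before sending a request. The following is a minimal sketch in standard C++, using `std::string` in place of `FString` so that it compiles outside Unreal. The names `kMaxInputLength` and `IsInputWithinLimit` are hypothetical and not part of `FSpeech`. Measuring length in bytes is an assumption that matches the character count only for ASCII text.

```cpp
#include <cstddef>
#include <string>

// Hypothetical constant mirroring the documented 4096-character limit.
constexpr std::size_t kMaxInputLength = 4096;

// Returns true when the text fits within the documented limit.
// Assumption: size() counts bytes, which equals characters only for ASCII input.
inline bool IsInputWithinLimit(const std::string& Input)
{
    return Input.size() <= kMaxInputLength;
}
```

In Unreal code the same comparison could presumably be made against `FString::Len()`, which counts code units rather than bytes.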
◆ Instructions
| FString FSpeech::Instructions {} |
Additional instructions that control the voice of the generated audio. This field has no effect with the tts-1 or tts-1-hd models.
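Because the default `Model` is "tts-1", `Instructions` are ignored unless the model is changed. A small guard can make that explicit. This is a sketch in standard C++ with a hypothetical helper name, `ModelSupportsInstructions`. It assumes that every model other than the two named above honors instructions, which the documentation does not state.

```cpp
#include <string>

// Assumption: only the two models named in the documentation ignore
// Instructions; any other model name is treated as supporting them.
inline bool ModelSupportsInstructions(const std::string& Model)
{
    return Model != "tts-1" && Model != "tts-1-hd";
}
```

A caller could log a warning, or clear `Instructions`, when this returns false for the selected model.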
◆ Model
| FString FSpeech::Model {"tts-1"} |
◆ Response_Format
| FString FSpeech::Response_Format {"mp3"} |
The format of the returned audio. Supported formats are mp3, opus, aac, flac, wav, and pcm.
◆ Speed
| float FSpeech::Speed {1.0f} |
The speed of the generated audio, from 0.25 to 4.0. The default is 1.0.
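Values outside the documented range can be clamped before the request is built. The sketch below uses plain C++ and the hypothetical names `kMinSpeed`, `kMaxSpeed`, and `ClampSpeed`. Clamping silently is a design choice made here for illustration; it is not documented behavior of `FSpeech`.

```cpp
#include <algorithm>

// Bounds taken from the documented range for Speed.
constexpr float kMinSpeed = 0.25f;
constexpr float kMaxSpeed = 4.0f;

// Returns Speed limited to [kMinSpeed, kMaxSpeed].
inline float ClampSpeed(float Speed)
{
    return std::max(kMinSpeed, std::min(Speed, kMaxSpeed));
}
```

Inside Unreal, `FMath::Clamp` would be the more idiomatic way to write the same operation.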
◆ Voice
| FString FSpeech::Voice {"alloy"} |
The voice to use when generating the audio. Supported voices are alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, and verse.
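Since `Voice` is a free-form string, a typo would only surface when the request is sent. The following sketch checks a value against the voices listed above. It is written in standard C++ with hypothetical names (`kSupportedVoices`, `IsSupportedVoice`), and it assumes that matching is case-sensitive and that the list above is complete.

```cpp
#include <algorithm>
#include <array>
#include <string>

// Voices copied from the documentation above; the service may accept others.
static const std::array<const char*, 11> kSupportedVoices = {
    "alloy", "ash", "ballad", "coral", "echo", "fable",
    "onyx", "nova", "sage", "shimmer", "verse"
};

// Returns true when Voice exactly matches one of the listed names.
// Assumption: comparison is case-sensitive.
inline bool IsSupportedVoice(const std::string& Voice)
{
    return std::any_of(kSupportedVoices.begin(), kSupportedVoices.end(),
        [&Voice](const char* Candidate) { return Voice == Candidate; });
}
```

The same pattern would apply to `Response_Format` with the six formats listed for that field.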
The documentation for this struct was generated from the following file: