Public Attributes
FString	Model {"tts-1"}
FString	Input
FString	Voice {"alloy"}
FString	Instructions {}
FString	Response_Format {"mp3"}
float	Speed {1.0f}
FString	Stream_Format {}

Member Data Documentation

◆ Input

FString FSpeech::Input

The text to generate audio for. The maximum length is 4096 characters.

◆ Instructions

FString FSpeech::Instructions {}

Control the voice of your generated audio with additional instructions. Does not work with tts-1 or tts-1-hd.

◆ Model

FString FSpeech::Model {"tts-1"}

One of the available TTS models: tts-1, tts-1-hd or gpt-4o-mini-tts: https://platform.openai.com/docs/models/tts

◆ Response_Format

FString FSpeech::Response_Format {"mp3"}

The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.

◆ Speed

float FSpeech::Speed {1.0f}

The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.

◆ Stream_Format

FString FSpeech::Stream_Format {}

The format to stream the audio in. Supported formats are sse and audio. sse is not supported for tts-1 or tts-1-hd.

◆ Voice

FString FSpeech::Voice {"alloy"}

The voice to use when generating the audio. Supported voices are alloy, ash, ballad, cedar, coral, echo, fable, marin, nova, onyx, sage, shimmer, and verse.

The documentation for this struct was generated from the following file:

Source/OpenAI/Public/Provider/Types/AudioTypes.h

Public Attributes