The TTS engine takes text input from your application and synthesizes speech output. It then sends the speech output to the speakers.
Your application communicates with the voice text object, passing it TTS codes and control codes that adjust the speed, pitch, or cadence of the spoken text. The voice text object controls the TTS engine, which sends synthesized speech output to the speakers. If you create a TTS notification sink, the voice text object sends your application detailed information about the timing of the text as it is played.
The following illustration shows a basic TTS system.
The voice text object exposes methods your application can use to implement TTS. Its interfaces provide a high-level API that isolates applications from the details of the TTS engine and the audio destination.
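The layering described above can be sketched in a few lines of Python. This is an illustrative model only, not the actual speech API: the class names `TTSEngine` and `VoiceText`, the `speak` method, the `speed` and `pitch` attributes, and the callback-style sink are all hypothetical stand-ins chosen for this example. The stand-in engine records what it would send to the speakers instead of producing audio.

```python
from typing import Callable, List, Optional, Tuple


class TTSEngine:
    """Hypothetical stand-in for a TTS engine.

    Instead of producing audio, it records each word it would
    send to the speakers, along with the settings in effect.
    """

    def __init__(self) -> None:
        self.speaker_output: List[Tuple[str, float, float]] = []

    def synthesize(self, word: str, speed: float, pitch: float) -> None:
        self.speaker_output.append((word, speed, pitch))


class VoiceText:
    """Hypothetical high-level object that hides the engine from the application."""

    def __init__(
        self,
        engine: TTSEngine,
        sink: Optional[Callable[[int, str], None]] = None,
    ) -> None:
        self._engine = engine
        self._sink = sink      # optional notification sink (assumed callback form)
        self.speed = 1.0       # control settings the application may adjust
        self.pitch = 1.0

    def speak(self, text: str) -> None:
        # Hand each word to the engine, then notify the sink (if any)
        # with the word's position so the application can track progress.
        for index, word in enumerate(text.split()):
            self._engine.synthesize(word, self.speed, self.pitch)
            if self._sink is not None:
                self._sink(index, word)


if __name__ == "__main__":
    progress: List[Tuple[int, str]] = []
    voice = VoiceText(TTSEngine(), sink=lambda i, w: progress.append((i, w)))
    voice.speed = 1.5
    voice.speak("Hello from the application")
    print(progress)
```

In this sketch the application touches only `VoiceText`; the engine and the audio destination could be swapped without changing the calling code, which is the isolation the high-level interfaces are meant to provide.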