Statistical Parametric Synthesis

Graph

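Model the acoustic parameters of speech (e.g. spectrum, F0, duration)
Synthesize audio from the predicted parameters
Hidden Markov Model (HMM) - the statistical approach that preceded neural systems such as Tacotron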
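
To make "audio from parameters" concrete, here is a minimal sketch, assuming the only parameters are a per-frame F0 and amplitude. The function name, the 5 ms frame length, and the single sine oscillator are illustrative assumptions; a real parametric vocoder also uses a spectral envelope and a noise source for unvoiced frames.

```python
import math

def synthesize_from_parameters(f0_per_frame, amp_per_frame,
                               sample_rate=16000, frame_ms=5.0):
    """Toy parametric synthesis (hypothetical helper, not a real vocoder).

    Each frame carries an F0 value in Hz (0 means unvoiced) and an amplitude.
    A single sine oscillator is driven by those values, frame by frame.
    """
    samples_per_frame = int(sample_rate * frame_ms / 1000)
    phase = 0.0
    samples = []
    for f0, amp in zip(f0_per_frame, amp_per_frame):
        for _ in range(samples_per_frame):
            if f0 > 0:
                # Voiced frame: advance the phase so the tone stays
                # continuous across frame boundaries.
                phase += 2.0 * math.pi * f0 / sample_rate
                samples.append(amp * math.sin(phase))
            else:
                # Unvoiced frame: this toy emits silence instead of noise.
                samples.append(0.0)
    return samples
```

Calling `synthesize_from_parameters([100.0, 100.0, 0.0], [0.5, 0.5, 0.5])` returns 240 samples at the default settings: two voiced frames followed by one silent frame.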

HMM based TTS pipeline

Text
Phonemes
Duration Model (HMM)
Acoustic Model (HMM)
Vocoder - Parametric
Waveform

graph TD
    A[Text] -->|G2P| B[Phonemes]
    B -->|State durations| C[Duration Model - HMM]
    C -->|Timed states| D[Acoustic Model - HMM]
    D -->|Spectral features| E[Vocoder]
    E -->|Audio signal| F[Waveform]
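
The stages up to the vocoder input can be sketched as below. This is a toy under stated assumptions: the lexicon, the three-state-per-phoneme table, and all numbers are hypothetical, and each state simply repeats its mean F0 for its mean duration. Real HMM systems model full spectral vectors and smooth the trajectory using dynamic (delta) features.

```python
# Hypothetical toy tables, not taken from any real voice.
LEXICON = {"hi": ["HH", "AY"]}

# Per-phoneme HMM states: (mean duration in frames, mean F0 in Hz; 0 = unvoiced)
STATE_TABLE = {
    "HH": [(2, 0.0), (3, 0.0), (2, 0.0)],
    "AY": [(4, 120.0), (6, 110.0), (4, 100.0)],
}

def g2p(text):
    """Text -> phonemes via dictionary lookup (stand-in for a G2P module)."""
    phonemes = []
    for word in text.lower().split():
        phonemes.extend(LEXICON[word])
    return phonemes

def duration_model(phonemes):
    """Phonemes -> timed states: (phoneme, state index, number of frames)."""
    timed_states = []
    for phoneme in phonemes:
        for index, (n_frames, _f0) in enumerate(STATE_TABLE[phoneme]):
            timed_states.append((phoneme, index, n_frames))
    return timed_states

def acoustic_model(timed_states):
    """Timed states -> frame-level F0 track, repeating each state's mean."""
    frames = []
    for phoneme, index, n_frames in timed_states:
        _duration, f0 = STATE_TABLE[phoneme][index]
        frames.extend([f0] * n_frames)
    return frames

def text_to_features(text):
    """Chain the three stages; the result is what a vocoder would consume."""
    return acoustic_model(duration_model(g2p(text)))
```

For the input `"hi"`, `text_to_features` yields 21 frames: 7 unvoiced frames for `HH`, then a falling F0 contour for `AY`. A parametric vocoder would turn such frame-level features into the waveform.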

Example

HTS (HMM-based Speech Synthesis System)