Section 1: Automatic speech recognition: Background.- Feature extraction: basic frontend.- Acoustic model: Gaussian mixture hidden Markov model.- Language model: stochastic N-gram.- Historical reviews of speech recognition research: 1st, 2nd, 3rd, 3.5th, and 4th generations.- Section 2: Advanced feature extraction and transformation.- Unsupervised feature extraction.- Discriminative feature transformation.- Section 3: Advanced acoustic modeling.- Conditional random field (CRF) and hidden conditional random field (HCRF).- Deep-Structured CRF.- Semi-Markov conditional random field.- Deep stacking models.- Deep neural network - hidden Markov hybrid model.- Section 4: Advanced language modeling.- Discriminative Language model.- Log-linear language model.- Neural network language model.
This book summarizes the recent advancement in the field of automatic speech recognition with a focus on discriminative and hierarchical models. This will be the first automatic speech recognition book to include a comprehensive coverage of recent developments such as conditional random field and deep learning techniques. It presents insights and theoretical foundation of a series of recent models such as conditional random field, semi-Markov and hidden conditional random field, deep neural network, deep belief network, and deep stacking models for sequential learning. It also discusses practical considerations of using these models in both acoustic and language modeling for continuous speech recognition.
Discusses important theoretical foundation and practical considerations of using discriminative and hierarchical models for speech recognition
Reviews past and present work on discriminative and hierarchical models for both acoustic and language modeling
Analyzes the research direction and trends towards establishing future-generation speech recognition