Xu Shao
Title
Cited by
Year
An audio-visual corpus for speech perception and automatic speech recognition
M Cooke, J Barker, S Cunningham, X Shao
The Journal of the Acoustical Society of America 120 (5), 2421-2424, 2006
Cited by: 1276; Year: 2006
Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model
B Milner, X Shao
Seventh International Conference on Spoken Language Processing, 2002
Cited by: 82; Year: 2002
Prediction of fundamental frequency and voicing from mel-frequency cepstral coefficients for unconstrained speech reconstruction
B Milner, X Shao
IEEE transactions on audio, speech, and language processing 15 (1), 24-33, 2006
Cited by: 69; Year: 2006
Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end
B Milner, X Shao
Speech Communication 48 (6), 697-715, 2006
Cited by: 57; Year: 2006
Stream weight estimation for multistream audio–visual speech recognition in a multispeaker environment
X Shao, J Barker
Speech Communication 50 (4), 337-353, 2008
Cited by: 51; Year: 2008
Pitch prediction from MFCC vectors for speech reconstruction
X Shao, B Milner
2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004
Cited by: 45; Year: 2004
Energetic and informational masking effects in an audiovisual speech recognition system
J Barker, X Shao
IEEE transactions on audio, speech, and language processing 17 (3), 446-458, 2009
Cited by: 23; Year: 2009
Predicting formant frequencies from MFCC vectors [speech recognition applications]
J Darch, B Milner, X Shao, S Vaseghi, Q Yang
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005
Cited by: 21; Year: 2005
Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model
X Shao, B Milner
2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003
Cited by: 16; Year: 2003
Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction
X Shao, B Milner
The Journal of the Acoustical Society of America 118 (2), 1134-1143, 2005
Cited by: 12; Year: 2005
Pruning redundant synthesis units based on static and delta unit appearance frequency.
H Lu, W Zhang, X Shao, Q Zhou, W Lei, H Zhou, AP Breen
INTERSPEECH, 269-273, 2015
Cited by: 9; Year: 2015
Robust algorithms for speech reconstruction on mobile devices
X Shao
University of East Anglia, 2005
Cited by: 9; Year: 2005
Methods, apparatus and data structure for cross-language speech adaptation
X Shao, A Breen
US Patent 9,798,653, 2017
Cited by: 8; Year: 2017
Low bit-rate feature vector compression using transform coding and non-uniform bit allocation
B Milner, X Shao
2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003
Cited by: 8; Year: 2003
Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications.
X Shao, BP Milner, SJ Cox
INTERSPEECH, 1725-1728, 2003
Cited by: 7; Year: 2003
Audio-visual speech fragment decoding.
J Barker, X Shao
AVSP, 2007
Cited by: 6; Year: 2007
Audio-visual speech recognition in the presence of a competing speaker
X Shao, J Barker
Ninth International Conference on Spoken Language Processing, 2006
Cited by: 6; Year: 2006
Model-based parametric prosody synthesis with deep neural network
H Liu, H Lu, X Shao, Y Xu
Proceedings of the Annual Conference of the International Speech …, 2016
Cited by: 5; Year: 2016
MAP prediction of pitch from MFCC vectors for speech reconstruction
X Shao, BP Milner
Eighth International Conference on Spoken Language Processing, 2004
Cited by: 5; Year: 2004
Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech.
B Milner, X Shao, J Darch
INTERSPEECH, 321-324, 2005
Cited by: 3; Year: 2005