2021 - IEEE Fellow For leadership in creating cloud speech recognition services in industry
His scientific interests lie mostly in Speech recognition, Artificial intelligence, Artificial neural network, Pattern recognition and Hidden Markov model. His work on Word error rate as part of general Speech recognition research is often related to Set, thus linking different fields of science. His work in Artificial intelligence covers topics such as Natural language processing which are related to areas like Context model.
His Artificial neural network research incorporates themes from Singular value decomposition and Feature extraction. His work in the fields of Speaker recognition and Feature transformation overlaps with other areas such as Merge and Bottleneck. His Hidden Markov model study incorporates themes from Time delay neural network, Discriminative model and Softmax function.
The scientist’s investigation covers issues in Speech recognition, Artificial intelligence, Word error rate, Artificial neural network and Pattern recognition. His specific area of interest is Speech recognition, where Yifan Gong studies Acoustic model. His Artificial intelligence research includes elements of Machine learning and Natural language processing.
His research integrates issues of Language model, Training set, Algorithm, Decoding methods and Robustness in his study of Word error rate. His research investigates the connection between Artificial neural network and topics such as Dictation that intersect with issues in Regularization and Voice search. His Pattern recognition research incorporates elements of Speech enhancement, Normalization and Utterance.
His primary areas of investigation include Speech recognition, Recurrent neural network, Word error rate, End-to-end principle and Reduction. His biological study spans a wide range of topics, including Artificial neural network and Decoding methods. His research investigates the connection between Artificial neural network and topics such as Embedding that intersect with problems in Attention network.
His Word error rate study combines topics in areas such as Language model, Normalization and Task. His Training set research entails a greater understanding of Artificial intelligence. His work on Invariant feature extraction is typically connected to A domain as part of general Artificial intelligence study, connecting several disciplines of science.
Yifan Gong mainly investigates Speech recognition, Recurrent neural network, Word error rate, Initialization and Training set. Yifan Gong frequently studies issues relating to Reduction and Speech recognition. His work in Recurrent neural network addresses issues such as End-to-end principle, which are connected to fields such as Heuristic.
In his research on the topic of Word error rate, Interpolation, Connectionism, Task and Leverage is strongly related with Language model. His Speaker recognition study combines topics from a wide range of disciplines, such as Data set, Robustness and Monaural. His Speech synthesis research is multidisciplinary, incorporating perspectives in Transducer, Keyword spotting and Speaker adaptation.
This overview was generated by a machine learning system which analysed the scientist’s body of work. If you have any feedback, you can contact us here.
Recent advances in deep learning for speech research at Microsoft
Li Deng;Jinyu Li;Jui-Ting Huang;Kaisheng Yao.
international conference on acoustics, speech, and signal processing (2013)
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
Jui-Ting Huang;Jinyu Li;Dong Yu;Li Deng.
international conference on acoustics, speech, and signal processing (2013)
An overview of noise-robust automatic speech recognition
Jinyu Li;Li Deng;Yifan Gong;Reinhold Haeb-Umbach.
IEEE Transactions on Audio, Speech, and Language Processing (2014)
Restructuring of Deep Neural Network Acoustic Models with Singular Value Decomposition
Jian Xue;Jinyu Li;Yifan Gong.
conference of the international speech communication association (2013)
Learning small-size DNN with output-distribution-based criteria.
Jinyu Li;Rui Zhao;Jui-Ting Huang;Yifan Gong.
conference of the international speech communication association (2014)
Recognition architecture for generating Asian characters
Shiun-Zu Kuo;Kevin E. Feige;Yifan Gong;Taro Miwa.
(2007)
Adaptation of context-dependent deep neural networks for automatic speech recognition
Kaisheng Yao;Dong Yu;Frank Seide;Hang Su.
spoken language technology workshop (2012)
Recognition architecture for generating Asian characters
Kuo Shiun-Zu;Feige Kevin E;Gong Yifan;Miwa Taro.
(2007)
Identifying language origin of words
Min Chu;Yi Ning Chen;Shiun-Zu Kuo;Xiaodong He.
(2006)
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network
Jian Xue;Jinyu Li;Dong Yu;Mike Seltzer.
international conference on acoustics, speech, and signal processing (2014)
If you think any of the details on this page are incorrect, let us know.
We appreciate your kind effort to assist us to improve this page, it would be helpful providing us with as much detail as possible in the text box below:
Microsoft (United States)
Citadel
Tencent (China)
University of North Texas
Apple (United States)
Facebook (United States)
Microsoft (United States)
University of Paderborn
Apple (United States)
Amazon (United States)
Centre for Research and Technology Hellas
Eindhoven University of Technology
University of Notre Dame
Jiangnan University
Universität Hamburg
Kansas State University
Selçuk University
Aix-Marseille University
University of Potsdam
Federal University of Toulouse Midi-Pyrénées
National Taiwan Ocean University
University of Colorado Boulder
University of British Columbia
University of New South Wales
University of Versailles Saint-Quentin-en-Yvelines
University of North Carolina at Chapel Hill