D-Index & Metrics Best Publications

D-Index & Metrics D-index (Discipline H-index) only includes papers and citation values for an examined discipline in contrast to General H-index which accounts for publications across all disciplines.

Discipline name D-index D-index (Discipline H-index) only includes papers and citation values for an examined discipline in contrast to General H-index which accounts for publications across all disciplines. Citations Publications World Ranking National Ranking
Computer Science D-index 32 Citations 4,669 140 World Ranking 9283 National Ranking 4227

Overview

What is he best known for?

The fields of study he is best known for:

  • Artificial intelligence
  • Speech recognition
  • Statistics

His scientific interests lie mostly in Speech recognition, Speech enhancement, Speech processing, Reverberation and Signal processing. His Speech recognition research is multidisciplinary, relying on both Artificial neural network and Microphone array. His Speech enhancement research includes elements of Noise reduction and Speaker diarisation.

Within one scientific family, Takuya Yoshioka focuses on topics pertaining to Speaker recognition under Speech processing, and may sometimes address concerns connected to Acoustic model and Voice activity detection. His research integrates issues of Deconvolution and Blind signal separation in his study of Reverberation. His Word error rate study combines topics in areas such as Feature extraction and Beamforming.

His most cited work include:

  • Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction (214 citations)
  • A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research (186 citations)
  • Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition (182 citations)

What are the main themes of his work throughout his whole career to date?

Takuya Yoshioka mainly focuses on Speech recognition, Speech enhancement, Artificial intelligence, Speech processing and Reverberation. His Speech recognition research integrates issues from Artificial neural network and Noise. His Speech enhancement study combines topics from a wide range of disciplines, such as Background noise, Speaker recognition, Audio signal processing, Noise reduction and Linear predictive coding.

The study incorporates disciplines such as Algorithm and Pattern recognition in addition to Artificial intelligence. His work on Voice activity detection is typically connected to Process as part of general Speech processing study, connecting several disciplines of science. His Reverberation study integrates concerns from other disciplines, such as Blind signal separation and Signal processing.

He most often published in these fields:

  • Speech recognition (83.11%)
  • Speech enhancement (27.03%)
  • Artificial intelligence (22.97%)

What were the highlights of his more recent work (between 2019-2021)?

  • Speech recognition (83.11%)
  • Monaural (8.11%)
  • End-to-end principle (8.11%)

In recent papers he was focusing on the following fields of study:

His primary areas of investigation include Speech recognition, Monaural, End-to-end principle, Word error rate and Speaker diarisation. His study looks at the relationship between Speech recognition and fields such as Joint, as well as how they intersect with chemical problems. Takuya Yoshioka has included themes like Speaker recognition and Data set in his Monaural study.

He usually deals with End-to-end principle and limits it to topics linked to Microphone and Stream processing, Direction of arrival, Contrast, Adaptive beamformer and Robustness. His studies in Word error rate integrate themes in fields like Recurrent neural network and Transformer. Speaker diarisation is closely attributed to Speech enhancement in his research.

Between 2019 and 2021, his most popular works were:

  • Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation (53 citations)
  • CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings (49 citations)
  • Continuous Speech Separation: Dataset and Analysis (40 citations)

In his most recent research, the most cited papers focused on:

  • Artificial intelligence
  • Speech recognition
  • Statistics

His primary scientific interests are in Speech recognition, End-to-end principle, Transcription, Monaural and Speaker diarisation. His Speech recognition research is multidisciplinary, incorporating elements of Speech enhancement and Relevance. His work deals with themes such as Robustness and Microphone, which intersect with End-to-end principle.

He interconnects Speaker recognition, Speaker identification and Joint in the investigation of issues within Monaural. His Joint research incorporates themes from Mutual information, Cluster analysis and Joint probability distribution. His biological study spans a wide range of topics, including Natural and Conversational speech.

This overview was generated by a machine learning system which analysed the scientist’s body of work. If you have any feedback, you can contact us here.

Best Publications

Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction

Tomohiro Nakatani;Takuya Yoshioka;Keisuke Kinoshita;Masato Miyoshi.
IEEE Transactions on Audio, Speech, and Language Processing (2010)

352 Citations

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research

Keisuke Kinoshita;Marc Delcroix;Sharon Gannot;Emanuël A. P. Habets.
EURASIP Journal on Advances in Signal Processing (2016)

327 Citations

Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition

Takuya Yoshioka;A. Sehr;M. Delcroix;K. Kinoshita.
IEEE Signal Processing Magazine (2012)

303 Citations

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices

Takuya Yoshioka;Nobutaka Ito;Marc Delcroix;Atsunori Ogawa.
ieee automatic speech recognition and understanding workshop (2015)

231 Citations

Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation

Yi Luo;Zhuo Chen;Takuya Yoshioka.
international conference on acoustics speech and signal processing (2020)

216 Citations

Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening

T. Yoshioka;T. Nakatani.
IEEE Transactions on Audio, Speech, and Language Processing (2012)

209 Citations

Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise

Takuya Higuchi;Nobutaka Ito;Takuya Yoshioka;Tomohiro Nakatani.
international conference on acoustics, speech, and signal processing (2016)

183 Citations

Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization

Takuya Yoshioka;Tomohiro Nakatani;Masato Miyoshi;Hiroshi G Okuno.
IEEE Transactions on Audio, Speech, and Language Processing (2011)

171 Citations

Blind speech dereverberation with multi-channel linear prediction based on short time fourier transform representation

T. Nakatani;T. Yoshioka;K. Kinoshita;M. Miyoshi.
international conference on acoustics, speech, and signal processing (2008)

156 Citations

CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings

Shinji Watanabe;Michael Mandel;Jon Barker;Emmanuel Vincent.
6th International Workshop on Speech Processing in Everyday Environments (CHiME 2020) (2020)

129 Citations

If you think any of the details on this page are incorrect, let us know.

Contact us

Best Scientists Citing Takuya Yoshioka

Tomohiro Nakatani

Tomohiro Nakatani

NTT (Japan)

Publications: 102

Shinji Watanabe

Shinji Watanabe

Carnegie Mellon University

Publications: 84

Keisuke Kinoshita

Keisuke Kinoshita

NTT (Japan)

Publications: 79

Reinhold Haeb-Umbach

Reinhold Haeb-Umbach

University of Paderborn

Publications: 47

Shoko Araki

Shoko Araki

NTT (Japan)

Publications: 44

Simon Doclo

Simon Doclo

Carl von Ossietzky University of Oldenburg

Publications: 39

DeLiang Wang

DeLiang Wang

The Ohio State University

Publications: 36

Hirokazu Kameoka

Hirokazu Kameoka

NTT (Japan)

Publications: 34

Emmanuel Vincent

Emmanuel Vincent

University of Lorraine

Publications: 34

Dong Yu

Dong Yu

Tencent (China)

Publications: 27

Sharon Gannot

Sharon Gannot

Bar-Ilan University

Publications: 26

Haizhou Li

Haizhou Li

Chinese University of Hong Kong, Shenzhen

Publications: 25

Nobutaka Ono

Nobutaka Ono

Tokyo Metropolitan University

Publications: 25

Walter Kellermann

Walter Kellermann

University of Erlangen-Nuremberg

Publications: 23

John R. Hershey

John R. Hershey

Google (United States)

Publications: 22

Emanuel A. P. Habets

Emanuel A. P. Habets

University of Erlangen-Nuremberg

Publications: 22

Trending Scientists

Bhaskar Dutta

Bhaskar Dutta

University of Warwick

Alessandro Bottaro

Alessandro Bottaro

University of Genoa

Garvin Heath

Garvin Heath

National Renewable Energy Laboratory

Raimondo Maggi

Raimondo Maggi

University of Parma

Huolin L. Xin

Huolin L. Xin

University of California, Irvine

Inkyu Park

Inkyu Park

Korea Advanced Institute of Science and Technology

Tanja Kortemme

Tanja Kortemme

University of California, San Francisco

Momoko Horikoshi

Momoko Horikoshi

University of Oxford

Kevin J. Peterson

Kevin J. Peterson

Dartmouth College

Valentijn R. N. Pauwels

Valentijn R. N. Pauwels

Monash University

Christopher Kennard

Christopher Kennard

University of Oxford

Bernard Hirschel

Bernard Hirschel

Geneva College

Andrew J.S. Coats

Andrew J.S. Coats

University of Warwick

Vincenzo Di Marzo

Vincenzo Di Marzo

National Research Council (CNR)

Benoit H. Mulsant

Benoit H. Mulsant

Centre for Addiction and Mental Health

Gregory Phillips

Gregory Phillips

Northwestern University

Something went wrong. Please try again later.