World's Best Scientists 2026 revealed!

D-Index & Metrics

Computer Science

D-Index
79
Citations
26811
World Ranking
1137
National Ranking
604

Overview

Shinji Watanabe is affiliated with Carnegie Mellon University in the United States and has contributed extensively to the fields of computer science, with a strong focus on artificial intelligence and signal processing. Their research spans speech recognition, speech and audio processing, and natural language processing techniques.

The scientist's recent publications include the following:

  • Self-Supervised Speech Representation Learning: A Review, 2022, IEEE Journal of Selected Topics in Signal Processing
  • Conditional Diffusion Probabilistic Model for Speech Enhancement, 2022, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • End-to-End Speech Recognition: A Survey, 2023, IEEE/ACM Transactions on Audio Speech and Language Processing
  • TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation, 2023, IEEE/ACM Transactions on Audio Speech and Language Processing
  • Far-Field Automatic Speech Recognition, 2020, Proceedings of the IEEE

Frequent co-authors working alongside Shinji Watanabe include:

  • Xuankai Chang
  • Siddhant Arora
  • Yifan Peng
  • Jiatong Shi
  • Wangyou Zhang

The primary research topics covered in their work are:

  • Speech Recognition and Synthesis
  • Speech and Audio Processing
  • Music and Audio Processing
  • Natural Language Processing Techniques
  • Topic Modeling
  • Speech and dialogue systems
  • Advanced Adaptive Filtering Techniques

Shinji Watanabe's contributions mainly appear in the following publication venues:

  • arXiv (Cornell University)
  • Interspeech 2022
  • ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • IEEE/ACM Transactions on Audio Speech and Language Processing
  • Computer Speech & Language

Their scientific work intersects with subfields like artificial intelligence, signal processing, computer vision and pattern recognition, computational mechanics, and experimental and cognitive psychology. The dominant field of study remains computer science, within which their extensive research output is concentrated.

Best Publications

  • ESPNet: End-to-end speech processing toolkit

    Shinji Watanabe;Takaaki Hori;Shigeki Karita;Tomoki Hayashi

  • Deep clustering: Discriminative embeddings for segmentation and separation

    John R. Hershey;Zhuo Chen;Jonathan Le Roux;Shinji Watanabe

  • Joint CTC-attention based end-to-end speech recognition using multi-task learning

    Suyoun Kim;Takaaki Hori;Shinji Watanabe

  • Hybrid CTC/Attention Architecture for End-to-End Speech Recognition

    Shinji Watanabe;Takaaki Hori;Suyoun Kim;John R. Hershey

  • The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

    Jon Barker;Ricard Marxer;Emmanuel Vincent;Shinji Watanabe

  • Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks

    Hakan Erdogan;John R. Hershey;Shinji Watanabe;Jonathan Le Roux

  • Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR

    Felix Weninger;Hakan Erdogan;Shinji Watanabe;Emmanuel Vincent

  • SUPERB: Speech processing Universal PERformance Benchmark

    Shu-wen Yang;Po-Han Chi;Yung-Sung Chuang;Cheng-I Jeff Lai

  • A Comparative Study on Transformer vs RNN in Speech Applications

    Shigeki Karita;Nanxin Chen;Tomoki Hayashi;Takaaki Hori

  • Self-Supervised Speech Representation Learning: A Review

    Unknown

  • An analysis of environment, microphone and data simulation mismatches in robust speech recognition

    Emmanuel Vincent;Shinji Watanabe;Aditya Arie Nugraha;Jon Barker

  • Single-Channel Multi-Speaker Separation using Deep Clustering

    Yusuf Ziya Isik;Yusuf Ziya Isik;Jonathan Le Roux;Zhuo Chen;Zhuo Chen;Shinji Watanabe

  • Improved MVDR beamforming using single-channel mask prediction networks

    Hakan Erdogan;John R. Hershey;Shinji Watanabe;Michael I. Mandel

  • The second ‘chime’ speech separation and recognition challenge: Datasets, tasks and baselines

    Emmanuel Vincent;Jon Barker;Shinji Watanabe;Jonathan Le Roux

  • A Comparative Study on Transformer vs RNN in Speech Applications

    Shigeki Karita;Xiaofei Wang;Shinji Watanabe;Takenori Yoshimura

  • Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM

    Takaaki Hori;Shinji Watanabe;Yu Zhang;William Chan

  • The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.

    Jon Barker;Shinji Watanabe;Emmanuel Vincent;Jan Trmal

  • A review of speaker diarization: Recent advances with deep learning

    Tae Jin Park;Naoyuki Kanda;Dimitrios Dimitriadis;Kyu J. Han

  • End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors

    Unknown

  • Conditional Diffusion Probabilistic Model for Speech Enhancement

    Unknown

  • End-to-end neural speaker diarization with permutation-free objectives

    Yusuke Fujita;Yusuke Fujita;Naoyuki Kanda;Shota Horiguchi;Kenji Nagamatsu

  • End-to-End Neural Speaker Diarization with Self-Attention

    Yusuke Fujita;Naoyuki Kanda;Shota Horiguchi;Yawen Xue

  • GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

    Guoguo Chen;Shuzhou Chai;Guanbo Wang;Jiayu Du

Frequent Co-Authors

John R. Hershey
John R. Hershey Google (United States)
Jonathan Le Roux
Jonathan Le Roux Mitsubishi Electric (United States)
Sanjeev Khudanpur
Sanjeev Khudanpur Johns Hopkins University
Hakan Erdogan
Hakan Erdogan Google (United States)
Marc Delcroix
Marc Delcroix NTT (Japan)
Najim Dehak
Najim Dehak Johns Hopkins University
Yanmin Qian
Yanmin Qian Shanghai Jiao Tong University
Emmanuel Vincent
Emmanuel Vincent University of Lorraine
Takuya Yoshioka
Takuya Yoshioka Microsoft (United States)

If you think any of the details on this page are incorrect, let us know.

Report an issue

We appreciate your kind effort to assist us to improve this page, it would be helpful providing us with as much detail as possible in the text box below:

Related Online Degrees & Career Pathways

Exploring Computer Science in the USA opens doors to a variety of related online degrees and expanding fields. Many students enhance their expertise by pursuing specialized online programs in engineering and data-driven disciplines.

For instance, environmental concerns are fueling the growth of environmental engineering schools online, offering affordable programs that merge computer science with sustainability. Students interested in design and robotics may benefit from pursuing an online degree in mechanical engineering, equipping them with skills for an evolving tech-driven industry.

If you are inclined toward the mathematical and theoretical aspects of computing, consider a physics degree online. This broadens career options in research, high-tech manufacturing, or scientific computing.

For those focused on Big Data and analytics, the cheapest master in data science programs can be a strategic addition, preparing graduates for roles as data scientists or analysts in various industries.

These pathways complement core computer science studies and enhance employability in STEM fields, both in the U.S. and globally.

Best Scientists Citing Shinji Watanabe

Trending Scientists

Recently Published Articles