World's Best Scientists 2026 revealed!
IEEE/ACM Transactions on Audio Speech and Language Processing
H-index 53

IEEE/ACM Transactions on Audio Speech and Language Processing

Ranking & Metrics

Discipline name Position Best Scientists Publications D-Index
Computer Science 69 337 741 52
Electronics and Electrical Engineering 168 36 94 21

Additional Metrics

Number of Best Scientists*: 391
Documents by Best Scientists*: 818
Top 100 Ranked Scientists*: 5
SCIMAGO H-index: 92
SCIMAGO SJR: 1.061
Impact Factor: 5.1

Overview

Top Research Topics at IEEE Transactions on Audio, Speech, and Language Processing?

The foci of the journal are Speech recognition, Artificial intelligence, Speech processing, Pattern recognition and Algorithm. The journal focuses on Speech recognition but the discussions also offer insight into other areas such as Speech enhancement, Noise and Audio signal processing. The studies in Speech enhancement featured incorporate elements of Intelligibility (communication), Wiener filter, Estimator, Noise measurement and Noise reduction.

Audio signal processing research presented falls under the umbrella topic of Audio signal. While Artificial intelligence is the focus of the journal, it also provided insights into the studies of Machine learning and Natural language processing. The studies on Speech processing discussed can also contribute to research in the domains of Speech synthesis, Reverberation, Robustness (computer science) and Speech coding.

The Reverberation study featured falls within the larger field of Acoustics. The journal features works in Pattern recognition, more specifically Mixture model, Mel-frequency cepstrum, Discriminative model and Feature vector, and explores their relation to disciplines like Gaussian process. The study of Algorithm encompasses disciplines such as Microphone, as well as fields such as Beamforming, all of which overlap with one another.

  • Speech recognition (54.36%)
  • Artificial intelligence (40.35%)
  • Speech processing (31.32%)

What are the most cited papers published in the journal?

  • Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition (2478 citations)
  • Front-End Factor Analysis for Speaker Verification (2433 citations)
  • Performance measurement in blind audio source separation (2023 citations)

Research areas of the most cited articles at IEEE Transactions on Audio, Speech, and Language Processing:

The journal papers cover a variety of subjects, including Speech recognition, Artificial intelligence, Speech processing, Pattern recognition and Audio signal processing. The most cited papers explore topics in Speech recognition which can be helpful for research in disciplines like Speech enhancement, Noise and Algorithm. The most cited articles focus on Artificial intelligence but the discussions also offer insight into other areas such as Machine learning and Natural language processing.

What topics the last edition of the journal is best known for?

  • Artificial intelligence
  • Statistics
  • Machine learning

The previous edition focused in particular on these issues:

IEEE Transactions on Audio, Speech, and Language Processing generally zeroes in on subjects such as Speech recognition, Artificial intelligence, Speech processing, Task analysis and Algorithm. The work on Speech recognition presented in IEEE Transactions on Audio, Speech, and Language Processing focuses on Hidden Markov model in particular. The Artificial intelligence works featured in IEEE Transactions on Audio, Speech, and Language Processing incorporate elements from Machine learning, Pattern recognition, Context model and Natural language processing.

The study of Data modeling and how it intertwines with concepts under Transformer (machine learning model) were explored in the presented Speech processing research. In the journal, Reverberation, Microphone array, Microphone, Time–frequency analysis and White noise are investigated in conjunction with one another to address concerns in Algorithm research. Artificial neural network research featured in it incorporates concerns from various other topics such as Convolutional neural network and Robustness (computer science).

The most cited articles from the last journal are:

  • An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning (38 citations)
  • Expressive TTS Training With Frame and Style Reconstruction Loss (21 citations)
  • Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019 (17 citations)

Papers citation over time

A key indicator for each journal is its effectiveness in reaching other researchers with the papers published at that venue.

The chart below presents the interquartile range (first quartile 25%, median 50% and third quartile 75%) of the number of citations of articles over time.

The top authors publishing in IEEE Transactions on Audio, Speech, and Language Processing (based on the number of publications) are:

  • Jacob Benesty (66 papers) published 7 papers at the last edition, 3 more than at the previous edition,
  • DeLiang Wang (62 papers) published 7 papers at the last edition, 1 less than at the previous edition,
  • Jesper Jensen (53 papers) published 4 papers at the last edition, 2 less than at the previous edition,
  • Sharon Gannot (48 papers) published 1 paper at the last edition, 6 less than at the previous edition,
  • John H. L. Hansen (44 papers) published 3 papers at the last edition.

The overall trend for top authors publishing in this journal is outlined below. The chart shows the number of publications at each edition of the journal for top authors.

Only papers with recognized affiliations are considered

The top affiliations publishing in IEEE Transactions on Audio, Speech, and Language Processing (based on the number of publications) are:

  • Microsoft (89 papers) published 4 papers at the last edition the same number as at the previous edition,
  • Nippon Telegraph and Telephone (74 papers) published 5 papers at the last edition the same number as at the previous edition,
  • Aalborg University (72 papers) published 6 papers at the last edition, 1 more than at the previous edition,
  • Ohio State University (66 papers) published 8 papers at the last edition the same number as at the previous edition,
  • University of Texas at Dallas (57 papers) published 3 papers at the last edition, 2 more than at the previous edition.

The overall trend for top affiliations publishing in this journal is outlined below. The chart shows the number of publications at each edition of the journal for top affiliations.

Publication chance based on affiliation

The publication chance index shows the ratio of articles published by the best research institutions in the journal edition to all articles published within that journal. The best research institutions were selected based on the largest number of articles published during all editions of the journal.

The chart below presents the percentage ratio of articles from top institutions (based on their ranking of total papers).Top affiliations were grouped by their rank into the following tiers: top 1-10, top 11-20, top 21-50, and top 51+. Only articles with a recognized affiliation are considered.

During the most recent 2021 edition, 8.08% of publications had an unrecognized affiliation. Out of the publications with recognized affiliations, 22.59% were posted by at least one author from the top 10 institutions publishing in the journal. Another 10.88% included authors affiliated with research institutions from the top 11-20 affiliations. Institutions from the 21-50 range included 25.94% of all publications and 40.59% were from other institutions.

Returning Authors Index

A very common phenomenon observed among researchers publishing scientific articles is the intentional selection of journals they have already attended in the past. In particular, it is worth analyzing the case when the authors participate in the same journal from year to year.

The Returning Authors Index presented below illustrates the ratio of authors who participated in both a given as well as the previous edition of the journal in relation to all participants in a given year.

Returning Institution Index

The graph below shows the Returning Institution Index, illustrating the ratio of institutions that participated in both a given and the previous edition of the conference in relation to all affiliations present in a given year.

The experience to innovation index

Our experience to innovation index was created to show a cross-section of the experience level of authors publishing in a journal. The index includes the authors publishing at the last edition of a journal, grouped by total number of publications throughout their academic career (P) and the total number of citations of these publications ever received (C).

The group intervals were selected empirically to best show the diversity of the authors' experiences, their labels were selected as a convenience, not as judgment. The authors were divided into the following groups:

  • Novice - P < 5 or C < 25 (the number of publications less than 5 or the number of citations less than 25),
  • Competent - P < 10 or C < 100 (the number of publications less than 10 or the number of citations less than 100),
  • Experienced - P < 25 or C < 625 (the number of publications less than 25 or the number of citations less than 625),
  • Master - P < 50 or C < 2500 (the number of publications less than 50 or the number of citations less than 2500),
  • Star - P ≥ 50 and C ≥ 2500 (both the number of publications greater than 50 and the number of citations greater than 2500).

The chart below illustrates experience levels of first authors in cases of publications with multiple authors.

Bridging the Gap: Application in Education

Understanding the real-world applications and benefits of the research topics discussed in this journal is crucial for readers so they can grasp the relevance of such studies. For instance, the concepts of Speech Recognition and Artificial Intelligence can be applied in various sectors including Education, where these technologies are used in teaching and learning tools for children. A common example can be seen in preschool education, where teaching aids equipped with speech recognition features facilitate interactive learning.

Considering a career as an educator such as a preschool teacher also necessitates comprehension of these technologies. As speech recognition and AI technology increasingly become a part of teaching aids and tools, educators are expected to acquaint themselves with these advancements. In states like Virginia, knowledge and adeptness in utilizing these developments are included in the {anchor}preschool teacher requirements in Virginia. Focusing on these area of studies not only enriches one's skill set as a teacher but can also contribute to efficient and effective learning methods for children in preschool.

Top Publications

  • HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

    Wei-Ning Hsu;Benjamin Bolte;Yao-Hung Hubert Tsai;Kushal Lakhotia

    (2021)
    1565 Citations
  • Pre-Training with Whole Word Masking for Chinese BERT

    Yiming Cui;Wanxiang Che;Ting Liu;Bing Qin

    (2021)
    1552 Citations
  • PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition

    Qiuqiang Kong;Yin Cao;Turab Iqbal;Yuxuan Wang

    (2020)
    1089 Citations
  • SoundStream: An End-to-End Neural Audio Codec

    (2021)
    576 Citations
  • AudioLM: A Language Modeling Approach to Audio Generation

    Unknown

    (2022)
    498 Citations
  • Learning Complex Spectral Mapping With Gated Convolutional Recurrent Networks for Monaural Speech Enhancement

    Ke Tan;DeLiang Wang

    (2020)
    333 Citations
  • Wavesplit: End-to-End Speech Separation by Speaker Clustering

    Neil Zeghidour;David Grangier

    (2021)
    258 Citations
  • TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech

    Andy T. Liu;Shang-Wen Li;Hung-yi Lee

    (2021)
    256 Citations
  • FSD50K: An Open Dataset of Human-Labeled Sound Events

    (2021)
    234 Citations

Related Online Degrees & Career Pathways

Exploring online college degrees offers flexibility and accessibility for students pursuing Computer Science. These programs often cater to diverse learning styles and busy schedules, making education more attainable. For those starting their journey, online associates degree programs provide a quick, affordable pathway to gain foundational tech skills and enter entry-level IT roles.

Many students benefit from accelerated degree programs, which allow learners to complete their bachelor's in less time, increasing career momentum without sacrificing quality. Such programs are ideal for motivated individuals looking to fast-track their education and enter the competitive tech industry sooner.

Choosing a degree with strong earning potential is crucial, and Computer Science is consistently among the highest paying bachelor degrees. Graduates often find rewarding careers in software development, cybersecurity, data analysis, and more. These pathways not only provide financial stability but also strong growth prospects in a technology-driven world.

Best Scientists Contributing to This Journal