2026 Florian Metze: Computer Science Researcher – H-Index, Publications & Awards

Discipline name	D-Index	World Ranking	Current World Ranking	National Ranking	Current National Ranking	Publications	Citations
Computer Science	54	4622	4493	2148	2073	300	10261

Overview

Florian Metze is affiliated with Carnegie Mellon University in the United States and specializes in computer science with a focus on artificial intelligence, computer vision and pattern recognition, and signal processing. Their publication record includes 125 works predominantly in these fields, reflecting a comprehensive engagement with areas such as speech recognition and synthesis, music and audio processing, and natural language processing techniques.

Their research covers multiple main topics including:

Speech Recognition and Synthesis
Music and Audio Processing
Natural Language Processing Techniques
Topic Modeling
Speech and Audio Processing
Multimodal Machine Learning Applications
Domain Adaptation and Few-Shot Learning

Florian Metze has contributed to various frequent publication venues, showcasing a range of interdisciplinary approaches. These venues include:

arXiv (Cornell University)
IEEE/ACM Transactions on Audio Speech and Language Processing
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Interspeech 2022
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

Recent notable papers include:

"VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding," 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
"How2Sign: A large-scale multimodal dataset for continuous American sign language," 2021, UPCommons (Polytechnic University of Catalonia)
"Masked Autoencoders that Listen," 2022, arXiv (Cornell University)
"Self-supervised object detection from audio-visual correspondence," 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
"Support-set bottlenecks for video-text representation learning," 2020, arXiv (Cornell University)

Frequent co-authors who have collaborated with Florian Metze include:

Alan W. Black
Shinji Watanabe
Xinjian Li
Siddharth Dalmia
Po-Yao Huang

Best Publications

EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding

Yajie Miao;Mohammad Gowayyed;Florian Metze

836
Citations
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding

Hu Xu;Gargi Ghosh;Po-Yao Huang;Dmytro Okhonko

368
Citations
Extracting deep bottleneck features using stacked auto-encoders

Jonas Gehring;Yajie Miao;Florian Metze;Alex Waibel

358
Citations
A one-pass decoder based on polymorphic linguistic context assignment

H. Soltau;F. Metze;C. Fugen;A. Waibel

257
Citations
Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval

Niluthpol Chowdhury Mithun;Juncheng Li;Florian Metze;Amit K. Roy-Chowdhury

254
Citations
Masked Autoencoders that Listen

Unknown

226
Citations
How2: A Large-scale Dataset for Multimodal Language Understanding

Ramon Sanabria;Ozan Caglayan;Shruti Palaskar;Desmond Elliott

209
Citations
Comparison of Four Approaches to Age and Gender Recognition for Telephone Applications

F. Metze;J. Ajmera;R. Englert;U. Bub

190
Citations
Advances in automatic meeting record creation and access

A. Waibel;M. Bett;F. Metze;K. Ries

183
Citations
A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling

Yun Wang;Juncheng Li;Florian Metze

180
Citations
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Amanda Duarte;Shruti Palaskar;Lucas Ventura;Deepti Ghadiyaram

180
Citations
A comparison of Deep Learning methods for environmental sound detection

Juncheng Li;Wei Dai;Florian Metze;Shuhui Qu

174
Citations
Session independent non-audible speech recognition using surface electromyography

L. Maier-Hein;F. Metze;T. Schultz;A. Waibel

168
Citations
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition

Aren Jansen;Emmanuel Dupoux;Sharon Goldwater;Mark Johnson

149
Citations
Support-set bottlenecks for video-text representation learning

Mandela Patrick;Po-Yao Huang;Yuki Asano;Florian Metze

143
Citations
Deep maxout networks for low-resource speech recognition

Yajie Miao;Florian Metze;Shourabh Rawat

130
Citations
Speaker adaptive training of deep neural network acoustic models using i-vectors

Yajie Miao;Hao Zhang;Florian Metze

130
Citations
Anger recognition in speech using acoustic and linguistic cues

Tim Polzehl;Alexander Schmitt;Florian Metze;Michael Wagner

125
Citations
A flexible stream architecture for ASR using articulatory features.

Florian Metze;Alex Waibel

114
Citations
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

Hu Xu;Gargi Ghosh;Po-Yao Huang;Prahal Arora

90
Citations
A Comparison of deep learning methods for environmental sound.

Juncheng Li;Wei Dai;Florian Metze;Shuhui Qu

8
Citations
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Amanda Duarte;Shruti Palaskar;Deepti Ghadiyaram;Kenneth DeHaan

2
Citations

Frequent Co-Authors

Alex Waibel Carnegie Mellon University

Alan W. Black Carnegie Mellon University

Tanja Schultz University of Bremen

Alexander G. Hauptmann Carnegie Mellon University

Hagen Soltau Google (United States)

Teruko Mitamura Carnegie Mellon University

Sebastian Möller Technical University of Berlin

Xavier Anguera ELSA Speak

Bhiksha Raj Carnegie Mellon University

Graham Neubig Carnegie Mellon University

External Links

Personal website of Florian Metze Google Scholar page

If you think any of the details on this page are incorrect, let us know.

Related Online Degrees & Career Pathways

Exploring online education options in computer science opens up several career pathways, whether you're starting or advancing your tech career. Many working professionals and students look for flexible and affordable programs, and the range of choices has never been greater.

For those seeking a solid foundation, an online associate degree in computer science provides entry-level knowledge and skills, often leading to immediate job opportunities or offering a stepping stone to further study. Aspiring specialists aiming for leadership or high-demand roles may consider the most useful masters degrees to focus their expertise and boost their earning potential.

Budget is another important factor for many students. There are cheap online degrees fast available, enabling learners to get qualified without excessive debt. Additionally, if your academic record has a few bumps, you can still apply to online graduate schools with low gpa requirements.

Whichever pathway you choose, online degrees provide the flexibility to balance your personal and professional commitments, making career advancement in computer science more accessible than ever.