D-Index & Metrics

Computer Science

D-Index: 76
Citations: 70,814
World Ranking: 1,307
National Ranking: 691

Overview

Matei Zaharia is a researcher affiliated with the University of California, Berkeley in the United States. Their academic work primarily focuses on the field of Computer Science, with significant contributions in several subfields including Artificial Intelligence, Computer Networks and Communications, Computer Vision and Pattern Recognition, Information Systems, and Cardiology and Cardiovascular Medicine.

Their research covers a variety of topics, notably:

  • Topic Modeling
  • Natural Language Processing Techniques
  • Cloud Computing and Resource Management
  • Multimodal Machine Learning Applications
  • Advanced Data Storage Technologies
  • Distributed systems and fault tolerance
  • Advanced Image and Video Retrieval Techniques

Among their recent publications are:

  • "Advances, challenges and opportunities in creating data for trustworthy AI," 2022, published in Nature Machine Intelligence
  • "How Is ChatGPT's Behavior Changing Over Time?," 2024, published in Harvard Data Science Review
  • "ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction," 2022, published in Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
  • "ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT," 2020, published on arXiv (Cornell University)
  • "Delta Lake," 2020, published in Proceedings of the VLDB Endowment

Zaharia's frequent co-authors include:

  • Peter Bailis
  • Omar Khattab
  • Albert J. Rogers
  • Sanjiv M. Narayan
  • James Zou

They have published extensively in venues such as:

  • arXiv (Cornell University)
  • Proceedings of the VLDB Endowment
  • Circulation
  • Heart Rhythm
  • Proceedings of the 2022 International Conference on Management of Data

Best Publications

  • A view of cloud computing

    Michael Armbrust;Armando Fox;Rean Griffith;Anthony D. Joseph

  • Above the Clouds: A Berkeley View of Cloud Computing

    Michael Armbrust;Armando Fox;Rean Griffith;Anthony D. Joseph

  • Spark: cluster computing with working sets

    Matei Zaharia;Mosharaf Chowdhury;Michael J. Franklin;Scott Shenker

  • Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing

    Matei Zaharia;Mosharaf Chowdhury;Tathagata Das;Ankur Dave

  • Apache Spark: a unified engine for big data processing

    Matei Zaharia;Reynold S. Xin;Patrick Wendell;Tathagata Das

  • Improving MapReduce performance in heterogeneous environments

    Matei Zaharia;Andy Konwinski;Anthony D. Joseph;Randy Katz

  • Mesos: a platform for fine-grained resource sharing in the data center

    Benjamin Hindman;Andy Konwinski;Matei Zaharia;Ali Ghodsi

  • On the Opportunities and Risks of Foundation Models

    Rishi Bommasani;Drew A. Hudson;Ehsan Adeli;Russ Altman

  • MLlib: machine learning in Apache Spark

    Xiangrui Meng;Joseph Bradley;Burak Yavuz;Evan Sparks

  • Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling

    Matei Zaharia;Dhruba Borthakur;Joydeep Sen Sarma;Khaled Elmeleegy

  • Spark SQL: Relational Data Processing in Spark

    Michael Armbrust;Reynold S. Xin;Cheng Lian;Yin Huai

  • Dominant resource fairness: fair allocation of multiple resource types

    Ali Ghodsi;Matei Zaharia;Benjamin Hindman;Andy Konwinski

  • Discretized streams: fault-tolerant streaming computation at scale

    Matei Zaharia;Tathagata Das;Haoyuan Li;Timothy Hunter

  • ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT

    Omar Khattab;Matei Zaharia

  • Sparrow: distributed, low latency scheduling

    Kay Ousterhout;Patrick Wendell;Matei Zaharia;Ion Stoica

  • Managing data transfers in computer clusters with orchestra

    Mosharaf Chowdhury;Matei Zaharia;Justin Ma;Michael I. Jordan

  • Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters

    Matei Zaharia;Tathagata Das;Haoyuan Li;Scott Shenker

  • PipeDream: generalized pipeline parallelism for DNN training

    Deepak Narayanan;Aaron Harlap;Amar Phanishayee;Vivek Seshadri

  • Shark: SQL and rich analytics at scale

    Reynold S. Xin;Josh Rosen;Matei Zaharia;Michael J. Franklin

  • Advances, challenges and opportunities in creating data for trustworthy AI

    Unknown

  • Efficient large-scale language model training on GPU clusters using Megatron-LM

    Deepak Narayanan;Mohammad Shoeybi;Jared Casper;Patrick LeGresley

Frequent Co-Authors

Peter Bailis, Stanford University
Ion Stoica, University of California, Berkeley
Scott Shenker, University of California, Berkeley
Ali Ghodsi, University of Waterloo
Michael J. Franklin, University of Chicago
Srinivasan Keshav, University of Cambridge
Shivaram Venkataraman, University of Wisconsin–Madison
Anthony D. Joseph, University of California, Berkeley
David A. Patterson, University of California, Berkeley
Randy H. Katz, University of California, Berkeley
