D-Index & Metrics

D-index (Discipline H-index) only includes papers and citation values for an examined discipline, in contrast to the General H-index, which accounts for publications across all disciplines.

Discipline: Computer Science
D-index: 31
Citations: 5,808
Publications: 78
World Ranking: 9646
National Ranking: 4383
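
To make the distinction between the two indices concrete, here is a minimal Python sketch. It assumes the D-index is simply the standard h-index computed over the papers tagged with one discipline; the exact procedure used to produce the figures on this page is not stated. The `Paper` type, the `discipline` labels, and the sample citation counts are hypothetical and purely illustrative.

```python
from typing import Iterable, NamedTuple


class Paper(NamedTuple):
    discipline: str  # hypothetical discipline label, e.g. "Computer Science"
    citations: int   # citation count of the paper


def h_index(citation_counts: Iterable[int]) -> int:
    """Return the largest h such that at least h papers have >= h citations."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            break
    return h


def d_index(papers: Iterable[Paper], discipline: str) -> int:
    """h-index restricted to papers in one discipline (assumed definition)."""
    return h_index(p.citations for p in papers if p.discipline == discipline)


# Illustrative data only; not taken from the profile.
papers = [
    Paper("Computer Science", 10),
    Paper("Computer Science", 4),
    Paper("Computer Science", 3),
    Paper("Computer Science", 1),
    Paper("Biology", 50),
    Paper("Biology", 6),
]

general = h_index(p.citations for p in papers)
cs_only = d_index(papers, "Computer Science")
print(general, cs_only)
```

With this sample, the general index counts the Biology papers as well, so it comes out higher than the index restricted to Computer Science.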

Overview

What is he best known for?

The fields of study he is best known for:

  • Database
  • Programming language
  • Artificial intelligence

Data mining, Uncertain data, Relational database, Lineage and Database are his primary areas of study. In the field of Data mining, his study on Data integration overlaps with subjects such as Noise. In his research, Information retrieval is intimately related to IDEF1X, which falls under the overarching field of Data integration.

In his research on the topic of Uncertain data, Theoretical computer science is strongly related to Probabilistic database. His study in Theoretical computer science is interdisciplinary in nature, drawing from both Directed acyclic graph and Graph. He is doing research in Data lineage, Query language and SQL, all of which are found in Database.

His most cited works include:

  • Detecting near-duplicates for web crawling (466 citations)
  • ULDBs: databases with uncertainty and lineage (454 citations)
  • Trio: a system for data, uncertainty, and lineage (333 citations)

What are the main themes of his work throughout his whole career to date?

Anish Das Sarma spends much of his time researching Data mining, Theoretical computer science, Information retrieval, Data integration and Probabilistic logic. Uncertain data is the focus of his Data mining research. His Uncertain data study integrates concerns from other disciplines, such as Closure, Relational database, Database, Representation and Probabilistic database.

The various areas that Anish Das Sarma examines in his Theoretical computer science study include Data deduplication, Hash function, Approximation algorithm and Fuzzy logic. Anish Das Sarma interconnects Data mapping, Metadata and Data warehouse in the investigation of issues within Information retrieval. His Data integration research integrates issues from Functional dependency, Scalability, Set and Data science.

He most often published in these fields:

  • Data mining (28.40%)
  • Theoretical computer science (22.22%)
  • Information retrieval (20.99%)

What were the highlights of his more recent work (between 2012 and 2015)?

  • Theoretical computer science (22.22%)
  • Algorithm (7.41%)
  • Information retrieval (20.99%)

In recent papers he was focusing on the following fields of study:

Anish Das Sarma focuses on Theoretical computer science, Algorithm, Information retrieval, Set and Data mining. His Theoretical computer science research is multidisciplinary, incorporating perspectives in Programming language, Approximation algorithm and Integer programming. His Algorithm study incorporates themes from Discrete mathematics and Joins.

His study in Information retrieval is interdisciplinary in nature, drawing from Event, Relation and Data deduplication. His research in Set intersects with topics in Relational operator and Relational algebra. His Data mining research is multidisciplinary, incorporating elements of Tree traversal, Search engine indexing and Rendering.

Between 2012 and 2015, his most popular works were:

  • Upper and lower bounds on the cost of a map-reduce computation (115 citations)
  • Finding connected components in map-reduce in logarithmic rounds (105 citations)
  • Fusing data with correlations (102 citations)

This overview was generated by a machine learning system which analysed the scientist’s body of work.

Best Publications

Detecting near-duplicates for web crawling

Gurmeet Singh Manku;Arvind Jain;Anish Das Sarma.
the web conference (2007)

807 Citations

ULDBs: databases with uncertainty and lineage

Omar Benjelloun;Anish Das Sarma;Alon Halevy;Jennifer Widom.
very large data bases (2006)

704 Citations

Working Models for Uncertain Data

A.D. Sarma;O. Benjelloun;A. Halevy;J. Widom.
international conference on data engineering (2006)

461 Citations

Trio: a system for data, uncertainty, and lineage

Parag Agrawal;Omar Benjelloun;Anish Das Sarma;Chris Hayworth.
very large data bases (2006)

417 Citations

Bootstrapping pay-as-you-go data integration systems

Anish Das Sarma;Xin Dong;Alon Halevy.
international conference on management of data (2008)

345 Citations

Databases with uncertainty and lineage

Omar Benjelloun;Anish Das Sarma;Alon Halevy;Martin Theobald.
very large data bases (2008)

245 Citations

Human-assisted graph search: it's okay to ask questions

Aditya Parameswaran;Anish Das Sarma;Hector Garcia-Molina;Neoklis Polyzotis.
very large data bases (2011)

179 Citations

Upper and lower bounds on the cost of a map-reduce computation

Anish Das Sarma;Foto N. Afrati;Semih Salihoglu;Jeffrey D. Ullman.
very large data bases (2013)

177 Citations

Finding related tables

Anish Das Sarma;Lujun Fang;Nitin Gupta;Alon Halevy.
international conference on management of data (2012)

162 Citations

Fusing data with correlations

Ravali Pochampally;Anish Das Sarma;Xin Luna Dong;Alexandra Meliou.
international conference on management of data (2014)

144 Citations

Best Scientists Citing Anish Das Sarma

Lei Chen
Hong Kong University of Science and Technology
Publications: 54

Dan Suciu
University of Washington
Publications: 44

Christopher Ré
Stanford University
Publications: 33

Divesh Srivastava
AT&T (United States)
Publications: 31

Xuemin Lin
University of New South Wales
Publications: 30

Charu C. Aggarwal
IBM (United States)
Publications: 27

Val Tannen
University of Pennsylvania
Publications: 27

Reynold Cheng
University of Hong Kong
Publications: 24

Dan Olteanu
University of Zurich
Publications: 23

Christoph Koch
École Polytechnique Fédérale de Lausanne
Publications: 23

Aditya Parameswaran
University of California, Berkeley
Publications: 22

Tova Milo
Tel Aviv University
Publications: 19

Zachary G. Ives
University of Pennsylvania
Publications: 18

Renée J. Miller
University of Toronto
Publications: 18

Gerhard Weikum
Max Planck Institute for Informatics
Publications: 17

Jeffrey D. Ullman
Stanford University
Publications: 16
