D-Index & Metrics Best Publications

D-Index & Metrics D-index (Discipline H-index) only includes papers and citation values for an examined discipline in contrast to General H-index which accounts for publications across all disciplines.

Discipline name D-index D-index (Discipline H-index) only includes papers and citation values for an examined discipline in contrast to General H-index which accounts for publications across all disciplines. Citations Publications World Ranking National Ranking
Computer Science D-index 67 Citations 30,964 332 World Ranking 1352 National Ranking 772

Research.com Recognitions

Awards & Achievements

2012 - Fellow of Alfred P. Sloan Foundation

Overview

What is he best known for?

The fields of study he is best known for:

  • Artificial intelligence
  • Internal medicine
  • Surgery

His main research concerns Reinforcement learning, Artificial intelligence, Markov decision process, Mathematical optimization and Machine learning. The Q-learning research Satinder Singh does as part of his general Reinforcement learning study is frequently linked to other disciplines of science, such as Function, therefore creating a link between diverse domains of science. His Artificial intelligence research includes elements of Domain, Predictive state representation and Set.

His research in Markov decision process focuses on subjects like Object, which are connected to Variety, Action, Hierarchy and Interface. The various areas that he examines in his Mathematical optimization study include Partially observable Markov decision process, Stochastic game, Markov process and Markov chain. His study on Transfer of learning is often connected to Sketch as part of broader study in Machine learning.

His most cited work include:

  • Policy Gradient Methods for Reinforcement Learning with Function Approximation (3144 citations)
  • Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning (1998 citations)
  • Learning to act using real-time dynamic programming (969 citations)

What are the main themes of his work throughout his whole career to date?

Satinder Singh spends much of his time researching Artificial intelligence, Reinforcement learning, Internal medicine, Mathematical optimization and Cardiology. His work carried out in the field of Artificial intelligence brings together such families of science as Action, Domain, Machine learning and Set. His study in Reinforcement learning is interdisciplinary in nature, drawing from both Algorithm, Set and Human–computer interaction.

His study brings together the fields of Markov decision process and Mathematical optimization. Partially observable Markov decision process is the focus of his Markov decision process research.

He most often published in these fields:

  • Artificial intelligence (20.46%)
  • Reinforcement learning (18.01%)
  • Internal medicine (9.22%)

What were the highlights of his more recent work (between 2015-2021)?

  • Reinforcement learning (18.01%)
  • Artificial intelligence (20.46%)
  • Internal medicine (9.22%)

In recent papers he was focusing on the following fields of study:

Reinforcement learning, Artificial intelligence, Internal medicine, Mathematical optimization and Surgery are his primary areas of study. His biological study spans a wide range of topics, including Set, Human–computer interaction, Set, Value and Generalization. Set is closely attributed to Algorithm in his work.

His Artificial neural network study in the realm of Artificial intelligence interacts with subjects such as Simple. The study of Internal medicine is intertwined with the study of Cardiology in a number of ways. His research on Mathematical optimization frequently links to adjacent areas such as Markov decision process.

Between 2015 and 2021, his most popular works were:

  • Control of memory, active perception, and action in minecraft (144 citations)
  • Value Prediction Network (109 citations)
  • Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning (68 citations)

In his most recent research, the most cited papers focused on:

  • Artificial intelligence
  • Internal medicine
  • Surgery

Satinder Singh mostly deals with Reinforcement learning, Artificial intelligence, Machine learning, Contrast and Algorithm. Satinder Singh integrates several fields in his works, including Reinforcement learning and Learning environment. His studies deal with areas such as Baseline and Adaptation as well as Artificial intelligence.

His Machine learning study combines topics from a wide range of disciplines, such as Epilepsy, Seizure detection, Seizure types, Generalizability theory and Multi-task learning. Satinder Singh works mostly in the field of Contrast, limiting it down to topics relating to Set and, in certain cases, Control, Submodular set function, Greedy algorithm, Logarithm and Maximization. His research investigates the connection between Algorithm and topics such as Set that intersect with problems in Model selection, Exponential function, Focus and Polynomial.

This overview was generated by a machine learning system which analysed the scientist’s body of work. If you have any feedback, you can contact us here.

Best Publications

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Richard S Sutton;David A. McAllester;Satinder P. Singh;Yishay Mansour.
neural information processing systems (1999)

5032 Citations

Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Richard S. Sutton;Doina Precup;Satinder Singh.
Artificial Intelligence (1999)

3278 Citations

Learning to act using real-time dynamic programming

Andrew G. Barto;Steven J. Bradtke;Satinder P. Singh.
Artificial Intelligence (1995)

1588 Citations

Near-Optimal Reinforcement Learning in Polynomial Time

Michael Kearns;Satinder Singh.
Machine Learning (2002)

1129 Citations

The national lung screening trial: Overview and study design

Constantine A. Gatsonis;Denise R. Aberle;Christine D. Berg;William C. Black.
Radiology (2011)

1111 Citations

Convergence of Stochastic Iterative Dynamic Programming Algorithms

Tommi Jaakkola;Michael I. Jordan;Satinder P. Singh.
neural information processing systems (1993)

1090 Citations

Reinforcement learning with replacing eligibility traces

Satinder P. Singh;Richard S. Sutton.
Machine Learning (1996)

938 Citations

Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Satinder Singh;Tommi Jaakkola;Michael L. Littman;Csaba Szepesvári.
Machine Learning (2000)

863 Citations

Intrinsically Motivated Reinforcement Learning

Nuttapong Chentanez;Andrew G. Barto;Satinder P. Singh.
neural information processing systems (2004)

810 Citations

Action-conditional video prediction using deep networks in Atari games

Junhyuk Oh;Xiaoxiao Guo;Honglak Lee;Richard Lewis.
neural information processing systems (2015)

688 Citations

If you think any of the details on this page are incorrect, let us know.

Contact us

Best Scientists Citing Satinder Singh

Peter Stone

Peter Stone

The University of Texas at Austin

Publications: 114

Sergey Levine

Sergey Levine

University of California, Berkeley

Publications: 104

Michael L. Littman

Michael L. Littman

Brown University

Publications: 96

Andrew G. Barto

Andrew G. Barto

University of Massachusetts Amherst

Publications: 82

Jan Peters

Jan Peters

Technical University of Darmstadt

Publications: 81

Pieter Abbeel

Pieter Abbeel

University of California, Berkeley

Publications: 81

Shie Mannor

Shie Mannor

Technion – Israel Institute of Technology

Publications: 73

Joelle Pineau

Joelle Pineau

McGill University

Publications: 70

Lihong Li

Lihong Li

Amazon (United States)

Publications: 70

Matthew E. Taylor

Matthew E. Taylor

University of Alberta

Publications: 68

Richard S. Sutton

Richard S. Sutton

University of Alberta

Publications: 66

Shimon Whiteson

Shimon Whiteson

University of Oxford

Publications: 64

Doina Precup

Doina Precup

McGill University

Publications: 64

Jürgen Schmidhuber

Jürgen Schmidhuber

Universita della Svizzera Italiana

Publications: 60

Leslie Pack Kaelbling

Leslie Pack Kaelbling

MIT

Publications: 57

Pierre-Yves Oudeyer

Pierre-Yves Oudeyer

French Institute for Research in Computer Science and Automation - INRIA

Publications: 54

Trending Scientists

Philippe A. Tanguy

Philippe A. Tanguy

Polytechnique Montréal

Jan H. Teuben

Jan H. Teuben

University of Groningen

Ke Wang

Ke Wang

Tsinghua University

Xiao Wang

Xiao Wang

Chinese Academy of Sciences

Seung-Boo Jung

Seung-Boo Jung

Sungkyunkwan University

Adam Kondorosi

Adam Kondorosi

Centre national de la recherche scientifique, CNRS

Mario Tosi

Mario Tosi

University of Rouen

Ottar N. Bjørnstad

Ottar N. Bjørnstad

Pennsylvania State University

Eric N. Ponnampalam

Eric N. Ponnampalam

University of Melbourne

Richard D. McPeters

Richard D. McPeters

Goddard Space Flight Center

Ronald M. Welch

Ronald M. Welch

University of Alabama in Huntsville

Hitoshi Irie

Hitoshi Irie

Chiba University

Mary A. Peterson

Mary A. Peterson

University of Arizona

Felix Eckstein

Felix Eckstein

Paracelsus Medical University

Timothy Shanahan

Timothy Shanahan

University of Illinois at Chicago

Bernhard Kliem

Bernhard Kliem

University of Potsdam

Something went wrong. Please try again later.