2026 Shixiang Gu: Computer Science Researcher – H-Index, Publications & Awards

Discipline name	D-Index	World Ranking	Current World Ranking	National Ranking	Current National Ranking	Publications	Citations
Rising Stars	34	862	859	140	139	58	15266
Computer Science	34	11875	11533	4841	4628	57	14129

Research.com Recognitions

2025 - Research.com Rising Stars Award

Overview

Shixiang Gu is a researcher affiliated with Google in the United States. Their work primarily spans the fields of Computer Science and Engineering, with a particular focus on Artificial Intelligence, Computer Vision and Pattern Recognition, Control and Systems Engineering, and Cognitive Neuroscience.

The main topics addressed in their research include Reinforcement Learning in Robotics, Robot Manipulation and Learning, Topic Modeling, Domain Adaptation and Few-Shot Learning, Multimodal Machine Learning Applications, and Modular Robots and Swarm Intelligence. These topics reflect a strong emphasis on the intersection of machine learning techniques and robotic systems.

Shixiang Gu has published extensively, with 34 papers on arXiv (Cornell University), alongside works in Transactions of the Japanese Society for Artificial Intelligence, Advanced Robotics, IEEE Robotics and Automation Letters, and Journal of the Robotics Society of Japan.

Recent representative papers include:

Scaling Instruction-Finetuned Language Models, 2022, arXiv (Cornell University)
Large Language Models are Zero-Shot Reasoners, 2022, arXiv (Cornell University)
A Minimalist Approach to Offline Reinforcement Learning, 2021, arXiv (Cornell University)
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization, 2020, arXiv (Cornell University)
Aligning Text-to-Image Models using Human Feedback, 2023, arXiv (Cornell University)

Their frequent co-authors include Yutaka Matsuo, Tatsuya Matsushima, Hiroki Furuta, and Yusuke Iwasawa, reflecting collaborative efforts in advancing research in AI and robotics.

Best Publications

Categorical Reparameterization with Gumbel-Softmax

Eric Jang;Shixiang Gu;Ben Poole

4808
Citations
Scaling Instruction-Finetuned Language Models

Unknown

2369
Citations
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Shixiang Gu;Ethan Holly;Timothy Lillicrap;Sergey Levine

1637
Citations
Continuous deep Q-learning with model-based acceleration

Shixiang Gu;Timothy Lillicrap;Ilya Sutskever;Sergey Levine

889
Citations
Towards Deep Neural Network Architectures Robust to Adversarial Examples

Shixiang Gu;Luca Rigazio

772
Citations
Data-Efficient Hierarchical Reinforcement Learning

Ofir Nachum;Shixiang Gu;Honglak Lee;Sergey Levine

574
Citations
Large Language Models Can Self-Improve

Unknown

203
Citations
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

Natasha Jaques;Asma Ghandeharioun;Judy Hanwen Shen;Craig Ferguson

191
Citations
Q-Prop: Sample-efficient policy gradient with an off-policy critic

Shixiang Gu;Timothy Lillicrap;Zoubin Ghahramani;Richard Eric Turner

188
Citations
Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Vitchyr Pong;Shixiang Gu;Murtaza Dalal;Sergey Levine

180
Citations
MuProp: Unbiased Backpropagation for Stochastic Neural Networks

Shixiang Gu;Sergey Levine;Ilya Sutskever;Andriy Mnih

137
Citations
Dynamics-Aware Unsupervised Discovery of Skills

Archit Sharma;Shixiang Gu;Sergey Levine;Vikash Kumar

127
Citations
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning

Shixiang Gu;Timothy P. Lillicrap;Zoubin Ghahramani;Richard E. Turner

126
Citations
Sequence tutor: conservative fine-tuning of sequence generation models with KL-control

Natasha Jaques;Shixiang Gu;Dzmitry Bahdanau;José Miguel Hernández-Lobato

118
Citations
A Divergence Minimization Perspective on Imitation Learning Methods

Seyed Kamyar Seyed Ghasemipour;Richard S. Zemel;Shixiang Gu

115
Citations
Tuning Recurrent Neural Networks with Reinforcement Learning

Natasha Jaques;Shixiang Gu;Richard E. Turner;Douglas Eck

112
Citations
Neural adaptive sequential Monte Carlo

Shixiang Gu;Zoubin Ghahramani;Richard E. Turner

112
Citations
Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

Ofir Nachum;Shixiang Gu;Honglak Lee;Sergey Levine

110
Citations
Aligning Text-to-Image Models using Human Feedback

Unknown

108
Citations
Language as an Abstraction for Hierarchical Deep Reinforcement Learning

YiDing Jiang;Shixiang Gu;Kevin P. Murphy;Chelsea Finn

107
Citations
Categorical Reparameterization with Gumbel-Softmax

Eric Jang;Shixiang Gu;Ben Poole

105
Citations
The Mirage of Action-Dependent Baselines in Reinforcement Learning

George Tucker;Surya Bhupatiraju;Shixiang Gu;Richard E. Turner

88
Citations
Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog

Natasha Jaques;Asma Ghandeharioun;Judy Hanwen Shen;Craig Ferguson

6
Citations

Frequent Co-Authors

Sergey Levine University of California, Berkeley

Richard E. Turner University of Cambridge

Timothy P. Lillicrap University College London

Zoubin Ghahramani University of Cambridge

Vikash Kumar University of Washington

Honglak Lee University of Michigan–Ann Arbor

Ilya Sutskever OpenAI

George Tucker Google (United States)

Yutaka Matsuo University of Tokyo

Douglas Eck Google (United States)

