D-Index & Metrics

Computer Science

D-Index: 45
Citations: 160798
World Ranking: 6959
National Ranking: 3040

Overview

Vincent Vanhoucke is affiliated with Google in the United States and conducts research primarily in the fields of Computer Science and Engineering. Their scholarly output includes 29 publications in Computer Science and 13 in Engineering. Within these areas, their work notably focuses on subfields such as Computer Vision and Pattern Recognition, Artificial Intelligence, Control and Systems Engineering, Computer Graphics and Computer-Aided Design, and Computational Mechanics.

The primary topics covered by Vincent Vanhoucke's research include Multimodal Machine Learning Applications, Robot Manipulation and Learning, Reinforcement Learning in Robotics, Topic Modeling, Computer Graphics and Visualization Techniques, 3D Shape Modeling and Analysis, and Robotics and Sensor-Based Localization.

Vincent Vanhoucke's work appears most frequently in the following venues:

  • arXiv (Cornell University)
  • 2022 International Conference on Robotics and Automation (ICRA)
  • 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
  • 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Selected recent papers authored or coauthored by Vincent Vanhoucke illustrate their research interests and include:

  • "Do As I Can, Not As I Say: Grounding Language in Robotic Affordances" (2022), published on arXiv (Cornell University)
  • "PaLM-E: An Embodied Multimodal Language Model" (2023), published on arXiv (Cornell University)
  • "RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control" (2023), published on arXiv (Cornell University)
  • "Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items" (2022), published in the 2022 International Conference on Robotics and Automation (ICRA)
  • "Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language" (2022), published on arXiv (Cornell University)

Vincent Vanhoucke collaborates with several frequent coauthors, including:

  • Yuheng Kuang
  • Krista Reymann
  • Ken Goldberg
  • Krzysztof Choromański
  • Pannag Sanketi

Best Publications

  • Going deeper with convolutions

    Christian Szegedy;Wei Liu;Yangqing Jia;Pierre Sermanet

  • Rethinking the Inception Architecture for Computer Vision

    Christian Szegedy;Vincent Vanhoucke;Sergey Ioffe;Jon Shlens

  • Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

    Christian Szegedy;Sergey Ioffe;Vincent Vanhoucke;Alexander A Alemi

  • Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

    G. Hinton;Li Deng;Dong Yu;G. E. Dahl

  • TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

    Martín Abadi;Ashish Agarwal;Paul Barham;Eugene Brevdo

  • Deep Neural Networks for Acoustic Modeling in Speech Recognition

    Geoffrey Hinton;Li Deng;Dong Yu;George Dahl

  • QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

    Dmitry Kalashnikov;Alex Irpan;Peter Pastor;Julian Ibarz

  • Improving the speed of neural networks on CPUs

    Vincent Vanhoucke;Andrew Senior;Mark Z. Mao

  • PaLM-E: An Embodied Multimodal Language Model

    Unknown

  • Sim-to-Real: Learning Agile Locomotion For Quadruped Robots

    Jie Tan;Tingnan Zhang;Erwin Coumans;Atil Iscen

  • On rectified linear units for speech processing

    M. D. Zeiler;M. Ranzato;R. Monga;M. Mao

  • Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping

    Konstantinos Bousmalis;Alex Irpan;Paul Wohlhart;Yunfei Bai

  • YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video

    Esteban Real;Jonathon Shlens;Stefano Mazzocchi;Xin Pan

  • System and method for enabling the use of captured images through recognition

    Salih Burak Gokturk;Dragomir Anguelov;Vincent Vanhoucke;Kuang-chih Lee

  • System and method for enabling search and retrieval from image files based on recognized information

    Salih Burak Gokturk;Dragomir Anguelov;Vincent Vanhoucke;Kuang-chih Lee

  • Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

    Unknown

  • RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

    Unknown

  • Multilingual acoustic models using distributed deep neural networks

    G. Heigold;V. Vanhoucke;A. Senior;P. Nguyen

  • Application Of Pretrained Deep Neural Networks To Large Vocabulary Speech Recognition

    Navdeep Jaitly;Patrick Nguyen;Andrew W. Senior;Vincent Vanhoucke

  • System and method for enabling image recognition and searching of images

    Salih Burak Gokturk;Baris Sumengen;Diem Vu;Navneet Dalal

  • Image recognition system for use in analysing images of objects and applications thereof

    Salih Burak Gokturk;Baris Sumengen;Diem Vu;Navneet Dalal

  • System and method for search portions of objects in images and features thereof

    Salih Burak Gokturk;Baris Sumengen;Diem Vu;Navneet Dalal

  • The shared views of four research groups

    Geoffrey Hinton;Li Deng;Dong Yu;George E. Dahl

Frequent Co-Authors

  • Andrew W. Senior, Google (United States)
  • Patrick Nguyen, Google (United States)
  • Christian Szegedy, Google (United States)
  • Georg Heigold, German Research Centre for Artificial Intelligence
  • Sergey Levine, University of California, Berkeley
  • Anelia Angelova, Google (United States)
  • Navdeep Jaitly, Google (United States)
  • Jonathon Shlens, Google (United States)
  • Geoffrey E. Hinton, University of Toronto


