D-Index & Metrics

Computer Science

D-Index: 32
Citations: 8152
World Ranking: 12897
National Ranking: 5203

Best Publications

  • UNITER: UNiversal Image-TExt Representation Learning

    Yen-Chun Chen;Linjie Li;Licheng Yu;Ahmed El Kholy

  • Modeling Context in Referring Expressions

    Licheng Yu;Patrick Poirson;Shan Yang;Alexander C. Berg

  • MAttNet: Modular Attention Network for Referring Expression Comprehension

    Licheng Yu;Zhe Lin;Xiaohui Shen;Jimei Yang

  • TVQA: Localized, Compositional Video Question Answering

    Jie Lei;Licheng Yu;Mohit Bansal;Tamara L. Berg

  • HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training

    Linjie Li;Yen-Chun Chen;Yu Cheng;Zhe Gan

  • Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout

    Hao Tan;Licheng Yu;Mohit Bansal

  • UNITER: Learning UNiversal Image-TExt Representations

    Yen-Chun Chen;Linjie Li;Licheng Yu;Ahmed El Kholy

  • A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

    Licheng Yu;Hao Tan;Mohit Bansal;Tamara L. Berg

  • TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

    Jie Lei;Licheng Yu;Tamara L. Berg;Mohit Bansal

  • TVQA+: Spatio-Temporal Grounding for Video Question Answering

    Jie Lei;Licheng Yu;Tamara L. Berg;Mohit Bansal

  • Visual Madlibs: Fill in the Blank Description Generation and Question Answering

    Licheng Yu;Eunbyung Park;Alexander C. Berg;Tamara L. Berg

  • Vector Sparse Representation of Color Image Using Quaternion Matrix Analysis

    Unknown

  • Visual Madlibs: Fill in the blank Image Generation and Question Answering

    Licheng Yu;Eunbyung Park;Alexander C. Berg;Tamara L. Berg

  • Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

    Jize Cao;Zhe Gan;Yu Cheng;Licheng Yu

  • Physics-Inspired Garment Recovery from a Single-View Image

    Shan Yang;Zherong Pan;Tanya Amert;Ke Wang

  • Multi-Target Embodied Question Answering

    Licheng Yu;Xinlei Chen;Georgia Gkioxari;Mohit Bansal

  • Hierarchically-Attentive RNN for Album Summarization and Storytelling

    Licheng Yu;Mohit Bansal;Tamara L. Berg

  • Violin: A Large-Scale Dataset for Video-and-Language Inference

    Jingzhou Liu;Wenhu Chen;Yu Cheng;Zhe Gan
