D-Index & Metrics

Computer Science

D-Index: 34
Citations: 8599
World Ranking: 11927
National Ranking: 4871

Best Publications

  • Dynamic Convolution: Attention Over Convolution Kernels

    Yinpeng Chen;Xiyang Dai;Mengchen Liu;Dongdong Chen

  • Dynamic Head: Unifying Object Detection Heads with Attentions

    Xiyang Dai;Yinpeng Chen;Bin Xiao;Dongdong Chen

  • RegionCLIP: Region-based Language-Image Pretraining

    Unknown

  • Mobile-Former: Bridging MobileNet and Transformer

    Yinpeng Chen;Xiyang Dai;Dongdong Chen;Mengchen Liu

  • Dynamic DETR: End-to-End Object Detection With Dynamic Attention

    Xiyang Dai;Yinpeng Chen;Jianwei Yang;Pengchuan Zhang

  • MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment

    Da Zhang;Xiyang Dai;Xin Wang;Yuan-Fang Wang

  • Florence: A New Foundation Model for Computer Vision

    Lu Yuan;Dongdong Chen;Yi-Ling Chen;Noel Codella

  • Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding

    Pengchuan Zhang;Xiyang Dai;Jianwei Yang;Bin Xiao

  • Temporal Context Network for Activity Localization in Videos

    Xiyang Dai;Bharat Singh;Guyue Zhang;Larry S. Davis

  • GLIPv2: Unifying Localization and Vision-Language Understanding

    Unknown

  • Focal Self-attention for Local-Global Interactions in Vision Transformers

    Jianwei Yang;Chunyuan Li;Pengchuan Zhang;Xiyang Dai

  • Generalized Decoding for Pixel, Image, and Language

    Unknown

  • CvT: Introducing Convolutions to Vision Transformers

    Haiping Wu;Bin Xiao;Noel Codella;Mengchen Liu

  • BEVT: BERT Pretraining of Video Transformers

    Rui Wang;Dongdong Chen;Zuxuan Wu;Yinpeng Chen

  • Dynamic ReLU

    Yinpeng Chen;Xiyang Dai;Mengchen Liu;Dongdong Chen

  • Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

    Unknown

  • Reduce Information Loss in Transformers for Pluralistic Image Inpainting

    Unknown

  • Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning

    Unknown
