D-Index & Metrics

Computer Science

D-Index: 35
Citations: 22778
World Ranking: 11415
National Ranking: 1414

Best Publications

  • Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions

    Wenhai Wang;Enze Xie;Xiang Li;Deng-Ping Fan

  • Selective Kernel Networks

    Xiang Li;Wenhai Wang;Xiaolin Hu;Jian Yang

  • PVTv2: Improved Baselines with Pyramid Vision Transformer

    Wenhai Wang;Enze Xie;Xiang Li;Deng-Ping Fan

  • BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

    Unknown

  • Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection

    Xiang Li;Wenhai Wang;Lijun Wu;Shuo Chen

  • InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

    Unknown

  • SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

    Enze Xie;Wenhai Wang;Zhiding Yu;Anima Anandkumar

  • PolarMask: Single Shot Instance Segmentation With Polar Representation

    Enze Xie;Peize Sun;Xiaoge Song;Wenhai Wang

  • Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

    Unknown

  • Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection

    Xiang Li;Wenhai Wang;Xiaolin Hu;Jun Li

  • DetCo: Unsupervised Contrastive Learning for Object Detection

    Enze Xie;Jian Ding;Wenhai Wang;Xiaohang Zhan

  • InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

    Unknown

  • VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

    Unknown

  • Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

    Unknown

  • Generalized Focal Loss: Towards Efficient Representation Learning for Dense Object Detection

    Unknown

  • VideoChat: Chat-Centric Video Understanding

    Unknown

  • Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

    Unknown
