Vision transformers — citation graph of key research

Explore the most influential research on vision transformers as an interactive citation graph on Constellation. The papers below are connected by direct citations and shared references — open any one to center the graph on it and discover related work.

Top papers on vision transformers

PVT v2: Improved baselines with pyramid vision transformer
Wenhai Wang, Enze Xie, Xiang Li et al. — 2022 · Computational Visual Media · 2,215 citations
Vision Transformers for Single Image Dehazing
Yuda Song, Zhuqing He, Hui Qian et al. — 2023 · IEEE Transactions on Image Processing · 1,034 citations
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov et al. — 2020 · arXiv (Cornell University) · 21,701 citations
Vision Transformers for Remote Sensing Image Classification
Yakoub Bazi, Laila Bashmal, Mohamad Mahmoud Al Rahhal et al. — 2021 · Remote Sensing · 613 citations
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
Xiangxiang Chu, Zhi Tian, Yuqing Wang et al. — 2021 · arXiv (Cornell University) · 617 citations
ConViT: improving vision transformers with soft convolutional inductive biases
Stéphane d’Ascoli, Hugo Touvron, Matthew L. Leavitt et al. — 2022 · Journal of Statistical Mechanics Theory and Experiment · 716 citations
Conditional Positional Encodings for Vision Transformers
Xiangxiang Chu, Zhi Tian, Bo Zhang et al. — 2021 · arXiv (Cornell University) · 407 citations
A survey of the vision transformers and their CNN-transformer based variants
Asifullah Khan, Zunaira Rauf, Anabia Sohail et al. — 2023 · Artificial Intelligence Review · 384 citations
ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis
Onat Dalmaz, Mahmut Yurt, Tolga Çukur — 2022 · IEEE Transactions on Medical Imaging · 548 citations
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu, Yutong Lin, Yue Cao et al. — 2021 · arXiv (Cornell University) · 399 citations
Comparing Vision Transformers and Convolutional Neural Networks for Image Classification: A Literature Review
José Maurício, Inês Domingues, Jorge Bernardino — 2023 · Applied Sciences · 546 citations
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou, Bingyi Kang, Xiaojie Jin et al. — 2021 · arXiv (Cornell University) · 349 citations

Constellation — Explore research as a citation graph

Constellation maps the world's scientific literature as an interactive graph. Search any research topic or paste a paper's DOI or arXiv link, and see how works connect through citations and shared references. Discover the sub-fields of an area, tell foundational work from the frontier, follow citation trails, and generate a synthesis of the landscape — across every discipline, powered by OpenAlex.
