An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Work
Year: 2020
Type: preprint
Abstract: While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in...
Source: arXiv (Cornell University)
Authors: Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai +7 more
Cites: 53
Cited by: 13,880
Related to: 10
Citation percentile (by year/subfield): 99.99
Subfield: Computer Vision and Pattern Recognition
Field: Computer Science
Domain: Physical Sciences
Sustainable Development Goal: Quality education
Open Access status: green