The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Work
Year: 2023
Type: preprint
Abstract: Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory skills, such as visual understanding, to achieve stronger generic intelligence. In this paper, we analyze the late...
Source: arXiv (Cornell University)
Cites:
Cited by: 117
Related to: 10
Citation percentile (by year/subfield): 99.99
Subfield: Computer Vision and Pattern Recognition
Field: Computer Science
Domain: Physical Sciences
Sustainable Development Goal: Quality education
Open Access status: green