OpenAlex | The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

Work

Year: 2023

Type: preprint

Abstract: Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory skills, such as visual understanding, to achieve stronger generic intelligence. In this paper, we analyze the late... more

Source: arXiv (Cornell University)

Authors Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin +2 more

Cites:

Cited by: 117

Related to: 10

Citation percentile (by year/subfield): 99.99

Topic: Multimodal Machine Learning Applications

Subfield: Computer Vision and Pattern Recognition

Field: Computer Science

Domain: Physical Sciences

Sustainable Development Goal Quality education

Open Access status: green