Alibaba Cloud Launches Open-Source Large Vision Language Model with Image Comprehension Capability


Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, has launched two open-source large vision language models (LVLMs): Qwen-VL and its conversationally fine-tuned counterpart, Qwen-VL-Chat. Qwen-VL is the multimodal version of Qwen-7B, the 7-billion-parameter version of Alibaba Cloud’s large language model Tongyi Qianwen, which is also available as open source on ModelScope. For commercial use, companies with over 100 million monthly active users can request a licence from Alibaba Cloud. According to Alibaba Cloud’s benchmark tests, Qwen-VL-Chat has achieved leading results in both Chinese and English on text-image dialogue and on alignment with humans. Earlier this month, Alibaba Cloud open-sourced its 7-billion-parameter LLMs, Qwen-7B and Qwen-7B-Chat, as part of its ongoing contribution to the open-source community.