On August 25, Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Qwen-7B, Alibaba Cloud’s 7-billion-parameter foundational language model. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios, including image- and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on ModelScope, Alibaba’s “Model as a Service” platform. [Alibaba Cloud statement, in Chinese]