On August 25,vintage eroticism films tubes Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
2025-06-26 23:46
2301 views
Trump who? Tech giants join massive effort to uphold Paris Agreement
U.S. tech titans are joining an effort by more than 1,000 U.S. governors, mayors, investors, univers
Read More
2025-06-26 23:42
123 views
Gallup/Knight poll: Americans agree on pros and cons of social media — except for one crucial issue
A poll published Thursday by Gallup and the John S. and James L. Knight Foundation finds that Democr
Read More
2025-06-26 22:46
1589 views
Halloween pet costumes at Walmart
The following content is brought to you by Mashable partners. If you buy a product or service featur
Read More