On August 25,Animation Archives Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
2025-06-26 05:49
2857 views
Today's Hurdle hints and answers for May 12, 2025
If you like playing daily word games like Wordle, then Hurdle is a great game to add to your routine
Read More
2025-06-26 05:33
965 views
'Game of Thrones' Season 8
No Game of Thronesin 2018 means that the speculation around the show's eighth and final season will
Read More
2025-06-26 05:27
488 views
'Rick and Morty' creator lives the joke, scalps Nintendo 3DS
Remember that one weirdly specific joke from Rick and Mortywhere Rick plots a money-making scheme of
Read More