On August 25,Chingari Chaubey (2023) S02 Hindi Web Series Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
(Editor: {typename type="name"/})
Norrie vs. Diallo 2025 livestream: Watch Madrid Open for free
This startup wants to deliver affordable contact lenses straight to your door
Apple Pay is coming to major US transit systems this year
Shop the Google Pixel Pro 9 for $200 off at Amazon
Study finds racial discrimination by Uber and Lyft drivers
Apple Card is a digital credit card, but there's also an IRL titanium version
Why Apple should allow its new services on Android and Windows
Creator job opportunities grew 7x in recent years [April 2025]
'Avengers: Endgame' poster sparks photoshop tributes to all our fallen non
Boeing's new VR simulator immerses astronauts in space training
Microsoft gains control of domains used by Iranian hackers linked to U.S. fugitive
接受PR>=1、BR>=1,流量相当,内容相关类链接。