Alibaba Cloud has developed two open-source artificial intelligence models that can interpret information in images as well as in text.
The models are called Qwen-VL and Qwen-VL-Chat. Both were trained on top of Qwen-7B, Alibaba Cloud’s large language model. According to the developers, the new models perform strongly, compared with other LLMs, at recognizing images and understanding the meaning of their content. The company announced the models in a press release.
The new AI models reflect Alibaba Cloud’s commitment to developing multimodal capabilities. By working with sensory data, including audio recordings and images, the company aims to enable new applications for researchers and commercial organizations.
The company’s press release notes that the new models have the potential to transform how users interact with visual content. They can generate photo captions for news agencies and help people who do not read Chinese understand street signs in that language. The models also support visual question answering, that is, answering questions about the content of an image. According to the developers, this feature will greatly simplify shopping for blind and visually impaired consumers.
Taobao, Alibaba Group’s online marketplace, has already integrated optical character recognition technology, which helps people with visual impairments read text.
Alibaba Cloud’s previous large language models, Qwen-7B and Qwen-7B-Chat, were launched a month ago and have since been downloaded more than 400,000 times. They were intended to help developers, researchers, and commercial organizations build their own generative AI models.
Two weeks ago, Alibaba reported that revenue at its cloud business grew by 4% in the second quarter of 2023.
As we reported earlier, AI startup Hugging Face raised $235 million.