Baidu Unveils New ERNIE 4.5-VL Multimodal AI Model

Baidu has officially released its latest multimodal AI model, ERNIE-4.5-VL. This new model introduces an innovative "image thinking" feature, enhancing its ability to understand and process images in addition to its powerful language processing capabilities. The model is designed for efficiency, using only 3B activation parameters, which allows for quick response times in various AI applications. The key technological breakthrough is the model's ability to not only analyze images but also to perform related actions like enlarging images and conducting image-based searches. These advancements are expected to enrich the user interaction experience between images and text, opening new possibilities for applications in intelligent search, e-commerce, and online education. Baidu has open-sourced the model, allowing developers and researchers to explore its potential and promote further development in multimodal AI. This release marks another significant step for Baidu in strengthening its position in the competitive artificial intelligence landscape.

Related News

🔴 BIDU is trading 6% down today amid broader market weakness and macro headwinds

Baidu Receives Consensus Buy Rating with $150.43 Price Target for 2026

Baidu Analysts Raise Fair Value 16% on AI and Cloud Momentum

Baidu Earnings Report Scheduled for February 26, 2026

Baidu Sees 14.2% Decrease in Short Interest in January