Alibaba and Nvidia collaborate on an advanced autonomous-driving solution. Alibaba Cloud, the digital technology backbone of the e-commerce giant, announced the collaboration, unveiling a large multimodal model (LMM) solution for automotive applications co-developed with Nvidia and Banma Network Technology, Alibaba’s intelligent cockpit solution provider.
Alibaba Cloud’s Qwen portfolio of proprietary large language models (LLMs) – including Qwen2-7B and Qwen2-VL – have been integrated with Nvidia’s Drive AGX Orin platform for autonomous vehicles. LLMs underpin generative AI services like ChatGPT. Nvidia’s Drive AGX Orin system-on-a-chip platform serves as the brain for intelligent automated-driving systems used by major Chinese electric vehicle makers. Nvidia’s model acceleration technology has already significantly reduced computational costs and minimized latency in the real-time processing of complex tasks by Alibaba Cloud’s AI models, ensuring a smooth and uninterrupted intelligent experience for both drivers and passengers.
Empowering Generative AI in Vehicles
“Together with our partners, we want to empower more businesses and individuals to unlock the potential of generative AI,” Alibaba Cloud chief technology officer Zhou Jingren said. With Qwen’s advanced capabilities in handling complex inquiries and processing visual intelligence, the new LMM solution will enable in-car voice assistants to engage in dynamic, multi-turn conversations. These assistants will also offer recommendations, ranging from providing information about nearby landmarks to proactively suggesting car headlights be turned on during certain conditions.
Enhanced In-Car Experiences
As part of that LMM solution, Alibaba Cloud’s Mobile Agent will enable vehicle owners to effortlessly execute voice commands, such as ordering milkshakes through a food delivery app, resulting in richer in-car experiences.