Cloudera, the data company for trusted enterprise AI, today announced its expanded collaboration with NVIDIA. Cloudera Powered by NVIDIA will integrate enterprise-grade NVIDIA NIM microservices, part of the NVIDIA AI Enterprise software platform, into Cloudera Machine Learning, a Cloudera Data Platform service for AI/ML workflows, to deliver fast, secure, and simplified end-to-end generative AI workflows in production.
Enterprise data, combined with a comprehensive full-stack platform optimized for large language models (LLMs), plays a critical role in advancing an organization's generative AI applications from pilot to production. NVIDIA NIM and NeMo Retriever microservices let developers link AI models to their business data, including text, images, and visualizations such as bar graphs, line plots, and pie charts, to generate highly accurate, contextually relevant responses. Developers using these microservices can deploy applications through NVIDIA AI Enterprise, which provides optimized runtimes for building, customizing, and deploying enterprise-grade LLMs. By leveraging NVIDIA microservices, Cloudera Machine Learning will enable customers to unleash the value of their enterprise data under Cloudera management by bringing high-performance AI workflows, AI platform software, and accelerated computing to the data, wherever it resides.
Cloudera will introduce multiple integrations with NVIDIA microservices. Cloudera Machine Learning will integrate model and application serving powered by NVIDIA microservices to boost model inference performance across all workloads. With this new AI model-serving functionality, customers can achieve fault tolerance, low-latency serving, and auto-scaling for models deployed anywhere, across both public and private clouds. Additionally, Cloudera Machine Learning will offer integrated NVIDIA NeMo Retriever microservices to simplify the connection of custom LLMs to enterprise data. This capability will enable users to build retrieval-augmented generation (RAG)-based applications for production use.
Cloudera previously worked with NVIDIA to harness GPU-optimized data processing through the integration of the NVIDIA RAPIDS Accelerator for Apache Spark into the Cloudera Data Platform. Now, with the planned addition of NVIDIA microservices and integration with NVIDIA AI Enterprise, Cloudera Data Platform will uniquely deliver streamlined end-to-end hybrid AI pipelines.