Kinetica launches GenAI solution for real-time inferencing powered by NVIDIA AI Enterprise

At NVIDIA GTC, Kinetica recently announced a generative AI solution for enterprise customers that showcases the next step in the evolution of retrieval-augmented generation (RAG).

Kinetica’s solution, powered by NVIDIA NeMo (part of the NVIDIA AI Enterprise software platform) and NVIDIA accelerated computing infrastructure, addresses all of these concerns. It is built on two components: low-latency vector search (using NVIDIA RAPIDS RAFT technology) and real-time execution of complex data queries. Together, these let enterprises enrich their generative AI applications with domain-specific analytical insights derived directly from the latest operational data.
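To make the pairing of vector search and fresh-data queries concrete, here is a minimal Python sketch. It is not Kinetica's implementation or API: the `Row` type, the `updated_at` column, and the brute-force cosine ranking are all assumptions chosen for illustration, and a production system would use a GPU-accelerated index rather than a linear scan.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Row:
    """Hypothetical record: text, its embedding, and a freshness timestamp."""
    text: str
    embedding: List[float]
    updated_at: int  # epoch seconds (assumed column, not from the article)


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(rows: List[Row], query_vec: List[float], since: int, k: int = 2) -> List[Row]:
    """Filter to recent rows first, then rank the survivors by similarity."""
    fresh = [r for r in rows if r.updated_at >= since]
    ranked = sorted(fresh, key=lambda r: cosine(r.embedding, query_vec), reverse=True)
    return ranked[:k]


rows = [
    Row("cell tower 12 latency spike", [0.9, 0.1, 0.0], 1000),
    Row("billing cycle closed", [0.0, 1.0, 0.0], 1005),
    Row("cell tower 7 packet loss", [0.8, 0.2, 0.1], 1010),
    Row("old outage report", [1.0, 0.0, 0.0], 10),
]

top = search(rows, [1.0, 0.0, 0.0], since=1000, k=2)
for row in top:
    print(row.text)
```

The stale "old outage report" row matches the query vector exactly, yet it is excluded by the freshness filter. That is the point of combining the two steps: similarity alone does not guarantee that the retrieved context reflects current operational data.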

But Kinetica goes further. To truly understand data, AI needs context about the structure, relationships and meaning of tables and columns in an enterprise’s data. Kinetica has built native database objects that allow users to define this semantic context for enterprise data. An LLM can use these objects to grasp the referential context it needs to interact with a database in a context-aware manner.
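As a rough illustration of what semantic context for an LLM might look like, the Python sketch below turns table and column descriptions into a prompt. The `TableContext` class, its fields, the `build_prompt` helper, and the sample telco table are hypothetical stand-ins; the article does not describe the actual shape of Kinetica's native database objects.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class TableContext:
    """Hypothetical description of one table, written for an LLM to read."""
    table: str
    description: str
    columns: Dict[str, str] = field(default_factory=dict)  # column name -> meaning
    rules: List[str] = field(default_factory=list)         # join hints, conventions


def build_prompt(contexts: List[TableContext], question: str) -> str:
    """Assemble a text-to-SQL prompt that carries the semantic context."""
    lines = ["You translate questions into SQL. Use only the tables below.", ""]
    for ctx in contexts:
        lines.append(f"Table {ctx.table}: {ctx.description}")
        for name, meaning in ctx.columns.items():
            lines.append(f"  - {name}: {meaning}")
        for rule in ctx.rules:
            lines.append(f"  Rule: {rule}")
    lines.append("")
    lines.append(f"Question: {question}")
    return "\n".join(lines)


# Sample context for an invented telco table.
cell_events = TableContext(
    table="cell_events",
    description="One row per network event reported by a cell tower.",
    columns={
        "tower_id": "Identifier of the reporting tower",
        "latency_ms": "Round-trip latency in milliseconds",
        "event_time": "Time the event was recorded (UTC)",
    },
    rules=["Join to towers on tower_id to get the tower's region."],
)

prompt = build_prompt([cell_events], "Which towers had the highest latency in the last hour?")
print(prompt)
```

Without the column meanings and the join rule, a model would have to guess what `latency_ms` measures or how tables relate, which is the gap the semantic context is meant to close.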

“Kinetica’s real-time RAG solution, powered by NVIDIA NeMo Retriever microservices, seamlessly integrates LLMs with real-time streaming data insights, overcoming the limitations of traditional approaches,” said Nima Negahban, Co-founder and CEO, Kinetica. “This innovation helps enterprise clients and analysts gain business insights from operational data, like network data in telcos, using just plain English. All they have to do is ask questions, and we handle the rest.”

All the features in Kinetica’s generative AI solution are exposed to developers via a relational SQL API and LangChain plugins. This means that developers building applications can harness the enterprise-grade features that come with a relational database. These include control over who can access the data (role-based access control), reduced data movement from existing data lakes and warehouses (query federation with push-down to existing data sources), and preservation of existing relational schemas.

“Data is the foundation of AI, and enterprises everywhere are eager to connect theirs to generative AI applications,” said Ronnie Vasishta, Senior Vice President of Telecom, NVIDIA. “Kinetica uses the NVIDIA AI Enterprise software platform and accelerated computing infrastructure to infuse real-time data into LLMs, helping customers transform their productivity with generative AI.”