News

Databricks open sources DBRX to help companies build their own custom LLMs

0

Databricks has announced the launch of DBRX, a general purpose large language model (LLM) that outperforms all established open source models on standard benchmarks. According to the company, DBRX democratizes the training and tuning of custom, high-performing LLMs for every enterprise so they no longer need to rely on a small handful of closed models. Available today, DBRX enables organizations around the world to cost-effectively build, train, and serve their own custom LLMs.

Ali Ghodsi, co-founder and CEO at Databricks, said: “We’re excited about DBRX for three key reasons: first, it beats open source models on state-of-the-art industry benchmarks. Second, it beats GPT-3.5 on most benchmarks, which should accelerate the trend we’re seeing across our customer base as organizations replace proprietary models with open source models. Finally, DBRX uses a mixture-of-experts architecture, making the model extremely fast in terms of tokens per second, as well as being cost effective to serve. All in all, DBRX is setting a new standard for open source LLMs — it gives enterprises a platform to build customized reasoning capabilities based on their own data.”

  • DBRX outperforms existing open source LLMs like Llama 2 70B and Mixtral-8x7B on standard industry benchmarks, such as language understanding, programming, math and logic.
  • DBRX also outperforms GPT-3.5 on relevant benchmarks
  • DBRX was developed by Mosaic AI and trained on NVIDIA DGX Cloud. Databricks optimized DBRX for efficiency with a mixture-of-experts (MoE) architecture, built on the MegaBlocks open source project. The resulting model has leading performance and is up to twice as compute-efficient as other available leading LLMs.
  • DBRX sets a new standard for open source models, enabling customizable and transparent generative AI for all enterprises. A recent survey from Andreessen Horowitz found that nearly 60 percent of AI leaders are interested in increasing open source usage or switching when fine-tuned open source models roughly match performance of closed source models. In 2024 and beyond, enterprises expect a significant shift of usage from closed towards open source. Databricks believes DBRX will accelerate this trend.
  • Paired with Databricks Mosaic AI’s unified tooling, DBRX helps customers rapidly build and deploy production-quality generative AI applications that are safe, accurate, and governed without giving up control of their data and intellectual property. Customers benefit from built-in data management, governance, lineage and monitoring capabilities on the Databricks Data Intelligence Platform.

Availability

DBRX is freely available on GitHub and Hugging Face for research and commercial use. Starting today on the Databricks Platform, enterprises can interact with DBRX, leverage its long context abilities in retrieval augmented generation (RAG) systems, and build custom DBRX models on their own unique data. DBRX is also available on AWS and Google Cloud, as well as directly on Microsoft Azure through Azure Databricks.

DBRX is also expected to be available through the NVIDIA API Catalog and supported on the NVIDIA NIM inference microservice.