Databricks Intelligence Platform is the world’s first data intelligence platform powered by generative AI. Infuse AI into every facet of your business.Databricks embraces the LangChain ecosystem in various ways:
- 🚀 Model Serving - Access state-of-the-art LLMs, such as DBRX, Llama3, Mixtral, or your fine-tuned models on Databricks Model Serving, via a highly available and low-latency inference endpoint. LangChain provides LLM (
Databricks
), Chat Model (ChatDatabricks
), and Embeddings (DatabricksEmbeddings
) implementations, streamlining the integration of your models hosted on Databricks Model Serving with your LangChain applications. - 📃 Vector Search - Databricks Vector Search is a serverless vector database seamlessly integrated within the Databricks Platform. Using
DatabricksVectorSearch
, you can incorporate the highly scalable and reliable similarity search engine into your LangChain applications. - 📊 MLflow - MLflow is an open-source platform to manage full the ML lifecycle, including experiment management, evaluation, tracing, deployment, and more. MLflow’s LangChain Integration streamlines the process of developing and operating modern compound ML systems.
- 🌐 SQL Database - Databricks SQL is integrated with
SQLDatabase
in LangChain, allowing you to access the auto-optimizing, exceptionally performant data warehouse. - 💡 Open Models - Databricks open sources models, such as DBRX, which are available through the Hugging Face Hub. These models can be directly utilized with LangChain, leveraging its integration with the
transformers
library.
Installation
First-party Databricks integrations are now available in the databricks-langchain partner package.Chat Model
ChatDatabricks
is a Chat Model class to access chat endpoints hosted on Databricks, including state-of-the-art models such as Llama3, Mixtral, and DBRX, as well as your own fine-tuned models.
LLM
Databricks
is an LLM class to access completion endpoints hosted on Databricks.
Text completion models have been deprecated and the latest and most popular models are chat completion models. Use
ChatDatabricks
chat model instead to use those models and advanced features such as tool calling.Embeddings
DatabricksEmbeddings
is an Embeddings class to access text-embedding endpoints hosted on Databricks, including state-of-the-art models such as BGE, as well as your own fine-tuned models.
Vector Search
Databricks Vector Search is a serverless similarity search engine that allows you to store a vector representation of your data, including metadata, in a vector database. With Vector Search, you can create auto-updating vector search indexes from Delta tables managed by Unity Catalog and query them with a simple API to return the most similar vectors.MLflow Integration
In the context of LangChain integration, MLflow provides the following capabilities:- Experiment Tracking: Tracks and stores models, artifacts, and traces from your LangChain experiments.
- Dependency Management: Automatically records dependency libraries, ensuring consistency among development, staging, and production environments.
- Model Evaluation Offers native capabilities for evaluating LangChain applications.
- Tracing: Visually traces data flows through your LangChain application.