Docling parses PDF, DOCX, PPTX, HTML, and other formats into a rich unified representation including document layout, tables etc., making them ready for generative AI workflows like RAG.
This integration provides Docling’s capabilities via the DoclingLoader
document loader.
Installation and Setup
Simply installlangchain-docling
from your package manager, e.g. pip:
Document Loader
TheDoclingLoader
class in langchain-docling
seamlessly integrates Docling into
LangChain, enabling you to:
- use various document types in your LLM applications with ease and speed, and
- leverage Docling’s rich representation for advanced, document-native grounding.