Skip to content
@docling-project

Docling Project

Welcome to the Docling Project

This is the GitHub organization Docling open-source project. We like to get continuous feedback from the community: take the poll!

Docling

Docling is our main open-source package. It is a powerful library which simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

We support an amazing community which helps us driving forward the adoption of Docling. Give it a try and join the community!



The key repositories of Docling are:

  • docling - The home of the main docling package.
  • docling-core - The definition of types, transforms, serializers, etc. If it has to do with the DoclingDocument you will find it here.
  • docling-parse - The backend PDF parser used by Docling.
  • docling-serve - The FastAPI wrappers for running Docling as REST API and distribute large jobs.
  • docling-ibm-models - The AI models powering Docling.
  • docling-sdg - Synthetic data generation (SDG) on documents for dataset generation for RAG, finetuning, etc.
  • docling-mcp - The definition of tools with the Model Context Protocol for document conversion, manipulation and generation agents.
  • docling-java - A Java API for interacting with Docling, currently based on docling-serve.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

Pinned Loading

  1. docling docling Public

    Get your documents ready for gen AI

    Python 46.2k 3.3k

  2. docling-serve docling-serve Public

    Running Docling as an API service

    Python 1k 225

  3. docling-core docling-core Public

    A python library to define and validate data types in Docling.

    Python 214 110

  4. community community Public

    6 1

Repositories

Showing 10 of 22 repositories
  • docling-core Public

    A python library to define and validate data types in Docling.

    docling-project/docling-core’s past year of commit activity
    Python 214 MIT 110 43 13 Updated Dec 9, 2025
  • docling-java Public

    A Java API for Docling

    docling-project/docling-java’s past year of commit activity
    Java 41 MIT 2 4 0 Updated Dec 9, 2025
  • docling-eval Public

    Evaluation framework for document processing models and services.

    docling-project/docling-eval’s past year of commit activity
    Python 57 MIT 12 11 8 Updated Dec 8, 2025
  • docling Public

    Get your documents ready for gen AI

    docling-project/docling’s past year of commit activity
    Python 46,197 MIT 3,267 733 (8 issues need help) 21 Updated Dec 8, 2025
  • docling-workshops Public

    Docling workshops

    docling-project/docling-workshops’s past year of commit activity
    Jupyter Notebook 35 CC0-1.0 7 0 1 Updated Dec 4, 2025
  • website Public

    The Docling website

    docling-project/website’s past year of commit activity
    TypeScript 1 0 0 0 Updated Dec 3, 2025
  • docling-parse Public

    Simple package to extract text with coordinates from programmatic PDFs

    docling-project/docling-parse’s past year of commit activity
    C++ 221 MIT 48 38 8 Updated Dec 2, 2025
  • docling-project/docling-ibm-models’s past year of commit activity
    Python 174 MIT 53 28 9 Updated Dec 1, 2025
  • docling-serve Public

    Running Docling as an API service

    docling-project/docling-serve’s past year of commit activity
    Python 1,027 MIT 225 74 8 Updated Nov 24, 2025
  • docling-project/docling-jobkit’s past year of commit activity
    Python 15 MIT 13 12 2 Updated Nov 21, 2025