Community Tools

This section highlights notable tools developed by the docTR community.

docTR-Labeler

Link:

https://github.com/text2knowledge/docTR-Labeler

Overview

docTR-Labeler is a dedicated annotation tool tailored for creating and editing OCR datasets to train and fine-tune docTR models. It offers a user-friendly graphical interface, featuring polygon-based text labeling, automatic annotation suggestions via OnnxTR, and convenient label export capabilities.

Key Features

  • Interactive Polygon Editing: Draw and edit polygons around text regions with precision

  • AI-Powered Auto-Annotation: Automatic annotation suggestions and polygon refinement powered by OnnxTR

  • Auto-Correction: Automatic correction of polygon shapes to ensure accurate text region representation

  • Efficient Workflow: Keyboard shortcuts for selection, zooming, undo/redraw, and saving operations

  • Flexible Access: CLI launch with doctr-labeler command and full programmatic Python API integration

  • Privacy-First: No authentication required - everything runs locally on your machine

  • Real-Time Rendering: Live image rendering with helpful visual feedback

OnnxTR

Link:

https://github.com/felixdittrich92/OnnxTR

Overview

OnnxTR provides an ONNX-based backend for docTR models, enabling fast, cross-platform inference using ONNX Runtime. It’s a core refactored library that enhances the performance and flexibility of OCR tasks without relying on heavy frameworks like PyTorch or TensorFlow.

Key Features

  • Minimal Dependencies: No PyTorch or TensorFlow requirements

  • Fast Inference: Optimized with ONNX Runtime for production environments

  • Quantization Support: Reduced memory usage and faster inference through model quantization

  • Batch Processing: Efficient batch inference capabilities

  • Multi-Platform: CPU, GPU, and specialized accelerator runtimes like OpenVINO

  • Flexible Installation: Separate install options for different runtime requirements

  • Familiar API: One-line inference via onnxtr.models.ocr_predictor (similar to docTR)

  • Docker Ready: Production-ready Docker images available

  • Hugging Face Integration: Seamless model sharing and loading

  • Server Optimized: OpenCV headless installation options for server environments

docling-OCR-OnnxTR

Link:

https://github.com/felixdittrich92/docling-OCR-OnnxTR

Overview

docling-OCR-OnnxTR is a high-performance plugin that integrates the OnnxTR OCR engine into the Docling document parsing framework. By leveraging ONNX Runtime, it delivers superior accuracy and efficiency compared to traditional OCR engines across various hardware configurations.

Key Features

  • Native Docling Support: Direct integration with Docling pipelines using OnnxtrOcrOptions

  • Drop-in Replacement: Easy migration from existing OCR engines

  • Model Selection: Control over detection and recognition model choices

  • Multi-Language Support: Configurable language settings

  • Quality Control: Adjustable confidence thresholds

  • Performance Tuning: Batch size optimization

  • Enhanced Processing: Orientation correction and 8-bit model loading options

Contribute Your Tool

Share Your Innovation

Have you built something amazing on top of docTR ?

We’d love to showcase your work! Whether it’s a useful plugin, dataset preparation tool, or any other docTR-based project, the community would benefit from learning about it.

How to Contribute

To contribute your tool to the docTR community, please follow these steps:

  1. GitHub: Open a pull request with your tool information

  2. Format: Follow the structure above with clear descriptions and key features

Tip

Include a clear tool description and highlight what makes your tool unique or particularly useful to the docTR community.

This helps others quickly understand its value and how to use it effectively.