Community Tools¶
This section highlights notable tools developed by the docTR community.
docTR-Labeler¶
Overview
docTR-Labeler
is a dedicated annotation tool tailored for creating and editing OCR datasets to train and fine-tune docTR models. It offers a user-friendly graphical interface, featuring polygon-based text labeling, automatic annotation suggestions via OnnxTR, and convenient label export capabilities.
Key Features
Interactive Polygon Editing: Draw and edit polygons around text regions with precision
AI-Powered Auto-Annotation: Automatic annotation suggestions and polygon refinement powered by OnnxTR
Auto-Correction: Automatic correction of polygon shapes to ensure accurate text region representation
Efficient Workflow: Keyboard shortcuts for selection, zooming, undo/redraw, and saving operations
Flexible Access: CLI launch with
doctr-labeler
command and full programmatic Python API integrationPrivacy-First: No authentication required - everything runs locally on your machine
Real-Time Rendering: Live image rendering with helpful visual feedback
OnnxTR¶
Overview
OnnxTR
provides an ONNX-based backend for docTR models, enabling fast, cross-platform inference using ONNX Runtime. It’s a core refactored library that enhances the performance and flexibility of OCR tasks without relying on heavy frameworks like PyTorch or TensorFlow.
Key Features
Minimal Dependencies: No PyTorch or TensorFlow requirements
Fast Inference: Optimized with ONNX Runtime for production environments
Quantization Support: Reduced memory usage and faster inference through model quantization
Batch Processing: Efficient batch inference capabilities
Multi-Platform: CPU, GPU, and specialized accelerator runtimes like OpenVINO
Flexible Installation: Separate install options for different runtime requirements
Familiar API: One-line inference via
onnxtr.models.ocr_predictor
(similar to docTR)Docker Ready: Production-ready Docker images available
Hugging Face Integration: Seamless model sharing and loading
Server Optimized: OpenCV headless installation options for server environments
docling-OCR-OnnxTR¶
Overview
docling-OCR-OnnxTR
is a high-performance plugin that integrates the OnnxTR OCR engine into the Docling document parsing framework. By leveraging ONNX Runtime, it delivers superior accuracy and efficiency compared to traditional OCR engines across various hardware configurations.
Key Features
Native Docling Support: Direct integration with Docling pipelines using
OnnxtrOcrOptions
Drop-in Replacement: Easy migration from existing OCR engines
Model Selection: Control over detection and recognition model choices
Multi-Language Support: Configurable language settings
Quality Control: Adjustable confidence thresholds
Performance Tuning: Batch size optimization
Enhanced Processing: Orientation correction and 8-bit model loading options
Contribute Your Tool¶
Share Your Innovation
Have you built something amazing on top of docTR ?
We’d love to showcase your work! Whether it’s a useful plugin, dataset preparation tool, or any other docTR-based project, the community would benefit from learning about it.
How to Contribute
To contribute your tool to the docTR community, please follow these steps:
GitHub: Open a pull request with your tool information
Format: Follow the structure above with clear descriptions and key features
Tip
Include a clear tool description and highlight what makes your tool unique or particularly useful to the docTR community.
This helps others quickly understand its value and how to use it effectively.