Share your model with the community¶
docTR’s focus is on open source, so if you also feel in love with than we appreciate sharing your trained model with the community. To make it easy for you, we have integrated a interface to the huggingface hub.
Loading from Huggingface Hub¶
This section shows how you can easily load a pretrained model from the Huggingface Hub.
from doctr.io import DocumentFile
from doctr.models import ocr_predictor, from_hub
image = DocumentFile.from_images(['data/example.jpg'])
# Load a custom detection model from huggingface hub
det_model = from_hub('Felix92/doctr-tf-db-resnet50')
# Load a custom recognition model from huggingface hub
reco_model = from_hub('Felix92/doctr-tf-crnn-vgg16-bn-french')
# You can easily plug in this models to the OCR predictor
predictor = ocr_predictor(det_arch=det_model, reco_arch=reco_model)
result = predictor(image)
from doctr.io import DocumentFile
from doctr.models import ocr_predictor, from_hub
image = DocumentFile.from_images(['data/example.jpg'])
# Load a custom detection model from huggingface hub
det_model = from_hub('Felix92/doctr-torch-db-mobilenet-v3-large')
# Load a custom recognition model from huggingface hub
reco_model = from_hub('Felix92/doctr-torch-crnn-mobilenet-v3-large-french')
# You can easily plug in this models to the OCR predictor
predictor = ocr_predictor(det_arch=det_model, reco_arch=reco_model)
result = predictor(image)
Pushing to the Huggingface Hub¶
You can also push your trained model to the Huggingface Hub. You need only to provide the task type (classification, detection, recognition or obj_detection), a name for your trained model (NOTE: existing repositories will not be overwritten) and the model name itself.
- Prerequisites:
Huggingface account (you can easy create one at https://huggingface.co/)
installed Git LFS (check installation at: https://git-lfs.github.com/) in the repository
from doctr.models import recognition, login_to_hub, push_to_hf_hub
login_to_hub()
my_awesome_model = recognition.crnn_mobilenet_v3_large(pretrained=True)
push_to_hf_hub(my_awesome_model, model_name='doctr-crnn-mobilenet-v3-large-french-v1', task='recognition', arch='crnn_mobilenet_v3_large')
It is also possible to push your model directly after training.
python3 ~/doctr/references/recognition/train_tensorflow.py crnn_mobilenet_v3_large --name doctr-crnn-mobilenet-v3-large --push-to-hub
python3 ~/doctr/references/recognition/train_pytorch.py crnn_mobilenet_v3_large --name doctr-crnn-mobilenet-v3-large --push-to-hub
Pretrained community models¶
This section is to provide some tables for pretrained community models. Feel free to open a pull request or issue to add your model to this list.
Naming conventions¶
We suggest using the following naming conventions for your models:
Classification: doctr-<architecture>-<vocab>
Detection: doctr-<architecture>
Recognition: doctr-<architecture>-<vocab>
Classification¶
Architecture |
Repo_ID |
Vocabulary |
Framework |
---|---|---|---|
resnet18 (dummy) |
Felix92/doctr-dummy-torch-resnet18 |
french |
PyTorch |
resnet18 (dummy) |
Felix92/doctr-dummy-tf-resnet18 |
french |
TensorFlow |
Detection¶
Architecture |
Repo_ID |
Framework |
---|---|---|
db_mobilenet_v3_large (dummy) |
Felix92/doctr-torch-db-mobilenet-v3-large |
PyTorch |
db_resnet50 (dummy) |
Felix92/doctr-tf-db-resnet50 |
TensorFlow |
Recognition¶
Architecture |
Repo_ID |
Language |
Framework |
---|---|---|---|
crnn_mobilenet_v3_large (dummy) |
Felix92/doctr-torch-crnn-mobilenet-v3-large |
french |
PyTorch |
crnn_vgg16_bn (dummy) |
Felix92/doctr-tf-crnn-vgg16-bn-french |
french |
TensorFlow |
crnn_vgg16_bn |
tilman-rassy/doctr-crnn-vgg16-bn-fascan-v1 |
french + german + § |
PyTorch |
parseq |
Felix92/doctr-torch-parseq-multilingual-v1 |
multilingual |
PyTorch |