Model Zoo

This page lists the presets and release artifacts documented for version 0.1.12.

Detection

Preset

Architecture

Params

Rotated

Artifacts

Origin

License

east_50_g1

EAST (ResNet-50)

53.86M

Yes

Manuscript

MIT

yolo26s_obb_text_g1

YOLO26-S OBB

9.75M

Yes

Trained by the authors with Ultralytics YOLO

Ultralytics license

yolo26x_obb_text_g1

YOLO26-X OBB

57.61M

Yes

Trained by the authors with Ultralytics YOLO

Ultralytics license

Layout

Preset

Architecture

Params

Supports

Artifacts

Origin

License

SimpleSorting

Algorithmic ordering

Left-to-right, multi-column

Manuscript

MIT

Recognition

Preset

Architecture

Params

Script

Artifacts

Origin

License

trba_base_g1

TRBA

45.10M

Modern + pre-reform Russian

Manuscript

MIT

trba_lite_g1

TRBA-Lite

9.46M

Modern + pre-reform Russian

Manuscript

MIT

trba_lite_g2

TRBA-Lite

9.46M

Modern + pre-reform Russian

Manuscript

MIT

Correction

Preset

Architecture

Params

Orthography

Artifacts

Origin

License

modern_charlm_g1

CharLM

4.38M

Modern Russian

Manuscript

MIT

prereform_charlm_g1

CharLM

4.39M

Pre-reform Russian

Manuscript

MIT

Architecture Sources

  • EAST: An Efficient and Accurate Scene Text Detector (Zhou et al., CVPR 2017) — академическая основа для семейства детекторов EAST. Реализация в manuscript-ocr основана на оригинальной архитектуре, но процедура обучения существенно переработана. Предобученные веса получены авторами проекта.

  • What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis — архитектурная основа для семейства TRBA (TPS-ResNet-BiLSTM-Attn). Распознаватели в manuscript-ocr адаптированы под задачи проекта.