manuscript-ocr

Contents:

  • Getting Started
    • Installation
      • Minimum System Requirements
      • Diagnostics
    • Quick Start
    • Example Notebooks
    • Main Components
    • Model Zoo
    • Related Work
  • Model Zoo
    • Detection
    • Layout
    • Recognition
    • Correction
    • Architecture Sources
  • Pipeline Usage Guide
    • Stage Contracts
      • Detector
      • Recognizer
      • Layout
      • Corrector
    • Basic Usage
    • Disable Stages
    • Layout Placement
    • Built-in Components
    • TRBA Region Preparation
    • Collapsing Text Spans
    • Visualization and Profiling
    • Intermediate Results
  • Library Structure
    • Overview
  • API Reference
    • Pipeline
      • Pipeline
        • Pipeline.__init__()
        • Pipeline.predict()
        • Pipeline.get_text()
        • Pipeline.last_detection_page
        • Pipeline.last_layout_page
        • Pipeline.last_recognition_page
        • Pipeline.last_correction_page
    • Data Structures
      • TextSpan
        • TextSpan.polygon
        • TextSpan.detection_confidence
        • TextSpan.text
        • TextSpan.recognition_confidence
        • TextSpan.order
        • TextSpan.polygon
        • TextSpan.detection_confidence
        • TextSpan.text
        • TextSpan.recognition_confidence
        • TextSpan.order
      • Line
        • Line.text_spans
        • Line.order
        • Line.order
        • Line.words
      • Block
        • Block.lines
        • Block.text_spans
        • Block.order
        • Block.order
        • Block.__init__()
        • Block.words
      • Page
        • Page.blocks
        • Page.blocks
        • Page.to_dict()
        • Page.to_json()
        • Page.from_json()
    • Detectors
      • EAST
        • EAST.default_weights_name
        • EAST.pretrained_registry
        • EAST.__init__()
        • EAST.predict()
        • EAST.train()
        • EAST.export()
      • YOLO
        • YOLO.default_weights_name
        • YOLO.default_target_size
        • YOLO.pretrained_registry
        • YOLO.config_registry
        • YOLO.__init__()
        • YOLO.predict()
    • Layouts
      • SimpleSorting
        • SimpleSorting.__init__()
        • SimpleSorting.predict()
    • Recognizers
      • TRBA
        • TRBA.default_weights_name
        • TRBA.pretrained_registry
        • TRBA.config_registry
        • TRBA.charset_registry
        • TRBA.__init__()
        • TRBA.predict()
        • TRBA.train()
        • TRBA.export()
    • Correctors
      • CharLM
        • CharLM
        • Overview
        • Available Presets
        • Quick Example
        • Basic Usage
        • Advanced Configuration
        • Using Custom Lexicon
        • Training Custom Model
        • Export to ONNX
    • Utilities
      • read_image()
      • create_page_from_text()
      • create_page_from_image()
      • visualize_page()
      • organize_page()
      • crop_axis_aligned()
      • crop_polygon_mask()
      • merge_polygons()
      • order_quad_points()
      • polygon_to_bbox()
      • warp_quad()
      • merge_text_spans()
      • collapse_line_text_spans()
      • collapse_block_text_spans()
      • collapse_page_text_spans()
      • set_seed()
  • Related Work
  • Acknowledgements
manuscript-ocr
  • Related Work
  • View page source

Related Work

The following publications are related to the manuscript-ocr project and its experiments on pre-reform handwritten Russian texts:

  • Sherstnev, P.A.; Kozhin, K.D.; Pyataeva, A.V. Analyzing the Influence of Hyperparameters on the Efficiency of an OCR Model for Pre-Reform Handwritten Texts. Program Comput Soft 51, 173–180 (2025). https://doi.org/10.1134/S0361768825700069

  • Шерстнев, П. А.; Кожин, К. Д.; Пятаева, А. В. Анализ влияния гиперпараметров на эффективность OCR-модели для дореформенных рукописных текстов // Программирование. – 2025. – № 3. – С. 70-79. – DOI 10.31857/S0132347425030071. – EDN GRLAPG.

  • Шерстнев, П. А.; Кожин, К. Д.; Пятаева, А. В. Распознавание рукописных текстов отчетов губернаторов Енисейской губернии 19 века // GraphiCon 2024 : Материалы 34-й Международной конференции по компьютерной графике и машинному зрению, Омск, 17–19 сентября 2024 года. – Омск: Омский государственный технический университет, 2024. – С. 519-524. – DOI 10.25206/978-5-8149-3873-2-2024-519-524. – EDN GBEKEZ.

Previous Next

© Copyright 2026, Konstantin Kozhin.

Built with Sphinx using a theme provided by Read the Docs.