Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Intellindust-AI-Lab/HTR-VT

Open more actions menu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📙 HTR-VT (Pattern Recognition)

Introduction

This is the official implementation of our Pattern Recognition(PR) 2025 paper "HTR-VT: Handwritten Text Recognition with Vision Transformer". It's a new and effective baseline for handwritten text recognition solely using Vision Transformer and CTC Loss.

[Project Page] [Paper] [arXiv] [Datasets] [Checkpoints]

Table of Content

1. Overview

teaser

2. Visual Results

method

3. Installation

3.1. Environment

Our model can be learnt in a single GPU RTX-4090 24G

conda env create -f environment.yaml
conda activate htr

The code was tested on Python 3.9 and PyTorch 1.13.0.

3.2. Datasets

  • Using IAM, READ2016 and LAM for handwritten text recognition.
IAM
Register at the FKI's webpage :https://fki.tic.heia-fr.ch/databases/iam-handwriting-database)
Download the dataset from here :https://fki.tic.heia-fr.ch/databases/download-the-iam-handwriting-database
READ2016
wget https://zenodo.org/record/1164045/files/{Test-ICFHR-2016.tgz,Train-And-Val-ICFHR-2016.tgz}
LAM
Download the dataset from here: https://aimagelab.ing.unimore.it/imagelab/page.asp?IdPage=46
  • Download datasets to ./data/. Take IAM for an example: The structure of the file should be:
./data/iam/
├── train.ln
├── val.ln
├── test.ln
└── lines
      ├──a01-000u-00.png
      ├──a01-000u-00.txt
      ├──a01-000u-01.png
      ├──a01-000u-01.txt
      ...

4. Quick Start

  • We provide convenient and comprehensive commands in ./run/ to train and test on different datasets to help researchers reproducing the results of the paper.

5. Citation

If our project is helpful for your research, please consider citing :

@article{li2025htr,
  title={HTR-VT: Handwritten text recognition with vision transformer},
  author={Li, Yuting and Chen, Dexiong and Tang, Tinglong and Shen, Xi},
  journal={Pattern Recognition},
  volume={158},
  pages={110967},
  year={2025},
  publisher={Elsevier}
}

6. Acknowledgement

We appreciate helps from public code: VAN and OrigamiNet.

About

(Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Morty Proxy This is a proxified and sanitized view of the page, visit original site.