📙 HTR-VT (Pattern Recognition)

Introduction

This is the official implementation of our Pattern Recognition(PR) 2025 paper "HTR-VT: Handwritten Text Recognition with Vision Transformer". It's a new and effective baseline for handwritten text recognition solely using Vision Transformer and CTC Loss.

[Project Page] [Paper] [arXiv] [Datasets] [Checkpoints]

1. Overview

2. Visual Results

3. Installation

3.1. Environment

Our model can be learnt in a single GPU RTX-4090 24G

conda env create -f environment.yaml
conda activate htr

The code was tested on Python 3.9 and PyTorch 1.13.0.

3.2. Datasets

Using IAM, READ2016 and LAM for handwritten text recognition.

IAM

Register at the FKI's webpage :https://fki.tic.heia-fr.ch/databases/iam-handwriting-database)
Download the dataset from here :https://fki.tic.heia-fr.ch/databases/download-the-iam-handwriting-database

wget https://zenodo.org/record/1164045/files/{Test-ICFHR-2016.tgz,Train-And-Val-ICFHR-2016.tgz}

LAM

Download the dataset from here: https://aimagelab.ing.unimore.it/imagelab/page.asp?IdPage=46

Download datasets to ./data/. Take IAM for an example: The structure of the file should be:

./data/iam/
├── train.ln
├── val.ln
├── test.ln
└── lines
      ├──a01-000u-00.png
      ├──a01-000u-00.txt
      ├──a01-000u-01.png
      ├──a01-000u-01.txt
      ...

4. Quick Start

We provide convenient and comprehensive commands in ./run/ to train and test on different datasets to help researchers reproducing the results of the paper.

5. Citation

If our project is helpful for your research, please consider citing :

@article{li2025htr,
  title={HTR-VT: Handwritten text recognition with vision transformer},
  author={Li, Yuting and Chen, Dexiong and Tang, Tinglong and Shen, Xi},
  journal={Pattern Recognition},
  volume={158},
  pages={110967},
  year={2025},
  publisher={Elsevier}
}

6. Acknowledgement

We appreciate helps from public code: VAN and OrigamiNet.

Name	Name	Last commit message	Last commit date
Latest commit History 26 Commits 26 Commits
data	data
example	example
img	img
model	model
run	run
utils	utils
README.md	README.md
environment.yaml	environment.yaml
test.py	test.py
train.py	train.py
valid.py	valid.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📙 HTR-VT (Pattern Recognition)

Introduction

Table of Content

1. Overview

2. Visual Results

3. Installation

3.1. Environment

3.2. Datasets

4. Quick Start

5. Citation

6. Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Search code, repositories, users, issues, pull requests...

Folders and files

Latest commit

History

Repository files navigation

📙 HTR-VT (Pattern Recognition)

Introduction

Table of Content

1. Overview

2. Visual Results

3. Installation

3.1. Environment

3.2. Datasets

4. Quick Start

5. Citation

6. Acknowledgement

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages