Custom TrOCR (Not used in actual inference)

A custom made TrOCR model. This model underperformed and is for that reason not used.

Scripts

`ocr/data_loader.py`

Defines dataset classes:
- Bible text
- N-grams
- Bible with noise
- N-grams with noise
- Mixed datasets
Uses ocr/image_creator.py to render text as images.

`ocr/image_creator.py`

Renders text into image format.
Supports padding, grayscale conversion, and font selection.

`ocr/tokenizer.py`

Tokenizer class for:
- Encoding text to token IDs
- Decoding token IDs to text
- Vocabulary management

`ocr/ocr_model.py`

Custom TrOCR-based model.
Uses a ViT encoder and an autoregressive text decoder.

`ocr/train.py`

Trains the OCR model on synthetic data.
Handles model saving, loss logging, and evaluation.

`ocr/inference.py`

Loads a trained model and runs inference on new images.
Outputs predicted text.

Name		Name	Last commit message	Last commit date
Latest commit History 128 Commits
.vscode		.vscode
data		data
huggingface_ocr		huggingface_ocr
linesegmentation		linesegmentation
ocr		ocr
old		old
results		results
seb		seb
segmentation		segmentation
viktor		viktor
.gitignore		.gitignore
DLP Project Instruction 2025.pdf		DLP Project Instruction 2025.pdf
Hebrew Charset.pdf		Hebrew Charset.pdf
Monkbrill.tar.gz		Monkbrill.tar.gz
README.md		README.md
alphabet.py		alphabet.py
bible.py		bible.py
crop_alphabet.py		crop_alphabet.py
dataloader_padding.py		dataloader_padding.py
evaluate_test.py		evaluate_test.py
gendata.sh		gendata.sh
lines.npy		lines.npy
noise.py		noise.py
noise_designer.py		noise_designer.py
pipeline.py		pipeline.py
playgroung.ipynb		playgroung.ipynb
scrolls.npy		scrolls.npy
synthetic.py		synthetic.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Custom TrOCR (Not used in actual inference)

Scripts

`ocr/data_loader.py`

`ocr/image_creator.py`

`ocr/tokenizer.py`

`ocr/ocr_model.py`

`ocr/train.py`

`ocr/inference.py`

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Custom TrOCR (Not used in actual inference)

Scripts

ocr/data_loader.py

ocr/image_creator.py

ocr/tokenizer.py

ocr/ocr_model.py

ocr/train.py

ocr/inference.py

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`ocr/data_loader.py`

`ocr/image_creator.py`

`ocr/tokenizer.py`

`ocr/ocr_model.py`

`ocr/train.py`

`ocr/inference.py`

Packages