Ka-OCR

This is a monorepo of OCR project that is able to detect Georgian (Ka) texts on images/PDFs.

The project consists of few parts/subprojects:

dataset_gen

Handles synthetic data generation, adding real image data and it's augmentation, unified metadate.csv generation, zipping and uploading to Hugging Face automatically.

ml_training

Runs model training script and manages checkpoints, model evaluation and best model saving.

api

FastAPI based user-facing api for model serving.

Setup

Each subproject is independently managed with UV. Go to the subproject root you want to run/edit and run command to automatically setup venv and install deps.

For example, in case of ml_training subproject:

cd ml_training
uv sync

Then run respective entry point:

uv run main

See more detailed information about setup and usage in each subproject's readme file.

Name		Name	Last commit message	Last commit date
Latest commit History 175 Commits
api		api
trocr_dataset_gen		trocr_dataset_gen
trocr_training		trocr_training
yolo_word_detector		yolo_word_detector
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ka-OCR

dataset_gen

ml_training

api

Setup

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ka-OCR

dataset_gen

ml_training

api

Setup

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages