|
||
---|---|---|
.gitattributes | ||
README.md | ||
config.json | ||
preprocessor_config.json | ||
pytorch_model.bin | ||
special_tokens_map.json | ||
tokenizer_config.json | ||
vocab.txt |
README.md
language | tags | license | datasets | ||
---|---|---|---|---|---|
ja |
|
apache-2.0 |
|
Manga OCR
Optical character recognition for Japanese text, with the main focus being Japanese manga.
It uses Vision Encoder Decoder framework.
Manga OCR can be used as a general purpose printed Japanese OCR, but its main goal was to provide a high quality text recognition, robust against various scenarios specific to manga:
- both vertical and horizontal text
- text with furigana
- text overlaid on images
- wide variety of fonts and font styles
- low quality images
Code for inference is available here.
Code for training will be released soon.