Create README.md

2022-01-20 22:39:39 +00:00 · 2022-01-20 22:39:39 +00:00 · 76ff029880
parent 0808c80156
commit 76ff029880
1 changed files with 26 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,26 @@
+---
+language: ja
+tags:
+- image-to-text
+license: apache-2.0
+datasets:
+- manga109s
+---
+
+# Manga OCR
+
+Optical character recognition for Japanese text, with a primary focus on Japanese manga.
+
+It uses the [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/visionencoderdecoder) framework.
+
+Manga OCR can be used as a general-purpose OCR for printed Japanese, but its main goal is to provide high-quality
+text recognition that is robust to various scenarios specific to manga:
+- both vertical and horizontal text
+- text with furigana
+- text overlaid on images
+- wide variety of fonts and font styles
+- low quality images
+
+Code for inference is available [here](https://github.com/kha-white/manga_ocr).
+
+Code for training will be released soon.