impira/layoutlm-document-qa is a forked repo from huggingface. License: mit
Go to file
Ankur Goyal b28dd3e831 Update README.md
Add widget example
2022-09-16 13:33:27 +00:00
.gitattributes initial commit 2022-08-07 21:07:19 +00:00
README.md Update README.md 2022-09-16 13:33:27 +00:00
config.json Remove custom pipeline 2022-09-05 14:29:58 -07:00
merges.txt Initial commit 2022-08-07 17:07:34 -07:00
pyproject.toml Bug fixes + Copy formatting from transformers repo 2022-08-07 18:42:25 -07:00
pytorch_model.bin Update model weights 2022-08-29 16:23:57 -07:00
setup.cfg Bug fixes + Copy formatting from transformers repo 2022-08-07 18:42:25 -07:00
special_tokens_map.json Initial commit 2022-08-07 17:07:34 -07:00
tf_model.h5 Fix Tensorflow weights 2022-09-04 17:48:51 -07:00
tokenizer.json Initial commit 2022-08-07 17:07:34 -07:00
tokenizer_config.json Set the tokenizer config to add_prefix_space=True 2022-09-08 07:00:50 -07:00
vocab.json Initial commit 2022-08-07 17:07:34 -07:00

README.md

language license pipeline_tag tags widget
en mit document-question-answering
layoutlm
document-question-answering
pdf
text src
What is the invoice number? 2359223c18/invoice.png
text src
What is the purchase amount? 2359223c18/contract.jpeg

LayoutLM for Visual Question Answering

This is a fine-tuned version of the multi-modal LayoutLM model for the task of question answering on documents. It has been fine-tuned using both the SQuAD2.0 and DocVQA datasets.

Getting started with the model

To run these examples, you must have PIL, pytesseract, and PyTorch installed in addition to transformers.

from transformers import pipeline

nlp = pipeline(
    "document-question-answering",
    model="impira/layoutlm-document-qa",
)

nlp(
    "https://templates.invoicehome.com/invoice-template-us-neat-750px.png",
    "What is the invoice number?"
)
# {'score': 0.9943977, 'answer': 'us-001', 'start': 15, 'end': 15}

nlp(
    "https://miro.medium.com/max/787/1*iECQRIiOGTmEFLdWkVIH2g.jpeg",
    "What is the purchase amount?"
)
# {'score': 0.9912159, 'answer': '$1,000,000,000', 'start': 97, 'end': 97}

nlp(
    "https://www.accountingcoach.com/wp-content/uploads/2013/10/income-statement-example@2x.png",
    "What are the 2020 net sales?"
)
# {'score': 0.59147286, 'answer': '$ 3,750', 'start': 19, 'end': 20}

NOTE: This model and pipeline was recently landed in transformers via PR #18407 and PR #18414, so you'll need to use a recent version of transformers, for example:

pip install git+https://github.com/huggingface/transformers.git@2ef774211733f0acf8d3415f9284c49ef219e991

About us

This model was created by the team at Impira.