30 lines
1.0 KiB
Markdown
30 lines
1.0 KiB
Markdown
|
---
|
||
|
language: en
|
||
|
license: cc-by-nc-sa-4.0
|
||
|
tags:
|
||
|
- layoutlm
|
||
|
- document-question-answering
|
||
|
- pdf
|
||
|
- invoices
|
||
|
---
|
||
|
|
||
|
# LayoutLM for Invoices
|
||
|
|
||
|
This is a fine-tuned version of the multi-modal [LayoutLM](https://aka.ms/layoutlm) model for the task of question answering on invoices and other documents. It has been fine-tuned on a proprietary dataset of
|
||
|
invoices as well as both [SQuAD2.0](https://huggingface.co/datasets/squad_v2) and [DocVQA](https://www.docvqa.org/) for general comprehension.
|
||
|
|
||
|
## Non-consecutive tokens
|
||
|
|
||
|
Unlike other QA models, which can only extract consecutive tokens (because they predict the start and end of a sequence), this model can predict longer-range, non-consecutive sequences with an additional
|
||
|
classifier head. For example, it can extract the two-line address as below:
|
||
|
|
||
|
![Two-line Address](./demo.png)
|
||
|
|
||
|
## Getting started with the model
|
||
|
|
||
|
The best way to use this model is via [DocQuery](https://github.com/impira/docquery).
|
||
|
|
||
|
## About us
|
||
|
|
||
|
This model was created by the team at [Impira](https://www.impira.com/).
|