LLM-Powered Invoice & Receipt Extractor (OSS)
LLM-Powered Invoice & Receipt Extractor (OSS) is an AI automation tool. LLM-Powered Invoice & Receipt Extractor (OSS).

- Category
- Organization & Automation
- Pricing
- Free
- Alternatives
- 6 similar tools
- Last updated
- 11 months ago
- Source
- Official site ↗
About LLM-Powered Invoice & Receipt Extractor (OSS)
We just open-sourced a language-model-powered extractor for invoices and receipts. It turns messy, unstructured text (from OCR or scanned docs) into clean, structured JSON — complete with field-level confidence scores.
How LLM-Powered Invoice & Receipt Extractor (OSS) compares
LLM-Powered Invoice & Receipt Extractor (OSS) alongside its closest alternatives in the Organization & Automation category.
| Tool | Use case | Pricing | More |
|---|---|---|---|
| LLM-Powered Invoice & Receipt Extractor (OSS)this page | LLM-Powered Invoice & Receipt Extractor (OSS) | Free | — |
| Sintra | Sintra - Your next employee hires, on AI | — | Open ↗ |
| Riku | Riku.Ai - Build No-Code Prompts & Datasets for AI Models | — | Open ↗ |
| Albus | Albus - ChatGPT Now On Slack | Springworks | — | Open ↗ |
What you get
Additional Information
Struggling to get real receipt/invoice data for your AI models? I built an open-source generator using LLMs (JSON output, no templates)
Link: https://github.com/WellApp-ai/Well/tree/main/ai-receipt-generator
Sample output: https://imgur.com/a/YtFSodj
When you're building AI systems to extract structured data from receipts, invoices, and other financial docs, there's one big bottleneck: Realistic, diverse, high-volume training data.
Most open datasets are:
- Too clean (template-generated)
- Too uniform (Western formats only)
- Not legally usable at scale
So I built this little open-source tool that uses LLMs to generate synthetic receipts in JSON format, fully customizable via prompt + config. No PDFs, no OCR simulation — just structured text output designed for evals, testing, or fine-tuning.
Key features:
- Works with OpenAI, local models, Claude, etc. (LLM-agnostic)
- JSON schema for receipts/invoices, easy to customize
- Faker fallback if you don’t want to hit a model
- Locale-aware: useful for global format simulation
- Configurable weirdness: broken totals, missing fields, typos, etc.
This helped us stress-test our document parser with realistic, non-trivial edge cases that templates couldn’t replicate.
Curious if anyone else here is:
- Generating synthetic data for document AI
- Testing LLM-based extractors or OCR+LLM combos
- Building eval suites for financial AI models
Would love feedback, ideas, or thoughts on how you’d extend this.
Frequently asked questions
What is LLM-Powered Invoice & Receipt Extractor (OSS) used for?
LLM-Powered Invoice & Receipt Extractor (OSS).Is LLM-Powered Invoice & Receipt Extractor (OSS) free?
LLM-Powered Invoice & Receipt Extractor (OSS) is free to use.Where can I get LLM-Powered Invoice & Receipt Extractor (OSS)?
LLM-Powered Invoice & Receipt Extractor (OSS) is available at https://github.com/WellApp-ai/Well/tree/main/ai-receipt-generator.What category is LLM-Powered Invoice & Receipt Extractor (OSS) in?
LLM-Powered Invoice & Receipt Extractor (OSS) is listed in Organization & Automation.







