Category: Data Analysis
Pricing: Free
Last updated: 2 weeks ago

About LLM-Powered Invoice & Receipt Extractor (OSS)

We just open-sourced a language-model-powered extractor for invoices and receipts. It turns messy, unstructured text (from OCR or scanned docs) into clean, structured JSON — complete with field-level confidence scores.
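
To make the idea of field-level confidence concrete, here is a minimal Python sketch. The output shape (a `value` and a `confidence` per field), the field names, and the `fields_needing_review` helper are assumptions for illustration, not the project's documented schema.

```python
import json

# Hypothetical extractor output: each field carries a value and a
# confidence score in [0, 1]. The real schema may differ.
sample_output = json.loads("""
{
  "merchant": {"value": "Cafe Lumiere", "confidence": 0.97},
  "date":     {"value": "2025-05-03",   "confidence": 0.97},
  "total":    {"value": 18.40,          "confidence": 0.62}
}
""")


def fields_needing_review(extracted, threshold=0.8):
    """Return the sorted names of fields whose confidence is below threshold."""
    return sorted(
        name for name, field in extracted.items()
        if field["confidence"] < threshold
    )


# Low-confidence fields can be routed to a human instead of trusted blindly.
print(fields_needing_review(sample_output))
```

A per-field threshold like this is one simple way to use the scores; the cut-off of 0.8 is arbitrary and would need tuning against real documents.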

What you get

Additional Information

Struggling to get real receipt/invoice data for your AI models? I built an open-source generator using LLMs (JSON output, no templates)

Link: https://github.com/WellApp-ai/Well/tree/main/ai-receipt-generator

Sample output: https://imgur.com/a/YtFSodj


When you're building AI systems to extract structured data from receipts, invoices, and other financial docs, there's one big bottleneck: realistic, diverse, high-volume training data.

Most open datasets are:

  • Too clean (template-generated)
  • Too uniform (Western formats only)
  • Not legally usable at scale

So I built this little open-source tool that uses LLMs to generate synthetic receipts in JSON format, fully customizable via prompt + config. No PDFs, no OCR simulation — just structured text output designed for evals, testing, or fine-tuning.

Key features:

  • Works with OpenAI, local models, Claude, etc. (LLM-agnostic)
  • JSON schema for receipts/invoices, easy to customize
  • Faker fallback if you don’t want to hit a model
  • Locale-aware: useful for global format simulation
  • Configurable weirdness: broken totals, missing fields, typos, etc.
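
To illustrate what "configurable weirdness" could look like, here is a minimal standard-library Python sketch. It is not the tool's actual code: the receipt schema, `make_receipt`, `add_weirdness`, and the probability parameters are all hypothetical names chosen for this example, and a real run would take the clean receipt from an LLM or Faker rather than a hard-coded dict.

```python
import copy
import random


def make_receipt():
    """Build a small, internally consistent receipt (hypothetical schema)."""
    items = [
        {"description": "Espresso", "quantity": 2, "unit_price": 3.50},
        {"description": "Croissant", "quantity": 1, "unit_price": 2.80},
    ]
    subtotal = round(sum(i["quantity"] * i["unit_price"] for i in items), 2)
    tax = round(subtotal * 0.10, 2)  # assumed flat 10% tax rate
    return {
        "merchant": "Cafe Lumiere",
        "date": "2025-05-03",
        "currency": "EUR",
        "items": items,
        "subtotal": subtotal,
        "tax": tax,
        "total": round(subtotal + tax, 2),
    }


def add_weirdness(receipt, rng, p_broken_total=0.3, p_missing_field=0.3, p_typo=0.3):
    """Return a noisy copy of the receipt and the list of defects applied."""
    noisy = copy.deepcopy(receipt)
    defects = []
    if rng.random() < p_broken_total:
        # Shift the total so it no longer equals subtotal + tax.
        noisy["total"] = round(noisy["total"] + rng.choice([-1.0, 0.5, 10.0]), 2)
        defects.append("broken_total")
    if rng.random() < p_missing_field:
        dropped = rng.choice(["date", "currency", "tax"])
        del noisy[dropped]
        defects.append(f"missing_{dropped}")
    if rng.random() < p_typo:
        # Swap two adjacent characters in the merchant name.
        name = noisy["merchant"]
        i = rng.randrange(len(name) - 1)
        noisy["merchant"] = name[:i] + name[i + 1] + name[i] + name[i + 2:]
        defects.append("typo_merchant")
    return noisy, defects


rng = random.Random(42)  # seeded so a test run is reproducible
noisy, defects = add_weirdness(make_receipt(), rng)
print(defects)
print(noisy)
```

Returning the list of defects alongside the noisy receipt keeps a record of what was corrupted, which is handy when the output is later used as a labeled test case.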

This helped us stress-test our document parser with realistic, non-trivial edge cases that templates couldn’t replicate.
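
A stress test of this kind can be scored per field. The sketch below is an assumption about how one might do it, not the authors' eval suite: `field_accuracy` is a hypothetical helper that compares a parser's output with the known ground-truth JSON from the generator using exact match.

```python
def field_accuracy(pairs, fields):
    """Per-field exact-match accuracy over (expected, predicted) dict pairs.

    A field counts as correct when both sides agree, including the case
    where both omit it (the parser correctly reports nothing).
    """
    if not pairs:
        raise ValueError("need at least one (expected, predicted) pair")
    hits = {f: 0 for f in fields}
    for expected, predicted in pairs:
        for f in fields:
            if predicted.get(f) == expected.get(f):
                hits[f] += 1
    return {f: hits[f] / len(pairs) for f in fields}


# Ground truth comes from the generator; predictions from the parser under test.
pairs = [
    ({"merchant": "Cafe Lumiere", "total": 10.78},
     {"merchant": "Cafe Lumiere", "total": 10.78}),
    ({"merchant": "Cafe Lumiere", "total": 10.78},
     {"merchant": "Cafe Lumiere", "total": 11.28}),  # parser misread the total
]
scores = field_accuracy(pairs, ["merchant", "total"])
print(scores)
```

Exact match is deliberately strict; for amounts or dates a normalizing comparison (rounding, date parsing) may be a better fit.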


Curious if anyone else here is:

  • Generating synthetic data for document AI
  • Testing LLM-based extractors or OCR+LLM combos
  • Building eval suites for financial AI models

Would love feedback, ideas, or thoughts on how you’d extend this.

Frequently asked questions

  • What is LLM-Powered Invoice & Receipt Extractor (OSS) used for?
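    LLM-Powered Invoice & Receipt Extractor (OSS) turns messy, unstructured text from OCR or scanned documents into structured JSON with field-level confidence scores.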
  • Is LLM-Powered Invoice & Receipt Extractor (OSS) free?
    LLM-Powered Invoice & Receipt Extractor (OSS) is free to use.
  • What are alternatives to LLM-Powered Invoice & Receipt Extractor (OSS)?
    Top alternatives to LLM-Powered Invoice & Receipt Extractor (OSS) include Tabula, AutoPredict, Genius Sheets, NewsDeck from OneSub, Decile.
  • Where can I get LLM-Powered Invoice & Receipt Extractor (OSS)?
    LLM-Powered Invoice & Receipt Extractor (OSS) is available at https://github.com/WellApp-ai/Well/tree/main/ai-receipt-generator.
  • What category is LLM-Powered Invoice & Receipt Extractor (OSS) in?
    LLM-Powered Invoice & Receipt Extractor (OSS) is listed in Data Analysis.

Alternatives to LLM-Powered Invoice & Receipt Extractor (OSS)

Tabula

GPT-powered marketing data analytics. Build multi-touch attribution to see how different marketing channels work together to convert leads into customers

AutoPredict

AutoPredict - Predict How Long Your Car Will Last

Genius Sheets

Take On More Clients Without Working More Hours