
German Invoice Data Extraction

Invoice extraction pipeline using OCR + layout-aware models with optional UI review.

Categories

CV, NLP

Tech Used

Tesseract OCR, PaddleOCR, Hugging Face Transformers, PyTorch, OpenCV, Regex, spaCy, pandas, NumPy, FastAPI, Flask, Streamlit, CSS, Docker

Problem

Invoice layouts vary widely, so extraction must reliably handle layout changes and multilingual text.
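
As a small illustration of why German-language text needs locale-aware handling, the sketch below parses German-style dates (DD.MM.YYYY) and amounts (period as thousands separator, comma as decimal separator) out of OCR text using regular expressions. The patterns and the function names `parse_german_date` and `parse_german_amount` are hypothetical and chosen for this example; they are not taken from the project.

```python
import re
from datetime import date
from decimal import Decimal

# Hypothetical patterns for illustration only; not the project's actual rules.
# German dates are written day.month.year, e.g. 05.03.2024.
DATE_RE = re.compile(r"\b(\d{1,2})\.(\d{1,2})\.(\d{4})\b")
# German amounts use "." for thousands and "," for decimals, e.g. 1.234,56.
# The lookbehind keeps the pattern from matching inside a date or longer number.
AMOUNT_RE = re.compile(r"(?<![\d.,])(\d{1,3}(?:\.\d{3})*|\d+),(\d{2})(?!\d)")


def parse_german_date(text):
    """Return the first DD.MM.YYYY date in text, or None if absent or invalid."""
    match = DATE_RE.search(text)
    if match is None:
        return None
    day, month, year = (int(group) for group in match.groups())
    try:
        return date(year, month, day)
    except ValueError:
        # OCR noise can produce impossible dates such as 31.02.2024.
        return None


def parse_german_amount(text):
    """Return the first German-formatted amount in text as a Decimal, or None."""
    match = AMOUNT_RE.search(text)
    if match is None:
        return None
    whole = match.group(1).replace(".", "")  # drop thousands separators
    return Decimal(f"{whole}.{match.group(2)}")


if __name__ == "__main__":
    line = "Rechnungsdatum: 05.03.2024  Gesamtbetrag: 1.234,56 EUR"
    print(parse_german_date(line))    # 2024-03-05
    print(parse_german_amount(line))  # 1234.56
```

Returning `Decimal` rather than `float` avoids rounding surprises on currency values, and returning `None` on a failed match leaves room for a fallback such as a layout-aware model or manual review.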

Approach

Results