Metadata-Version: 2.1
Name: receipt-parser-ocr
Version: 1.0.4
Summary: A lightweight library to parse and categorize receipts from OCR text
Home-page: https://github.com/PallaviKandibilla03/Receipt-Management-AWS
Author: Pallavi Kandibilla
Author-email: your-email@example.com
Keywords: receipt parser ocr categorizer finance
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Topic :: Office/Business :: Financial
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE

# Receipt Parser

A lightweight Python library to parse and categorize receipt data from OCR-extracted text.

## Features

- Parse receipt text from OCR output (PDFs, images)
- Extract key data: store name, total amount, payment mode, date
- Auto-categorize receipts (Groceries, Food, Travel, Shopping, etc.)
- Robust text cleaning - handles common OCR encoding issues
- Easy to use - simple, intuitive API

## Installation

```bash
pip install receipt-parser-ocr
