---
title: Text Extractor
emoji: π
colorFrom: gray
colorTo: indigo
sdk: gradio
sdk_version: 6.0.2
app_file: app.py
pinned: false
license: mit
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
___________
OCR Text Extractor + Summarizer
An AI-powered tool that extracts text from images using Tesseract OCR and then summarizes it using a transformer model.
Upload any image (screenshots, photos, scanned documents, notes) → get clean extracted text + an AI summary.
Features
Upload an image containing text
Extracts text using Tesseract OCR
Summarizes extracted text using Hugging Face transformers
Fast, simple Gradio UI
Works on CPU; no GPU required
How it Works
Image is processed with Tesseract OCR
Extracted text is cleaned
Text is fed into a pretrained summarization model
The extracted text and the summary are displayed in the UI
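The steps above can be sketched roughly as follows. This is a minimal illustration, not the code in app.py: the function names (clean_text, extract_text, summarize) and the cleaning rules are assumptions, the summarizer uses whatever default model the installed transformers version picks for the "summarization" task, and that task is assumed to be available in the installed version.

```python
import re


def clean_text(raw: str) -> str:
    """Tidy raw OCR output (hypothetical cleaning rules, not taken from app.py)."""
    text = re.sub(r"-\n(?=\w)", "", raw)  # rejoin words hyphenated across a line break
    text = re.sub(r"\s+", " ", text)      # collapse newlines, tabs, and repeated spaces
    return text.strip()


def extract_text(image) -> str:
    """Run Tesseract on a PIL image and return the cleaned text."""
    import pytesseract  # lazy import; requires the tesseract binary on PATH

    return clean_text(pytesseract.image_to_string(image))


def summarize(text: str, max_length: int = 130, min_length: int = 30) -> str:
    """Summarize text with a pretrained model (default model is an assumption)."""
    from transformers import pipeline  # lazy import; downloads a model on first use

    summarizer = pipeline("summarization")
    result = summarizer(
        text,
        max_length=max_length,
        min_length=min_length,
        do_sample=False,
        truncation=True,  # avoid errors on inputs longer than the model limit
    )
    return result[0]["summary_text"]
```

In a real app the summarizer would normally be created once at startup rather than on every call, since loading the model is the slow part on CPU.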
Project Structure
├── app.py
├── requirements.txt
├── packages.txt
└── README.md
Dependencies
Python packages (requirements.txt)
gradio
pillow
pytesseract
transformers
torch
tesseract
System packages (packages.txt)
tesseract-ocr
tesseract-ocr-eng
These install the Tesseract binary and its English language data, which pytesseract calls at runtime on Hugging Face Spaces.
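Because pytesseract only wraps the Tesseract command-line program, a quick check that the binary is on PATH can save a confusing error later. The helper below is a hypothetical sketch (the name tesseract_available is not from app.py) and uses only the standard library.

```python
import shutil


def tesseract_available(binary: str = "tesseract") -> bool:
    """Return True if the given executable can be found on PATH."""
    return shutil.which(binary) is not None


if __name__ == "__main__":
    if tesseract_available():
        print("Tesseract found on PATH")
    else:
        print("Tesseract not found; install the tesseract-ocr system package")
```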
Running Locally
Install the Tesseract binary first (for example, the tesseract-ocr package on Debian/Ubuntu), then run:
pip install -r requirements.txt
python app.py
Demo
Upload an image → click Submit → done!
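For readers curious how an upload-and-submit flow like this is typically wired in Gradio, here is a small sketch. It is not the actual app.py: process and build_demo are hypothetical names, and the ocr and summarize arguments stand in for whatever functions the app really uses.

```python
def process(image, ocr, summarize):
    """Return (extracted_text, summary) for one uploaded image."""
    if image is None:
        return "", "No image uploaded."
    text = ocr(image)
    if not text.strip():
        return "", "No text found in the image."
    return text, summarize(text)


def build_demo(ocr, summarize):
    """Build a Gradio Interface around process() (layout is an assumption)."""
    import gradio as gr  # lazy import so process() works without Gradio installed

    return gr.Interface(
        fn=lambda image: process(image, ocr, summarize),
        inputs=gr.Image(type="pil", label="Image"),
        outputs=[
            gr.Textbox(label="Extracted text"),
            gr.Textbox(label="Summary"),
        ],
        title="OCR Text Extractor + Summarizer",
    )
```

Calling build_demo(my_ocr, my_summarize).launch() would start the UI, where my_ocr and my_summarize are placeholders for your own functions. Handling the empty-image and empty-text cases up front keeps the summarizer from being called on nothing.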
Acknowledgements
Tesseract OCR
Hugging Face Transformers
Gradio for UI
Try the live Space
https://huggingface.co/spaces/prans-cs55/text_extractor