Gemma 3 Image to Text

A web application that uses Gemma 3 to extract text from images. The application allows you to select a portion of an image and get the text content from that selection.

Features

Upload and select portions of images
Extract text using Gemma 3
Real-time processing time estimation
Option to include/exclude context image
Progress tracking with countdown timer

Example

Upload an image
Select the area containing text
Choose whether to include context
Click "Analyze Selection" to extract text

Setup

Clone the repository
Install dependencies
Run the Flask application
Open in browser at http://localhost:7000

Requirements

Python 3.x
Flask
Ollama with Gemma 3 model

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
templates		templates
.DS_Store		.DS_Store
README.md		README.md
app.py		app.py
example.png		example.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gemma 3 Image to Text

Features

Example

Setup

Requirements

License

About

Releases

Packages

Languages

punnerud/gemma3_image_to_text

Folders and files

Latest commit

History

Repository files navigation

Gemma 3 Image to Text

Features

Example

Setup

Requirements

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages