👁️✨ Gemini Lens

Ask questions about anything on your screen.
Gemini Lens is a desktop tool that lets you draw on your screen, capture it, and get AI-powered explanations about the content.

A demonstration of Gemini Lens analyzing a code snippet.

🚀 Key Features

Screen Overlay Canvas – Draw and annotate anywhere on your screen with a simple, transparent overlay.
Multimodal AI Queries – Uses Google's Gemini Pro Vision to understand both your text prompts and screen captures.
Context-Aware Analysis – Get explanations for code, text, images, or any visual element on your screen.
Real-time Responses – AI-generated answers are streamed back with a typewriter effect for an interactive experience.
Simple & Intuitive UI – A clean, minimal interface accessed via a right-click menu.

🤔 How It Works

Activate & Draw – Run the application to enable the transparent overlay. Use your mouse to draw a circle, arrow, or any annotation over the content you want to ask about.
Ask with Screen – Right-click to open the context menu and select "ask with screen".
Enter Your Prompt – A dialog box will appear. Type your question (e.g., "What does this function do?" or "Summarize this article").
Get an Instant Answer – The tool captures your screen (with your drawings), sends it to the Gemini API along with your prompt, and displays the AI's response in a new window.

🛠️ Installation & Setup

Follow these steps to get Gemini Lens running on your local machine.

1. Prerequisites

Python 3.9 or newer
A Google AI API Key – You can get one from Google AI Studio

2. Clone the Repository

git clone https://github.com/lubaid-01/Gemini-Lens.git
cd gemini-lens

3. Set Up a Virtual Environment

python -m venv venv
.\venv\Scripts\activate

macOS / Linux:

python3 -m venv venv
source venv/bin/activate

Install Dependencies From requirements.txt:

pip install -r requirements.txt

If you don't have a requirements.txt file, you can create one with:

PyQt6
Pillow
google-generativeai
python-dotenv

Or install manually:

pip install PyQt6 Pillow google-generativeai python-dotenv

Configure Your API Key

Create a file named .env in the root directory of the project and add:

GOOGLE_API_KEY="YOUR_API_KEY_HERE"

Ensure your gemini.py script loads this key.

▶️ How to Use

Run the main application script:

python main.py

🔐Controls:

Left-click & drag – Draw on the screen
Right-click – Open the menu:
Clear – Erase all drawings
Minimize – Hide the application window -ask – Send a text-only prompt to the AI
ask with screen – Capture the screen and send it with a prompt to the AI
Quit – Close the application

🙏 Acknowledgements

Built with the powerful PyQt6 framework.
Image processing handled by Pillow.
AI capabilities powered by Google's Gemini API.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
gemini.py		gemini.py
main.py		main.py
requirements.txt		requirements.txt
text.py		text.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

👁️✨ Gemini Lens

🚀 Key Features

🤔 How It Works

🛠️ Installation & Setup

1. Prerequisites

2. Clone the Repository

3. Set Up a Virtual Environment

macOS / Linux:

🔐Controls:

🙏 Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

👁️✨ Gemini Lens

🚀 Key Features

🤔 How It Works

🛠️ Installation & Setup

1. Prerequisites

2. Clone the Repository

3. Set Up a Virtual Environment

macOS / Linux:

🔐Controls:

🙏 Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages