{"id":30850961,"url":"https://github.com/umaarov/screensight-ai","last_synced_at":"2025-09-07T06:40:48.415Z","repository":{"id":305710741,"uuid":"1019495670","full_name":"umaarov/screensight-ai","owner":"umaarov","description":"A desktop tool to instantly capture any part of your screen and ask questions about it using Google Gemini's multimodal capabilities.","archived":false,"fork":false,"pushed_at":"2025-07-14T12:52:04.000Z","size":25305,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-09-03T02:40:03.742Z","etag":null,"topics":["desktop-app","gemini-api","pillow","pynput","python","screenshot-tool","tkinter"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/umaarov.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-07-14T12:15:26.000Z","updated_at":"2025-08-08T13:37:19.000Z","dependencies_parsed_at":"2025-07-21T17:24:17.243Z","dependency_job_id":"c3cce287-4a7e-4696-a34b-bf08fc005f94","html_url":"https://github.com/umaarov/screensight-ai","commit_stats":null,"previous_names":["umaarov/screensight-ai"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/umaarov/screensight-ai","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/umaarov%2Fscreensight-ai","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/umaarov%2Fscreensight-ai/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/umaarov%2Fscreensight-ai/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/umaarov%2Fscreensight-ai/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/umaarov","download_url":"https://codeload.github.com/umaarov/screensight-ai/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/umaarov%2Fscreensight-ai/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":274005337,"owners_count":25205934,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-07T02:00:09.463Z","response_time":67,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["desktop-app","gemini-api","pillow","pynput","python","screenshot-tool","tkinter"],"created_at":"2025-09-07T06:40:37.368Z","updated_at":"2025-09-07T06:40:48.400Z","avatar_url":"https://github.com/umaarov.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ScreenSight AI\n\nScreenSight AI is a lightweight, powerful desktop application that allows you to capture any portion of your screen and get instant AI-powered analysis using Google's Gemini model. Simply press a hotkey, select a region, and ask your question!\n\n![ScreenSight AI Demo](/assets/asset.gif)\n\n---\n\n## Features\n\n-   **Hotkey Activation**: Instantly trigger the capture mode with a global hotkey (**Ctrl + Shift + F9**).\n-   **Region Selection**: A simple, intuitive overlay to select exactly what you want to analyze.\n-   **Interactive Analysis**: An interactive window displays your screenshot and allows you to ask questions.\n-   **Powered by Gemini**: Leverages the powerful multimodal capabilities of Google's Gemini Vision model.\n-   **Lightweight \u0026 Cross-Platform**: Built with Python and Tkinter for minimal overhead.\n\n---\n\n## How It Works\n\n1.  **Run the App**: The application runs quietly in the background, listening for the hotkey.\n2.  **Press the Hotkey**: Press `Ctrl + Shift + F9` to bring up a semi-transparent overlay.\n3.  **Select a Region**: Click and drag your mouse to draw a box around the area of interest on your screen.\n4.  **Ask a Question**: Once you release the mouse, an analysis window appears with your screenshot. Type your question into the input box.\n5.  **Get Answers**: The AI analyzes the image and your question, providing a detailed response in the text area.\n\n---\n\n## Installation \u0026 Setup\n\nFollow these steps to get ScreenSight AI running on your machine.\n\n### 1. Prerequisites\n\n-   Python 3.8+\n-   A Google Gemini API Key. You can get one from [Google AI Studio](https://aistudio.google.com/app/apikey).\n\n### 2. Clone the Repository\n\n```bash\ngit clone https://github.com/umaarov/ScreenSightAI.git\ncd ScreenSightAI\n```\n\n### 3. Set Up a Virtual Environment\n\nIt's highly recommended to use a virtual environment to manage dependencies.\n\n```bash\n# For Windows\npython -m venv venv\nvenv\\Scripts\\activate\n\n# For macOS \u0026 Linux\npython3 -m venv venv\nsource venv/bin/activate\n```\n\n### 4. Install Dependencies\n\nInstall the required Python packages from the `requirements.txt` file.\n\n```bash\npip install -r requirements.txt\n```\n\n### 5. Configure Your API Key\n\nThe application securely loads your Gemini API key from an environment file.\n\n1.  In the project's root directory, create a new file named `.env`.\n2.  Open the `.env` file and add your API key in the following format:\n\n    ```\n    # .env file\n    GEMINI_API_KEY=\"YOUR_GEMINI_API_KEY_HERE\"\n    ```\n\n**Important**: Never commit your `.env` file or hardcode your API key directly in the source code. The `.gitignore` file in this repository is already configured to prevent this file from being tracked by Git.\n\n---\n\n## Usage\n\nOnce the setup is complete, you can run the application from your terminal.\n\n```bash\npython main.py\n```\n\nYou will see a confirmation message in your terminal: `ScreenSight AI is running. Press Ctrl + Shift + F9 to select a region.`\n\nThe app now runs in the background. You can press the hotkey combination at any time, in any application, to start a capture.\n\n### Customization\n\nYou can easily change the hotkey combination and the UI styling by editing the values in the `src/config.py` file.\n\n---\n\n## Contributing\n\nContributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are **greatly appreciated**.\n\nIf you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag \"enhancement\".\n\n1.  **Fork** the Project\n2.  Create your Feature Branch (`git checkout -b feature/AmazingFeature`)\n3.  Commit your Changes (`git commit -m 'Add some AmazingFeature'`)\n4.  Push to the Branch (`git push origin feature/AmazingFeature`)\n5.  Open a **Pull Request**\n\n---\n\n## License\n\nThis project is distributed under the MIT License. See the `LICENSE` file for more information.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fumaarov%2Fscreensight-ai","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fumaarov%2Fscreensight-ai","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fumaarov%2Fscreensight-ai/lists"}