{"id":26779523,"url":"https://github.com/arulkumarann/visual-sense","last_synced_at":"2025-03-29T06:17:37.613Z","repository":{"id":227883957,"uuid":"772533309","full_name":"arulkumarann/visual-sense","owner":"arulkumarann","description":null,"archived":false,"fork":false,"pushed_at":"2024-03-16T23:38:12.000Z","size":11628,"stargazers_count":0,"open_issues_count":0,"forks_count":2,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-05-12T09:40:29.136Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/arulkumarann.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-03-15T11:32:58.000Z","updated_at":"2024-05-12T09:40:30.946Z","dependencies_parsed_at":"2024-05-12T09:50:30.628Z","dependency_job_id":null,"html_url":"https://github.com/arulkumarann/visual-sense","commit_stats":null,"previous_names":["arul-5/visual-sense","arulkumarann/visual-sense"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/arulkumarann%2Fvisual-sense","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/arulkumarann%2Fvisual-sense/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/arulkumarann%2Fvisual-sense/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/arulkumarann%2Fvisual-sense/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/arulkumarann","download_url":"https://codel
oad.github.com/arulkumarann/visual-sense/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246145024,"owners_count":20730495,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-03-29T06:17:37.051Z","updated_at":"2025-03-29T06:17:37.608Z","avatar_url":"https://github.com/arulkumarann.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003ch1 align=\"center\" id=\"title\"\u003eVisualSense\u003c/h1\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/arul-5/visual-sense/assets/142246653/958a1342-dec3-4c68-807d-b0987be837cd\" alt=\"VisualSense\" width=80\u003e\n\u003c/p\u003e\n\n\n\u003cp id=\"description\"\u003eWelcome to the VisualSense repository! The problem being addressed by this project is the significant barrier faced by visually impaired individuals in comprehending visual content in their surroundings. Whether encountering images in real-time through a camera or uploading pictures for analysis, visually impaired individuals often struggle to understand the contents of these images without sighted assistance. This project seeks to bridge this gap by\ndeveloping a web application that utilizes Visual Question Answering (VQA) technology. Through VQA, users can interactively ask questions about images, enabling them to gain a better understanding of the visual content independently. 
By providing this tool, the project aims to enhance the accessibility of visual information for visually impaired individuals and promote their autonomy and inclusion.\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/arul-5/visual-sense/assets/113288547/985414d9-45cc-4f4b-9959-71190e6fc027\" alt=\"VisualSense\"\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/arul-5/visual-sense/assets/113288547/673b28c0-2042-436e-bfc7-1b01f3d804e6\" alt=\"VisualSense\"\u003e\n\u003c/p\u003e\n\n\u003ch2\u003eFeatures:\u003c/h2\u003e\n \n* Real-time Image Analysis: Instantly analyze images from the device's camera.\n\n* Interactive Questioning: Users ask questions about image content using natural language.\n\n* AI-driven Answering: Spoken answers generated based on image analysis and user queries.\n  \n* Flexible Image Upload: Ability to upload images for analysis from device storage.\n  \n* Accessibility Features: User-friendly interface with compatibility for screen readers.\n  \n* Promotes Independence: Empowers visually impaired individuals to access visual information independently.\n  \n* Inclusion: Reduces reliance on sighted assistance, fostering greater inclusion.\n\n\n\n\n\u003ch2\u003e🛠 Installation Steps:\u003c/h2\u003e\n\n\u003cp\u003e1. Front-End\u003c/p\u003e\n\n```\ncd frontend\n```\n\n\n```\nnpm i\n```\n\n```\nnpm run dev\n```\n\n\u003cp\u003e2. 
Back-End\u003c/p\u003e\n\n```\ncd backend\n```\n\n```\nuvicorn index:app --reload\n```\n\n\u003ch2\u003e💻 Built with\u003c/h2\u003e\n\nTechnologies used in the project:\n\n*   PyTorch\n*   FastAPI\n*   Next.js\n*   Hugging Face Transformers\n*   Gemini API\n   \nIf you encounter any issues or have suggestions for improvements, please create an issue or pull request on GitHub.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Farulkumarann%2Fvisual-sense","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Farulkumarann%2Fvisual-sense","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Farulkumarann%2Fvisual-sense/lists"}