{"id":19607141,"url":"https://github.com/robocorp/example-desktop-image-ocr","last_synced_at":"2025-06-19T04:34:05.897Z","repository":{"id":55013549,"uuid":"320516049","full_name":"robocorp/example-desktop-image-ocr","owner":"robocorp","description":"Example robot for automating GnuCash with image templates and OCR","archived":false,"fork":false,"pushed_at":"2024-01-23T09:10:56.000Z","size":31,"stargazers_count":11,"open_issues_count":4,"forks_count":4,"subscribers_count":17,"default_branch":"master","last_synced_at":"2025-04-27T19:46:19.124Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/robocorp.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2020-12-11T08:42:59.000Z","updated_at":"2024-09-17T13:26:45.000Z","dependencies_parsed_at":"2024-01-18T13:40:15.611Z","dependency_job_id":"e3bd97e3-33a9-42d9-89b2-cd06b43a4d2a","html_url":"https://github.com/robocorp/example-desktop-image-ocr","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/robocorp/example-desktop-image-ocr","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-desktop-image-ocr","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-desktop-image-ocr/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-desktop-image-ocr/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-desktop-image-ocr/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/robocorp","download_url":"https://codeload.github.com/robocorp/example-desktop-image-ocr/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-desktop-image-ocr/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":260688048,"owners_count":23046837,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-11T10:09:05.814Z","updated_at":"2025-06-19T04:34:00.887Z","avatar_url":"https://github.com/robocorp.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Desktop automation using images templates and OCR\n\n![GnuCash Logo](https://www.gnucash.org/images/gnc-download.png)\n\nThis robot demonstrates automating a desktop application with image recognition and OCR.\nThe system being automated is a cross-platform free accounting software called [GnuCash](https://www.gnucash.org/).\nThis demo uses a mock account.\n\nMost of the interaction happens by locating a desired text by OCR. Regions and offsets are involved too,\ne.g. when you want to type text into an input field with a text label next to it,\nyou first find the label, then get a region or offset relative to that text and click there.\n\n## Note about OCR\n\nOCR refers to recognizing characters, words, and text in general. Image recognition is more\ngeneric (and simpler) and might refer to the recognition of, e.g. colors and shapes.\n\nOCR or image recognition based automation is usually the last resort in automation.\nWhen thinking whether you should use OCR to automate your application, first investigate if you could:\n\n- Read the raw data programmatically\n- Use native UI selectors\n- Use UI hotkeys\n- Use the clipboard\n\nUsually image recognition must be used when dealing with Citrix (or similar).\nEven then hotkeys and the clipboard are oftentimes more reliable than pure OCR.\n\nAnother use case for OCR is reading text from documents (PDF, JPEG),\nthis demo does not consider that use case at all.\n\nWhen utilizing OCR or image-based locators, it is imperative to execute the process within a consistent environment. This entails maintaining uniform resolution and scaling settings across all executions, as the locators are predominantly calibrated for the specific display configurations for which they were originally designed.\n\n## Requirements\n\nThis robot assumes the running environment has a functional installation\nof [GnuCash](https://www.gnucash.org/). All other dependencies are\nhandled by RCC and conda.yaml.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobocorp%2Fexample-desktop-image-ocr","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frobocorp%2Fexample-desktop-image-ocr","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobocorp%2Fexample-desktop-image-ocr/lists"}