{"id":19246960,"url":"https://github.com/ikivanc/ocrvisualizer","last_synced_at":"2025-04-21T11:33:18.106Z","repository":{"id":40915742,"uuid":"161171610","full_name":"ikivanc/OCRVisualizer","owner":"ikivanc","description":"Microsoft Cognitive Services, Computer Vision API, OCR Visualizer on documents","archived":false,"fork":false,"pushed_at":"2022-12-08T01:47:29.000Z","size":7825,"stargazers_count":19,"open_issues_count":1,"forks_count":5,"subscribers_count":1,"default_branch":"master","last_synced_at":"2023-02-27T14:47:54.182Z","etag":null,"topics":["cognitive-services","computer-vision","microsoft-cognitive-services","ocr","ocr-visualizer"],"latest_commit_sha":null,"homepage":null,"language":"C#","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ikivanc.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-12-10T12:27:54.000Z","updated_at":"2021-09-18T13:34:32.000Z","dependencies_parsed_at":"2023-01-25T02:15:26.451Z","dependency_job_id":null,"html_url":"https://github.com/ikivanc/OCRVisualizer","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ikivanc%2FOCRVisualizer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ikivanc%2FOCRVisualizer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ikivanc%2FOCRVisualizer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ikivanc%2FOCRVisualizer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ikivanc","download_url":"https://codeload.github.com/iki
vanc/OCRVisualizer/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":223861423,"owners_count":17215887,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cognitive-services","computer-vision","microsoft-cognitive-services","ocr","ocr-visualizer"],"created_at":"2024-11-09T17:35:45.266Z","updated_at":"2024-11-09T17:35:45.882Z","avatar_url":"https://github.com/ikivanc.png","language":"C#","funding_links":[],"categories":[],"sub_categories":[],"readme":"# OCR Visualizer\nOCRVisualizer is a tool that visualizes the JSON output of the Microsoft Cognitive Services OCR API so you can get familiar with the bounding boxes of Regions, Lines and Words. It's written in C#/WPF.\n\nThis tool will be helpful for data discovery if you use OCR with your documents.\n\nThe new preview OCR engine is integrated (through the \"Recognize Text\" API operation) and gives even better text recognition results for English.\n\n## Main Functionality\nIn this version:\n\n* You can see bounding boxes of `Regions`, `Lines` and `Words`\n* You can select layers of bounding boxes to visualize under the `OCR Text Visibility` menu\n* You can see extracted text over your original document\n* You can extract full text as output\n* You can extract Key-Value Pairs\n* Better results with the new preview OCR engine (through \"Recognize Text\", English only)\n\n## Run Executable File\nIf you would like to test OCR Visualizer on your local machine, download the setup file and enter your `subscription Key` and `endpoint` in `OCR Settings`. 
After defining your endpoint, you can visualize your OCR documents via `Browse`.\n\n* [Download OCRVisualizer Executable File on Windows](Setup/Release_Setup.zip) \n\n![](screenshots/setup.png) \n\n\n## Build the project\n\nIn the `App.config` file, replace the values in the code snippet below with your Cognitive Services Computer Vision API subscription key and, if your service is hosted in a region other than `northeurope`, change the region to yours.\n\nMicrosoft Cognitive Services Computer Vision endpoint details:\n\n```xml\n\u003c?xml version=\"1.0\" encoding=\"utf-8\"?\u003e\n\u003cconfiguration\u003e\n  \u003cstartup\u003e\n    \u003csupportedRuntime version=\"v4.0\" sku=\".NETFramework,Version=v4.6.1\" /\u003e\n  \u003c/startup\u003e\n  \u003cappSettings\u003e\n    \u003cadd key=\"subscriptionKey\" value=\"YOUR_COMPUTER_VISION_API_KEY\" /\u003e\n    \u003cadd key=\"endpointRegion\" value=\"https://northeurope.api.cognitive.microsoft.com/vision/v2.0/ocr\" /\u003e\n    \u003cadd key=\"documentLanguage\" value=\"unk\" /\u003e\n    \u003cadd key=\"searchValues\" value=\"Number,Field,Source\" /\u003e\n    \u003cadd key=\"searchValuesWidth\" value=\"300\" /\u003e\n  \u003c/appSettings\u003e\n\u003c/configuration\u003e\n```\n\n## Field Extraction - Key-Value Pairs\n\nIf you're looking for the value of a specific field, you can use this feature. After you define your field names, it looks within a certain pixel range to retrieve a group of text as the value. 
Here is how it works.\n\n![](screenshots/FieldExtraction.png)\n\nYou can define your key-value pairs in `Manage Key-Value` under the `Field Extraction` menu. The values of these fields are detected by their positions on the document, and the detected values are listed in the same panel.\n\n\n![](screenshots/ExtractKeyValueSettings.png)\n\nHere are some examples of document output.\n\nExtract Key-Value Pairs\n\n![](screenshots/ExtractKeyValue.png)\n\n\n## Visualize Bounding Boxes\n\nYou can select layers of bounding boxes to visualize under the `OCR Text Visibility` menu\n![](screenshots/OCRTextVisibility.png)\n\nOCR for unstructured documents.\n![](screenshots/ocroutput2.png)\n\nOCR for a part of documents\n![](screenshots/ocroutput1.png)\n\nOCR from full page documents \n![](screenshots/ocroutput.png)\n\n\nFor more information about Optical character recognition (OCR) in images | [Demo](https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/#text) | [Container Support](https://docs.microsoft.com/en-us/azure/cognitive-services/computer-vision/computer-vision-how-to-install-containers)\n\n\n## OCR Supported languages:\n* unk (AutoDetect)\n* zh-Hans (ChineseSimplified)\n* zh-Hant (ChineseTraditional)\n* cs (Czech)\n* da (Danish)\n* nl (Dutch)\n* en (English)\n* fi (Finnish)\n* fr (French)\n* de (German)\n* el (Greek)\n* hu (Hungarian)\n* it (Italian)\n* ja (Japanese)\n* ko (Korean)\n* nb (Norwegian)\n* pl (Polish)\n* pt (Portuguese)\n* ru (Russian)\n* es (Spanish)\n* sv (Swedish)\n* tr (Turkish)\n* ar (Arabic)\n* ro (Romanian)\n* sr-Cyrl (SerbianCyrillic)\n* sr-Latn (SerbianLatin)\n* sk (Slovak)\n\nThanks.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fikivanc%2Focrvisualizer","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fikivanc%2Focrvisualizer","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fikivanc%2Focrvisualizer/lists"}