{"id":14958919,"url":"https://github.com/harshuljain13/ocr","last_synced_at":"2025-10-24T16:30:32.432Z","repository":{"id":70149365,"uuid":"136687010","full_name":"harshuljain13/OCR","owner":"harshuljain13","description":"Optical character recognition Using Deep Learning ","archived":false,"fork":false,"pushed_at":"2018-06-17T07:26:47.000Z","size":7862,"stargazers_count":30,"open_issues_count":0,"forks_count":7,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-01-31T02:44:28.863Z","etag":null,"topics":["cnn","ctc-loss","deeplearning","lstm","lstm-neural-networks","optical-character-recognition","rcnn","tensorflow","tensorflow-experiments"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/harshuljain13.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2018-06-09T03:20:06.000Z","updated_at":"2024-12-10T06:33:45.000Z","dependencies_parsed_at":"2023-02-23T11:46:17.506Z","dependency_job_id":null,"html_url":"https://github.com/harshuljain13/OCR","commit_stats":null,"previous_names":["harshuljain13/ocr"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harshuljain13%2FOCR","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harshuljain13%2FOCR/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harshuljain13%2FOCR/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harshuljain13%2FOCR/manifests","owner_url":"https://repos.ecosyste.m
s/api/v1/hosts/GitHub/owners/harshuljain13","download_url":"https://codeload.github.com/harshuljain13/OCR/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":237999477,"owners_count":19399887,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cnn","ctc-loss","deeplearning","lstm","lstm-neural-networks","optical-character-recognition","rcnn","tensorflow","tensorflow-experiments"],"created_at":"2024-09-24T13:18:31.985Z","updated_at":"2025-10-24T16:30:31.970Z","avatar_url":"https://github.com/harshuljain13.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Optical Character Recognition Using Deep Learning\n ![Er. Harshul Jain, author](https://img.shields.io/badge/Author-Er.%20Harshul%20Jain%20-blue.svg)\n\nText is everywhere! It is present in PDFs, documents, and images. Many applications rely on extracted text for analytics, including receipt recognition, number plate detection, and extracting LaTeX formulas from images. Classical computer vision techniques can be used for such tasks, but they often lack accuracy. To address the low accuracy and variance problems, we use state-of-the-art deep neural networks.\n\nThis repository includes:\n```\n1. A TensorFlow implementation of the CNN+LSTM+CTC model for OCR.\n2. 
Supporting scripts to apply the RCNN approach for OCR.\n```\n\n### Architecture\n![Architecture](images/cnn_lstm_Architecture.jpeg)\n\n### Instructions on How to run\n\nGet the repository\n```\ngit clone https://github.com/harshul1610/OCR.git\n```\n\nGet the NIST19 dataset\n```\nmkdir data\nwget https://s3.amazonaws.com/nist-srd/SD19/by_class.zip\nunzip by_class.zip\nmv by_class NIST19\n```\n\nGet the Captcha data\n```\ncd OCR\npython2 generate_captcha.py\n```\n\nRun the final notebook for training and testing\n```\nCNN_LSTM_CTC_OCR-captcha.ipynb\n```\n\n### LICENSE\n\nMIT\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fharshuljain13%2Focr","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fharshuljain13%2Focr","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fharshuljain13%2Focr/lists"}