{"id":15132074,"url":"https://github.com/codersomya/amazon-ml-challenge","last_synced_at":"2026-01-18T23:07:45.121Z","repository":{"id":257553614,"uuid":"857290696","full_name":"coderSomya/Amazon-ML-challenge","owner":"coderSomya","description":"Extract dimensional data from image","archived":false,"fork":false,"pushed_at":"2024-09-16T07:21:07.000Z","size":2816,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-05T21:16:38.477Z","etag":null,"topics":["image-processing","machine-learning","opencv","pytesseract","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/coderSomya.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-09-14T09:00:59.000Z","updated_at":"2024-09-17T07:45:53.000Z","dependencies_parsed_at":null,"dependency_job_id":"3ed7f0cc-7894-47d3-9f43-62d94d255f1e","html_url":"https://github.com/coderSomya/Amazon-ML-challenge","commit_stats":null,"previous_names":["codersomya/amazon-ml-challenge"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/coderSomya%2FAmazon-ML-challenge","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/coderSomya%2FAmazon-ML-challenge/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/coderSomya%2FAmazon-ML-challenge/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/coderSomya%2FAmazon-ML-challenge/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/coderSomya","download_url":"https://codeload.github.com/coderSomya/Amazon-ML-challenge/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247399884,"owners_count":20932880,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["image-processing","machine-learning","opencv","pytesseract","python"],"created_at":"2024-09-26T04:03:12.499Z","updated_at":"2026-01-18T23:07:45.074Z","avatar_url":"https://github.com/coderSomya.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Amazon-ML-challenge\nExtract dimensional data from image\n\n\n## get started with:\n\n```\ngit clone https://github.com/coderSomya/Amazon-ML-challenge.git\n```\n\n```\npip install pytesseract\npip install tesseract\n```\n\n__ensure that__\n\n- you have the dataset/train.csv in the same level as the root\n- you have modified the __init()__ function in generate_test.ipynb according to your system\n\n# TODOs:\n\n- improve the thresholding, etc to get better text from the image\n- once text is available, handle the regexs for comparison better\n- Ctrl+F the word __TODO:__ in the files, i have left some undone work that you may wanna get done\n\n\n## pipelining:\n\n@Architecture-1\n```\nTo work on the testing data for each new row,\nwe should\n1) first call a function which extracts the text out of the image in the link\n2) then based on the texts, we generate the data in the format we want...\nX - Y - width - height - orientation - group_id\n3) for each such value:\n 3.1) we first scale and encode the values based on the training constants\n 3.2) and then call the respective model to predict the prob of it being 1.\n 3.3) then get the max out of them and return the value.\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcodersomya%2Famazon-ml-challenge","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcodersomya%2Famazon-ml-challenge","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcodersomya%2Famazon-ml-challenge/lists"}