https://github.com/kb-perbyte/assignmentkudos
https://github.com/kb-perbyte/assignmentkudos
Last synced: 7 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/kb-perbyte/assignmentkudos
- Owner: KB-perByte
- Created: 2020-11-13T11:43:17.000Z (over 5 years ago)
- Default Branch: main
- Last Pushed: 2020-11-13T13:24:36.000Z (over 5 years ago)
- Last Synced: 2025-02-16T12:57:35.059Z (about 1 year ago)
- Language: Python
- Size: 16.6 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Scrapping data out of EPFO and MCA
How to get started!
- pip3 install -r requirements.txt
OR
- pip install -r requirements.txt
This is wierd, but needed to solve the capthas
- https://medium.com/@ahmetxgenc/how-to-use-tesseract-on-windows-fe9d2a9ba5c6
- Once, Teseract OCR is installed there needs to be a path specification in the code
- statics\captcha_reader.py -> line 86 -> replace the path of tesseract.exe incase it is different
On this level just execute the command listed below to get the API server started
- [AssignmentKudos\AssignmentKudos> uvicorn main:app --reload]
- uvicorn main:app --reload
Move to the server URL
- http://127.0.0.1:8000/docs
- docs coz, it will give swaggerUI to play with the APIs
API specs
for EPFO data [captcha based]
NOTE - there are CAPTCHA retries as OCR reads some captcha wrong, so API might be slow but working :)
- http://127.0.0.1:8000/epfo/flipkart
- curl -X GET "http://127.0.0.1:8000/epfo/flipkart" -H "accept: application/json"
for MAC data [no captcha]
- http://127.0.0.1:8000/mca/flipkart
- curl -X GET "http://127.0.0.1:8000/mca/flipkart" -H "accept: application/json"
for dev info
- curl -X GET "http://127.0.0.1:8000/devInfo/" -H "accept: application/json"