https://github.com/kb-perbyte/assignmentkudos

Last synced: 9 months ago
JSON representation

Host: GitHub
URL: https://github.com/kb-perbyte/assignmentkudos
Owner: KB-perByte
Created: 2020-11-13T11:43:17.000Z (over 5 years ago)
Default Branch: main
Last Pushed: 2020-11-13T13:24:36.000Z (over 5 years ago)
Last Synced: 2025-02-16T12:57:35.059Z (over 1 year ago)
Language: Python
Size: 16.6 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

Scrapping data out of EPFO and MCA

How to get started!
- pip3 install -r requirements.txt
OR
- pip install -r requirements.txt

This is wierd, but needed to solve the capthas
- https://medium.com/@ahmetxgenc/how-to-use-tesseract-on-windows-fe9d2a9ba5c6
- Once, Teseract OCR is installed there needs to be a path specification in the code
- statics\captcha_reader.py -> line 86 -> replace the path of tesseract.exe incase it is different

On this level just execute the command listed below to get the API server started
- [AssignmentKudos\AssignmentKudos> uvicorn main:app --reload]
- uvicorn main:app --reload

Move to the server URL
- http://127.0.0.1:8000/docs
- docs coz, it will give swaggerUI to play with the APIs

API specs
for EPFO data [captcha based]
NOTE - there are CAPTCHA retries as OCR reads some captcha wrong, so API might be slow but working :)
- http://127.0.0.1:8000/epfo/flipkart
- curl -X GET "http://127.0.0.1:8000/epfo/flipkart" -H "accept: application/json"
for MAC data [no captcha]
- http://127.0.0.1:8000/mca/flipkart
- curl -X GET "http://127.0.0.1:8000/mca/flipkart" -H "accept: application/json"
for dev info
- curl -X GET "http://127.0.0.1:8000/devInfo/" -H "accept: application/json"

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/kb-perbyte/assignmentkudos

Awesome Lists containing this project

README