Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/im-perativa/public_crawler
A collection of crawler project for Indonesia dataset
https://github.com/im-perativa/public_crawler
crawler indonesia indonesia-api scrapy
Last synced: 27 days ago
JSON representation
A collection of crawler project for Indonesia dataset
- Host: GitHub
- URL: https://github.com/im-perativa/public_crawler
- Owner: im-perativa
- Created: 2022-10-03T08:35:11.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2022-10-19T04:59:01.000Z (about 2 years ago)
- Last Synced: 2023-05-05T02:30:20.706Z (over 1 year ago)
- Topics: crawler, indonesia, indonesia-api, scrapy
- Language: Python
- Homepage:
- Size: 26.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Public Crawler
A collection of crawler project for Indonesia dataset. Data collected will be saved as separate csv files for each item type.## Contents
- [List](#list)
- [Usage](#usage)## List
| Crawler | Description | Website |
|----------|-------------|------|
| [bpjs](https://github.com/im-perativa/public_crawler/tree/main/bpjs) | Healthcare facilities in Indonesia | [https://faskes.bpjs-kesehatan.go.id/aplicares/](https://faskes.bpjs-kesehatan.go.id/aplicares/) |
| [ekatalog](https://github.com/im-perativa/public_crawler/tree/main/ekatalog) | Procurement of goods and services for government institution | [https://e-katalog.lkpp.go.id/](https://e-katalog.lkpp.go.id/) |
| [jobsid](https://github.com/im-perativa/public_crawler/tree/main/jobsid) | Job vacancy | [https://www.jobs.id/](https://www.jobs.id/) |
| [kpu](https://github.com/im-perativa/public_crawler/tree/main/kpu) | 2019 general election result | [https://pemilu2019.kpu.go.id](https://pemilu2019.kpu.go.id) |
| [master_bps](https://github.com/im-perativa/public_crawler/tree/main/master_bps) | Indonesia administrative list with bridging code between Statistics Indonesia and Ministry of Internal Affairs | [https://sig.bps.go.id/bridging-kode/index](https://sig.bps.go.id/bridging-kode/index) |
| [sirs](https://github.com/im-perativa/public_crawler/tree/main/sirs) | Hospitals data from Ministry of Health | [https://sirs.kemkes.go.id](https://sirs.kemkes.go.id) |## Usage
```
pip install -r requirements.txt
cd
scrapy crawl
```