An open API service indexing awesome lists of open source software.

https://github.com/hendrapaiton/guidestar

Proposal for https://www.upwork.com/jobs/~01cefd390e2a25fcf2 on UpWork dot com
https://github.com/hendrapaiton/guidestar

csv data-extraction freelance pandas python scrapy upwork

Last synced: 15 days ago
JSON representation

Proposal for https://www.upwork.com/jobs/~01cefd390e2a25fcf2 on UpWork dot com

Awesome Lists containing this project

README

          

# About The Project

I make this repository for portfolio when I'm submit data extraction proposal at [UpWork](https://www.upwork.com/jobs/~01cefd390e2a25fcf2). This project using scrapy library for python to download the data from target websites. Parse the data for meet the requirements client. And then transform to data framework and save to csv using pandas framework. I hope this project maybe useful for you untuk learn data extraction using python, scrapy and pandas. Regards!

## Installation

First, clone the repo!
```python
git clone https://github.com/hendrapaiton/guidestar.git
```

Second, make virtual environment in the project.
```python
python3 -m virtualenv venv
source ./venv/bin/activate # in Most Linux
./venv/Scripts/activate # in Windows
```

Third, crawl the spider organization
```python
scrapy crawl organization
```

Last but not least, waiting process until "organization.csv" file created.

### Happy Coding!