https://github.com/hendrapaiton/guidestar
Proposal for https://www.upwork.com/jobs/~01cefd390e2a25fcf2 on UpWork dot com
https://github.com/hendrapaiton/guidestar
csv data-extraction freelance pandas python scrapy upwork
Last synced: 15 days ago
JSON representation
Proposal for https://www.upwork.com/jobs/~01cefd390e2a25fcf2 on UpWork dot com
- Host: GitHub
- URL: https://github.com/hendrapaiton/guidestar
- Owner: hendrapaiton
- License: mit
- Created: 2022-10-30T09:30:14.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-11-17T09:31:09.000Z (over 3 years ago)
- Last Synced: 2025-02-22T20:47:48.019Z (over 1 year ago)
- Topics: csv, data-extraction, freelance, pandas, python, scrapy, upwork
- Language: Python
- Homepage:
- Size: 19.5 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# About The Project
I make this repository for portfolio when I'm submit data extraction proposal at [UpWork](https://www.upwork.com/jobs/~01cefd390e2a25fcf2). This project using scrapy library for python to download the data from target websites. Parse the data for meet the requirements client. And then transform to data framework and save to csv using pandas framework. I hope this project maybe useful for you untuk learn data extraction using python, scrapy and pandas. Regards!
## Installation
First, clone the repo!
```python
git clone https://github.com/hendrapaiton/guidestar.git
```
Second, make virtual environment in the project.
```python
python3 -m virtualenv venv
source ./venv/bin/activate # in Most Linux
./venv/Scripts/activate # in Windows
```
Third, crawl the spider organization
```python
scrapy crawl organization
```
Last but not least, waiting process until "organization.csv" file created.
### Happy Coding!