{"id":19151209,"url":"https://github.com/hendrapaiton/guidestar","last_synced_at":"2026-05-17T10:36:18.317Z","repository":{"id":121382286,"uuid":"559490955","full_name":"hendrapaiton/guidestar","owner":"hendrapaiton","description":"Proposal for https://www.upwork.com/jobs/~01cefd390e2a25fcf2 on UpWork dot com","archived":false,"fork":false,"pushed_at":"2022-11-17T09:31:09.000Z","size":20,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-02-22T20:47:48.019Z","etag":null,"topics":["csv","data-extraction","freelance","pandas","python","scrapy","upwork"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/hendrapaiton.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-10-30T09:30:14.000Z","updated_at":"2022-10-31T12:42:26.000Z","dependencies_parsed_at":null,"dependency_job_id":"52859cb0-0fae-4103-96bd-9f5881a7fff5","html_url":"https://github.com/hendrapaiton/guidestar","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/hendrapaiton/guidestar","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hendrapaiton%2Fguidestar","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hendrapaiton%2Fguidestar/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hendrapaiton%2Fguidestar/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hendrapaiton
%2Fguidestar/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/hendrapaiton","download_url":"https://codeload.github.com/hendrapaiton/guidestar/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hendrapaiton%2Fguidestar/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33135105,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-17T09:28:26.183Z","status":"ssl_error","status_checked_at":"2026-05-17T09:27:52.702Z","response_time":107,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["csv","data-extraction","freelance","pandas","python","scrapy","upwork"],"created_at":"2024-11-09T08:14:07.296Z","updated_at":"2026-05-17T10:36:18.279Z","avatar_url":"https://github.com/hendrapaiton.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# About The Project\n\nI made this repository as a portfolio piece when I submitted a data extraction proposal on [UpWork](https://www.upwork.com/jobs/~01cefd390e2a25fcf2). This project uses the Scrapy library for Python to download data from the target websites and parses it to meet the client's requirements. It then loads the parsed data into a pandas DataFrame and saves it to CSV. 
I hope this project is useful to you for learning data extraction with Python, Scrapy and pandas. Regards!\n\n\n\n## Installation\n\nFirst, clone the repo!\n```shell\ngit clone https://github.com/hendrapaiton/guidestar.git\n```\n\nSecond, create a virtual environment in the project.\n```shell\npython3 -m virtualenv venv\nsource ./venv/bin/activate # on most Linux systems\n./venv/Scripts/activate    # on Windows\n```\n\nThird, run the `organization` spider.\n```shell\nscrapy crawl organization\n```\n\nLast but not least, wait until the \"organization.csv\" file is created.\n\n\n### Happy Coding!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhendrapaiton%2Fguidestar","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhendrapaiton%2Fguidestar","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhendrapaiton%2Fguidestar/lists"}