https://github.com/suddi/fundscraper
Collection of web crawlers to scrape fund data using Scrapy
https://github.com/suddi/fundscraper
crawler funds scraper scrapy
Last synced: 16 days ago
JSON representation
Collection of web crawlers to scrape fund data using Scrapy
- Host: GitHub
- URL: https://github.com/suddi/fundscraper
- Owner: suddi
- License: apache-2.0
- Created: 2017-10-04T04:26:56.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2024-05-14T22:16:24.000Z (about 2 years ago)
- Last Synced: 2025-02-12T21:36:38.836Z (over 1 year ago)
- Topics: crawler, funds, scraper, scrapy
- Language: Python
- Size: 28.3 KB
- Stars: 1
- Watchers: 3
- Forks: 2
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# fundscraper
[](https://circleci.com/gh/suddi/fundscraper)
[](https://www.codacy.com/app/Suddi/fundscraper)
[](https://github.com/suddi/fundscraper/blob/master/LICENSE)
[Scrapy](https://scrapy.org/) web crawlers to collect fund data
## Installation
If you have `virtualenvwrapper`, set up a virtual environment with the following command:
````
mkvirtualenv fundscraper
````
Install dependencies:
````
pip install -r requirements.txt
````
Install dev dependencies:
````
python install -r test_requirements.txt
````
## Usage
To crawl utilizing a spider:
````
scrapy crawl aia.com.hk
````
To calculate returns from historic data:
````
python setup.py compute_returns
````
To compute most performant funds:
````
python setup.py compute_performing_funds
````