https://github.com/austinoboyle/scrape_solus

Last synced: 4 months ago
JSON representation

Host: GitHub
URL: https://github.com/austinoboyle/scrape_solus
Owner: austinoboyle
Created: 2018-04-09T01:30:08.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2019-10-02T15:20:14.000Z (about 6 years ago)
Last Synced: 2025-01-30T21:32:43.520Z (8 months ago)
Language: Python
Size: 33.2 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# scrape_solus

## Introduction

`scrape_solus` is a python utility to scrape all details from from the Queen's
course enrollment site.

## Installation

1. `git clone https://github.com/austinoboyle/scrape_solus` and cd into the folder
2. (Ideal, but not necessary) create a virtual env
3. `pip install -e .` to install the project in development mode. This will allow you to easily make changes on the fly.
4. Set your SOLUS_USER and SOLUS_PASS env variables
5. You should now have the `scrapesolus` command available. Run `scrapesolus --help` to see available commands.

## CLI (scrapsolus)

Usage: scrapesolus [OPTIONS]

Options:

- -t, --scrape_type alpha|interval (default=alpha). alpha: each job scrapes a letter. interval: each job scrapes every Nth course.
- -n, --num_workers INTEGER (default=8) number of selenium instances to run in parallel
- -o, --output_dir PATH Output directory for data dump
- -d, --deep BOOLEAN Do you want Section Data?
- -h, --headless BOOLEAN (default: True). Set to False for debugging.
- -l, --letter TEXT Scrape all courses that start with this letter
- -c, --course_code TEXT Scrape a specific course code
- --help Show this message and exit.

## Examples

### Scrape a specific course

`scrapesolus -c "MATH 281"`

### Scrape a course without headless selenium for debugging

`scrapesolus -h False -c "MATH 281"`

### Scrape a specific course and only want the course info/description (no sections/schedule data)?

`scrapesolus -c "MATH 281" -d False`

### Scrape all course codes beginning with the letter A

`scrapesolus -l A`

### Full scrape of courses with (default) 8 workers

`scrapesolus`

### Full scrape with 2 workers

`scrapesolus -n 2`

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/austinoboyle/scrape_solus

Awesome Lists containing this project

README