https://github.com/joshdata/crs-reports-scraper
Downloads Congressional Research Service (CRS) reports from the CRS.gov website (which is only visible from within the U.S. Capitol computer network).
https://github.com/joshdata/crs-reports-scraper
Last synced: 11 months ago
JSON representation
Downloads Congressional Research Service (CRS) reports from the CRS.gov website (which is only visible from within the U.S. Capitol computer network).
- Host: GitHub
- URL: https://github.com/joshdata/crs-reports-scraper
- Owner: JoshData
- Created: 2016-03-25T14:20:54.000Z (about 10 years ago)
- Default Branch: primary
- Last Pushed: 2020-06-07T21:45:19.000Z (about 6 years ago)
- Last Synced: 2025-04-12T10:55:50.834Z (about 1 year ago)
- Language: HTML
- Homepage:
- Size: 78.1 KB
- Stars: 7
- Watchers: 4
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
crs-reports-scraper
===================
Downloads Congressional Research Service (CRS) reports from the CRS.gov website (which is only visible from within the U.S. Capitol computer network).
On a Windows computer, download one of the [WinPython Zero](https://winpython.github.io/) packages (we last used WinPython64-3.7.4.1Zero.exe) and run it to extract its contents anywhere, like the `WPy` directory in this folder. If you use a different path, edit the `start.bat` script in this directory so that the first line correctly reflects the location of the WinPython folder.
Double-click `start.bat` to run the main script. It will download the CRS reports and metadata into folders created in this directory.
Put `WPy\python-...\pythonw.exe crs_scraper.py` in the Start Menu to run the scraper regularly. Set the working directory to this directory.
To upload reports to Dropbox, create a file:
```
dropbox_access_token.txt
------------------------
TOKEN=dropbox access token generated at https://www.dropbox.com/developers/apps
PATH=/reports
```