https://github.com/pjsier/chicago-police-data-cleanup
Data cleaning for October 2016 Chicago police misconduct FOIA release
- Host: GitHub
- URL: https://github.com/pjsier/chicago-police-data-cleanup
- Owner: pjsier
- License: mit
- Created: 2016-11-18T03:14:47.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2017-04-17T00:44:25.000Z (about 9 years ago)
- Last Synced: 2025-01-03T23:29:48.691Z (over 1 year ago)
- Language: Jupyter Notebook
- Size: 31.3 MB
- Stars: 0
- Watchers: 3
- Forks: 1
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
# Chicago Police Misconduct FOIA Data Cleanup
The source data is available through the links [on this page](http://thememoryhole2.org/blog/cpd-complaints).
The PDFs were converted with `pdftohtml` using the options `-s -i -c`, so that the output keeps the style attributes that give each text element's location on the page.
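As a rough illustration of how those style attributes can be used, the sketch below reads `top`/`left` pixel offsets from absolutely positioned `<p>` elements, which is the general shape of `pdftohtml -c` output. It is not the code from this repository's notebooks. The class name, the function name, and the sample markup are made up for the example, and the exact markup of the real output is assumed rather than confirmed.

```python
import re
from html.parser import HTMLParser


class PositionedTextParser(HTMLParser):
    """Collect (top, left, text) for each <p> that carries top/left offsets."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._pos = None   # (top, left) of the <p> currently being read
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag != "p":
            return
        style = dict(attrs).get("style") or ""
        top = re.search(r"top:\s*(\d+)px", style)
        left = re.search(r"left:\s*(\d+)px", style)
        if top and left:
            self._pos = (int(top.group(1)), int(left.group(1)))
            self._text = []

    def handle_data(self, data):
        # Only keep text that sits inside a positioned <p>
        if self._pos is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "p" and self._pos is not None:
            self.rows.append((*self._pos, "".join(self._text).strip()))
            self._pos = None


def extract_positioned_text(html):
    """Return (top, left, text) tuples sorted top-to-bottom, then left-to-right."""
    parser = PositionedTextParser()
    parser.feed(html)
    parser.close()
    return sorted(parser.rows)


# Made-up sample markup, not taken from the FOIA release
sample = (
    '<p style="position:absolute;top:120px;left:300px;white-space:nowrap">DOE, JOHN</p>'
    '<p style="position:absolute;top:120px;left:40px;white-space:nowrap">1000001</p>'
)
print(extract_positioned_text(sample))
# [(120, 40, '1000001'), (120, 300, 'DOE, JOHN')]
```

Sorting by `top` and then `left` groups text from the same visual row together, which is one way to rebuild table rows from positioned fragments.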
### Notes
* `NAME_NO_INITIAL` is added so that the pre-2001 and post-2001 data can be merged
* The `_dedupe.csv` version has all duplicate `CR_NO`/`NAME_NO_INITIAL` pairs removed
* `combined_officer_complaints.csv` leaves in redundant data because there are
some acknowledged inconsistencies
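
The notes above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the logic used in the notebooks: it assumes a hypothetical `NAME` column in `LAST, FIRST M` form, assumes that `NAME_NO_INITIAL` is that name with a trailing middle initial dropped, and assumes that deduplication keeps the first row seen for each `CR_NO`/`NAME_NO_INITIAL` pair.

```python
import re


def name_no_initial(name):
    """Normalize case and spacing, then drop a trailing one-letter middle initial.

    Assumption: names look like 'DOE, JOHN A' (hypothetical format).
    """
    cleaned = " ".join(name.upper().split())
    return re.sub(r"\s+[A-Z]\.?$", "", cleaned)


def dedupe(rows):
    """Keep the first row for each (CR_NO, NAME_NO_INITIAL) pair.

    `rows` is an iterable of dicts with 'CR_NO' and a hypothetical 'NAME' key.
    """
    seen = set()
    unique = []
    for row in rows:
        # Copy the row and attach the derived merge key
        row = dict(row, NAME_NO_INITIAL=name_no_initial(row["NAME"]))
        key = (row["CR_NO"], row["NAME_NO_INITIAL"])
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Made-up rows for illustration only
rows = [
    {"CR_NO": "1000001", "NAME": "DOE, JOHN A"},
    {"CR_NO": "1000001", "NAME": "DOE, JOHN"},
    {"CR_NO": "1000002", "NAME": "DOE, JOHN A"},
]
for row in dedupe(rows):
    print(row["CR_NO"], row["NAME_NO_INITIAL"])
```

Keeping the first occurrence is a design choice made for this sketch. Because the combined file deliberately retains redundant rows, any real deduplication rule should be checked against the notebooks before relying on it.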