https://github.com/deardurham/ciprs-reader
Python library for reading CIPRS PDFs
codeforamerica coverage docker pdf pdftotext pytest python
- Host: GitHub
- URL: https://github.com/deardurham/ciprs-reader
- Owner: deardurham
- License: bsd-3-clause
- Created: 2019-04-06T02:01:09.000Z (over 5 years ago)
- Default Branch: main
- Last Pushed: 2023-10-25T02:18:41.000Z (about 1 year ago)
- Last Synced: 2024-04-29T00:10:06.934Z (8 months ago)
- Topics: codeforamerica, coverage, docker, pdf, pdftotext, pytest, python
- Language: Jupyter Notebook
- Homepage:
- Size: 1.61 MB
- Stars: 2
- Watchers: 9
- Forks: 7
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# CIPRS Reader
[![Build Status](https://travis-ci.org/deardurham/ciprs-reader.svg?branch=master)](https://travis-ci.org/deardurham/ciprs-reader)

## Setup and Run:
Add the PDF file you want to parse to the `ignore/` folder, then run:
```bash
docker build -t ciprs-reader .
docker run --rm -v /$(pwd):/usr/src/app ciprs-reader python ciprs-reader.py ignore/cypress-example.pdf
```

Example output:
```json
[
  {
    "General": {
      "County": "DURHAM",
      "File No": "00GR000000"
    },
    "Case Information": {
      "Case Status": "DISPOSED",
      "Offense Date": "2018-01-01T20:00:00"
    },
    "Defendant": {
      "Date of Birth/Estimated Age": "1990-01-01",
      "Name": "DOE,JON,BOJACK",
      "Race": "WHITE",
      "Sex": "MALE"
    },
    "District Court Offense Information": [
      {
        "Records": [
          {
            "Action": "CHARGED",
            "Description": "SPEEDING(70 mph in a 50 mph zone)",
            "Severity": "TRAFFIC",
            "Law": "20-141(J1)"
          }
        ],
        "Disposed On": "2010-01-01",
        "Disposition Method": "DISMISSAL WITHOUT LEAVE BY DA"
      }
    ],
    "Superior Court Offense Information": []
  }
]
```

## Local Setup
Pre-requisites:
Mac

```bash
brew install --cask pdftotext
```

Ubuntu

```bash
sudo apt-get install -y poppler-utils
```

```bash
wget --no-check-certificate https://dl.xpdfreader.com/xpdf-tools-linux-4.04.tar.gz \
&& tar -xvf xpdf-tools-linux-4.04.tar.gz \
&& cp xpdf-tools-linux-4.04/bin64/pdftotext /usr/local/bin/pdftotext-4
```

Setup:
```bash
pip install -r requirements.txt
pip install -e .
```

Read CIPRS PDF:
```bash
python ciprs_reader.py ./cypress-example.pdf
```

Run Jupyter:
```bash
jupyter-lab
```

Run tests:
```bash
pytest --pylint
```

Code for Durham
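
As a follow-up to the example output shown earlier, here is a minimal sketch of consuming that JSON from Python. It assumes the reader's output has been captured as text and follows the key names in the example. `summarize_offenses` and `COURT_KEYS` are hypothetical names used only for illustration and are not part of the library, and real output may contain fields that the trimmed sample omits.

```python
import json

# Trimmed sample shaped like the example output in this README.
# Real output may contain more keys; this is an illustrative assumption.
SAMPLE = """
[
  {
    "General": {"County": "DURHAM", "File No": "00GR000000"},
    "Case Information": {"Case Status": "DISPOSED"},
    "District Court Offense Information": [
      {
        "Records": [
          {
            "Action": "CHARGED",
            "Description": "SPEEDING(70 mph in a 50 mph zone)",
            "Severity": "TRAFFIC",
            "Law": "20-141(J1)"
          }
        ],
        "Disposed On": "2010-01-01",
        "Disposition Method": "DISMISSAL WITHOUT LEAVE BY DA"
      }
    ],
    "Superior Court Offense Information": []
  }
]
"""

# Section names taken from the example output.
COURT_KEYS = (
    "District Court Offense Information",
    "Superior Court Offense Information",
)


def summarize_offenses(cases):
    """Flatten parsed cases into one row per offense record (hypothetical helper)."""
    rows = []
    for case in cases:
        file_no = case.get("General", {}).get("File No")
        for court_key in COURT_KEYS:
            for offense in case.get(court_key, []):
                for record in offense.get("Records", []):
                    rows.append({
                        "file_no": file_no,
                        # "District Court Offense Information" -> "District"
                        "court": court_key.split(" Court")[0],
                        "description": record.get("Description"),
                        "severity": record.get("Severity"),
                        "disposition": offense.get("Disposition Method"),
                    })
    return rows


rows = summarize_offenses(json.loads(SAMPLE))
for row in rows:
    print(row["file_no"], row["court"], row["description"], row["disposition"], sep=" | ")
```

Using `.get()` with defaults keeps the sketch tolerant of records where a section is missing or empty, as with the empty Superior Court list in the example.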