{"id":23654278,"url":"https://github.com/deardurham/ciprs-reader","last_synced_at":"2025-09-01T05:31:27.603Z","repository":{"id":34963488,"uuid":"179778073","full_name":"deardurham/ciprs-reader","owner":"deardurham","description":"Python library for reading CIPRS PDFs","archived":false,"fork":false,"pushed_at":"2023-10-25T02:18:41.000Z","size":1691,"stargazers_count":2,"open_issues_count":0,"forks_count":8,"subscribers_count":8,"default_branch":"main","last_synced_at":"2025-04-02T10:05:05.681Z","etag":null,"topics":["codeforamerica","coverage","docker","pdf","pdftotext","pytest","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/deardurham.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2019-04-06T02:01:09.000Z","updated_at":"2022-01-19T23:01:47.000Z","dependencies_parsed_at":"2022-08-09T01:15:13.153Z","dependency_job_id":"525fa017-e2da-46ff-87cd-89a1c58811ed","html_url":"https://github.com/deardurham/ciprs-reader","commit_stats":null,"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"purl":"pkg:github/deardurham/ciprs-reader","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deardurham%2Fciprs-reader","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deardurham%2Fciprs-reader/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deardurham%2Fciprs-reader/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deardurham%2Fciprs-reader/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/deardurham","download_url":"https://codeload.github.com/deardurham/ciprs-reader/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deardurham%2Fciprs-reader/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":273077227,"owners_count":25041358,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-01T02:00:09.058Z","response_time":120,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["codeforamerica","coverage","docker","pdf","pdftotext","pytest","python"],"created_at":"2024-12-28T18:58:33.542Z","updated_at":"2025-09-01T05:31:27.241Z","avatar_url":"https://github.com/deardurham.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# CIPRS Reader\n[![Build Status](https://travis-ci.org/deardurham/ciprs-reader.svg?branch=master)](https://travis-ci.org/deardurham/ciprs-reader)\n\n## Setup and Run:\n\nAdd pdf file to parse in /ignore folder then run:\n\n```bash\ndocker build -t ciprs-reader .\ndocker run --rm -v /$(pwd):/usr/src/app ciprs-reader python ciprs-reader.py ignore/cypress-example.pdf\n```\n\nExample output:\n\n```json\n[\n    {\n        \"General\": {\n            \"County\": \"DURHAM\",\n            \"File No\": \"00GR000000\"\n        },\n        \"Case Information\": {\n            \"Case Status\": \"DISPOSED\",\n            \"Offense Date\": \"2018-01-01T20:00:00\"\n        },\n        \"Defendant\": {\n            \"Date of Birth/Estimated Age\": \"1990-01-01\",\n            \"Name\": \"DOE,JON,BOJACK\",\n            \"Race\": \"WHITE\",\n            \"Sex\": \"MALE\"\n        },\n        \"District Court Offense Information\": [\n            {\n                \"Records\": [\n                    {\n                        \"Action\": \"CHARGED\",\n                        \"Description\": \"SPEEDING(70 mph in a 50 mph zone)\",\n                        \"Severity\": \"TRAFFIC\",\n                        \"Law\": \"20-141(J1)\"\n                    }\n                ],\n                \"Disposed On\": \"2010-01-01\",\n                \"Disposition Method\": \"DISMISSAL WITHOUT LEAVE BY DA\"\n            }\n        ],\n        \"Superior Court Offense Information\": [],\n    }\n]\n```\n\n## Local Setup\n\nPre-requisites:\n\nMac\n```\nbrew cask install pdftotext\n```\n\nUbuntu\n```\nsudo apt-get install -y poppler-utils\n```\n\n```\nwget --no-check-certificate https://dl.xpdfreader.com/xpdf-tools-linux-4.04.tar.gz \\\n    \u0026\u0026 tar -xvf xpdf-tools-linux-4.04.tar.gz \\\n    \u0026\u0026 cp xpdf-tools-linux-4.04/bin64/pdftotext /usr/local/bin/pdftotext-4\n```\n\nSetup:\n\n```bash\npip install -r requirements.txt\npip install -e .\n```\n\nRead CIPRS PDF:\n\n```bash\npython ciprs_reader.py ./cypress-example.pdf\n```\n\nRun Jupyter:\n\n```bash\njupyter-lab\n```\n\nRun tests:\n\n```bash\npytest --pylint\n```\n\nCode for Durham\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdeardurham%2Fciprs-reader","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdeardurham%2Fciprs-reader","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdeardurham%2Fciprs-reader/lists"}