Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/takluyver/spotting-stat-claims
Fullfact hackathon - identifying statistical claims
https://github.com/takluyver/spotting-stat-claims
Last synced: 14 days ago
JSON representation
Fullfact hackathon - identifying statistical claims
- Host: GitHub
- URL: https://github.com/takluyver/spotting-stat-claims
- Owner: takluyver
- License: mit
- Created: 2017-05-06T14:13:30.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-05-06T16:37:56.000Z (over 7 years ago)
- Last Synced: 2024-12-21T11:41:33.036Z (23 days ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 25.4 KB
- Stars: 1
- Watchers: 5
- Forks: 2
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Spotting statistical claims
Full Fact Hackathon at PyData London 2017
We are scanning text to identify statistical claims. We look for numbers,
monetary amounts, percentages, and terms such as 'increase' or 'decrease'. We
try to avoid false positives, such as "Article 37".This works with XML files [from this Hansard data set](https://fullfact.org/media/hansard.zip).
To run the demo:
./statclaims.py --html hansard/src/debates2017-01-23d.xml
Then open `index.html` to see the output. You can pass multiple files at once.
Leaving out `--html` will show output in the terminal.Notes [in a Google Doc](https://docs.google.com/document/d/1Bc3wGsILL6EaWMXLJPDLIwMDk3R-k4o-AF-7ILgCEDQ/edit?usp=sharing).