An open API service indexing awesome lists of open source software.

https://github.com/smartdataanalytics/polisis_benchmark

Reproducing state-of-the-art results
https://github.com/smartdataanalytics/polisis_benchmark

Last synced: 5 months ago
JSON representation

Reproducing state-of-the-art results

Awesome Lists containing this project

README

        

# Polisis_Benchmark
Reproducing state-of-the-art results

This repo is our effort to reproduce Polisis results for privacy policy classification based on their paper: https://arxiv.org/abs/1802.02561

# Setup instructions
1. Setup a virtual environment using any tool (e.g., conda) and activate it: conda -n privacy_policy python=3.6 source activate privacy_policy
2. Install dependecies from the requirement file: pip install -r requirement.txt
3. install NLTK tokenizer: python -m nltk.downloader punkt

# Usage instructions
To run the experiment: python -u cnn_multi_label_classifier.py

Parameters can be found in args.py

Important Note: By default the code will use GloVe embeddings. Due to licesing the in-domain embeddings can be provided only upon request.