An open API service indexing awesome lists of open source software.

https://github.com/bradmontgomery/django-janitor

django-janitor allows you to use bleach to clean HTML stored in a Model's field.
https://github.com/bradmontgomery/django-janitor

django html html-parsing python whitelist

Last synced: 6 months ago
JSON representation

django-janitor allows you to use bleach to clean HTML stored in a Model's field.

Awesome Lists containing this project

README

          

Overview
========

django-janitor allows you to use [bleach](https://bleach.readthedocs.io/)
to clean HTML stored in arbitrary Models.

This is useful when you've got HTML content stored in a 3rd-party app, but
you'd like to use a whitelist for allowed HTML tags.

## Features

* django-janitor listens for a Models' `pre_save` signal, and runs it's content
through bleach.
* Allowed HTML tags, attributes, and css are specified in a whitelist
* Whitelists are managed in Django's Admin
* There are (sensible) default whitelists included

## Requirements

* Requires Bleach
* Works with Django 1.11

**NOTE on upgrading to version 0.5.0**: This version officially supports
Python 3 and Django 1.11. Prior versions of this app included south migrations
and only supported something like Django 1.4. If you're previously used this,
you'll have to figure out how to upgrade your existing models (sorry).

## Installation

To install the most recent release:

pip install django-janitor

Or to install from the current repository:

pip install git+https://github.com/bradmontgomery/django-janitor

Then, add `janitor` to your installed apps, and run the migrations.

python manage.py migrate

## Usage

Visit the Janitor app in Django's Admin, and create a new *Field sanitizer*.
Then select the Model and specify the fieldname which should be cleaned. After
you set up the whitelists for Tags, Attributes, etc, save the Field sanitizer.

From now on, when the Model is saved, it's content in will be cleaned using `bleach`.

![screensho](https://raw.githubusercontent.com/bradmontgomery/django-janitor/master/screenshot.png)

## Tests

There are a few tests in `janitor/tests`. You can run these with:

python manage.py tests janitor

## Management Commands

There are a few managment commands avaialable to make it easier to use
django-janitor. The first is `clean_all` which will look at all of the models
that have a related Field Sanitizer, calling the models' `save` method to
trigger the `pre_save` signal (which forces the fields
to be cleaned):

python manage.py clean_all

This is useful if you've created a `FieldSanitizer` for a model with
existing content.

The second management command is `clean_model`, which works in a similar
fashion, but allows you to specify an app and a model::

python manage.py clean_model myapp.MyModel

Finally, `list_html_elements` and `list_html_elements_for_model` exist to
help you discover what HTML tags are being used in existing content. While
these commands do require that a `FieldSanitizer` be configured for existing
Models, they may be used to help you decide which tags to include in a
whitelist.

You should run these commands before using `clean_all` or `clean_model` to
see what sort of data exists before it's cleaned:

python manage.py list_html_elements

Or:

python manage.py list_html_elements_for_model myapp.MyModel