https://github.com/robinboers/html-sanitize-ex

A fast and straightforward HTML Sanitizer written in Elixir which lets you include HTML authored by third-parties in your web application while protecting against XSS. A fork maintained by me with some minor adjustments.
https://github.com/robinboers/html-sanitize-ex

Last synced: 8 months ago
JSON representation

Host: GitHub
URL: https://github.com/robinboers/html-sanitize-ex
Owner: RobinBoers
License: mit
Created: 2021-11-25T16:03:54.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2021-12-06T15:31:38.000Z (over 4 years ago)
Last Synced: 2024-12-28T17:32:54.873Z (over 1 year ago)
Language: Elixir
Homepage:
Size: 516 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE

Awesome Lists containing this project

README

          # HtmlSanitizeEx [![Build Status](https://travis-ci.org/rrrene/html_sanitize_ex.svg)](https://travis-ci.org/rrrene/html_sanitize_ex) [![Inline docs](http://inch-ci.org/github/rrrene/html_sanitize_ex.svg?branch=master)](http://inch-ci.org/github/rrrene/html_sanitize_ex)

`html_sanitize_ex` provides a fast and straightforward HTML Sanitizer written in Elixir which lets you include HTML authored by third-parties in your web application while protecting against XSS.

This is a fork that adds the `no_image` mode, which allows basic HTML, but strips all the images. I used this in Nindo, because images would be a big performance hit when loading pages and causes problems when the image paths were relative.

It is the first Hex package to come out of the [elixirstatus.com](http://elixirstatus.com) project, where it will be used to sanitize user announcements from the Elixir community.

## What can it do?

`html_sanitize_ex` parses a given HTML string and, based on the used [Scrubber](https://github.com/rrrene/html_sanitize_ex/tree/master/lib/html_sanitize_ex/scrubber), either completely strips it from HTML tags or sanitizes it by only allowing certain HTML elements and attributes to be present.

**NOTE:** The one thing missing at this moment is ***support for styles***. To add this, we have to implement a Scrubber for CSS, to prevent nasty CSS hacks using `` tags and attributes.

Otherwise `html_sanitize_ex` is a full-featured HTML sanitizer.

## Installation

Add `html_sanitize_ex` as a dependency in your `mix.exs` file.

```elixir

defp deps do

  [{:html_sanitize_ex, "~> 1.4"}]

end

```

After adding you are done, run `mix deps.get` in your shell to fetch the new dependency.

The only dependency of `html_sanitize_ex` is `mochiweb` which is used to parse HTML.

## Usage

It can strip all tags from the given string:

```elixir

text = "<a href=\"javascript:alert('XSS');\">text here</a>"

HtmlSanitizeEx.strip_tags(text)

# => "text here"

```

Or allow certain basic HTML elements to remain:

```elixir

text = "<h1>Hello <script>World!</script></h1>"

HtmlSanitizeEx.basic_html(text)

# => "<h1>Hello World!</h1>"

```

The following scrubbing options exist:

```elixir

HtmlSanitizeEx.noscrub(html)

HtmlSanitizeEx.basic_html(html)

HtmlSanitizeEx.no_images(html)

HtmlSanitizeEx.html5(html)

HtmlSanitizeEx.markdown_html(html)

HtmlSanitizeEx.strip_tags(html)

```

**TODO: write more comprehensive usage description**

## Contributing

1. [Fork it!](http://github.com/rrrene/html_sanitize_ex/fork)

2. Create your feature branch (`git checkout -b my-new-feature`)

3. Commit your changes (`git commit -am 'Add some feature'`)

4. Push to the branch (`git push origin my-new-feature`)

5. Create new Pull Request

## Author

René Föhring (@rrrene)

## License

html_sanitize_ex is released under the MIT License. See the LICENSE file for further

details.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/robinboers/html-sanitize-ex

Awesome Lists containing this project

README