https://github.com/rto/phased-array
Generates a privacy-focussed list of tracker domains that have been identified by DuckDuckGo's Tracker Radar for use in ad blocker solutions like pi-hole.
https://github.com/rto/phased-array
blocklist domains pi-hole pi-hole-blocklists privacy trackers tracking
Last synced: 7 months ago
JSON representation
Generates a privacy-focussed list of tracker domains that have been identified by DuckDuckGo's Tracker Radar for use in ad blocker solutions like pi-hole.
- Host: GitHub
- URL: https://github.com/rto/phased-array
- Owner: rto
- License: bsd-3-clause
- Created: 2020-03-10T09:30:38.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-03-12T11:25:20.000Z (over 5 years ago)
- Last Synced: 2024-10-28T02:17:56.125Z (12 months ago)
- Topics: blocklist, domains, pi-hole, pi-hole-blocklists, privacy, trackers, tracking
- Language: Python
- Homepage:
- Size: 54.7 KB
- Stars: 11
- Watchers: 2
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Phased Array
## What?
A script that generates a privacy-focussed list of tracker domains that have been identified by [DuckDuckGo's Tracker Radar](https://spreadprivacy.com/duckduckgo-tracker-radar/) for use in ad blocker solutions like pi-hole.
## Requirements
This script requires Python >= 3.6.
## How?
```bash
git clone --recurse-submodules https://github.com/rto/phased-array.git phased-array
cd phased-array
python generate_lists.py
```You can customise the input directory, output file pathname, and the line prefix via the command-line.
You will likely also want to pick which [categories](https://github.com/duckduckgo/tracker-radar/blob/master/docs/CATEGORIES.md) you want to exclude from the list (see Limitations / Warnings below).
Setting a `--line-prefix` can be used to generate a `hosts` formatted list.
```bash
python generate_lists.py \
--input-directory my-tracker/domains \
--output-pathname /path/to/my-output.txt \
--destination-address '127.0.0.1' \
--exclude-uncategorized \
--exclude-category CDN \
--exclude-category 'Embedded Content' \
--exclude-category 'Federated Login'
```See `--help` for a full list of configuration options.
## Limitations / Warnings
_Striking a balance between privacy and usability is tough!_
Blocking by domain name can be a particularly blunt tool. By default the Tracker Radar includes domains for many popular websites and apps that you may wish to use on a daily basis. If you do not set any _exclude categories_ then your output may result in 'undesirable behaviour', i.e. your favourite website/app may stop working.
**For example, github.com, google.com, paypal.com, etc would all be blocked if we included every single domain.**
By default we have chosen to exclude any domain that matches one or more of the following categories: CDN, Embedded Content, Federated Login, Non-tracking, Online Payment, SSO.
Depending on your personal preference or concerns you may wish to filter on different [categories](https://github.com/duckduckgo/tracker-radar/blob/master/docs/CATEGORIES.md).
## Future improvements
- Generate different types of output (domains, hosts, regex)
- Improve the way that we filter domains in or out of the listAny help on these gratefully received! :-)
## Source data
This project makes use of the Tracker Radar data from DuckDuckGo is [licensed](https://raw.githubusercontent.com/duckduckgo/tracker-radar/master/LICENSE) under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).