Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bluedragonz/bad-bot-blocker
Block bad, possibly even malicious web crawlers (automated bots) using htaccess.
https://github.com/bluedragonz/bad-bot-blocker
Last synced: 30 days ago
JSON representation
Block bad, possibly even malicious web crawlers (automated bots) using htaccess.
- Host: GitHub
- URL: https://github.com/bluedragonz/bad-bot-blocker
- Owner: bluedragonz
- Created: 2013-01-18T20:59:31.000Z (almost 12 years ago)
- Default Branch: master
- Last Pushed: 2015-07-31T16:44:47.000Z (over 9 years ago)
- Last Synced: 2024-08-04T01:26:26.012Z (4 months ago)
- Size: 186 KB
- Stars: 356
- Watchers: 53
- Forks: 91
- Open Issues: 8
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Bad Bot Blocker
===============223 Apache htaccess rules to block bad bots.
Bad bots are defined as:
- E-mail harvesters
- Content scrapers
- Spam bots
- Vulnerability scanners
- Aggressive bots that provide little value
- Bots linked to viruses or malware
- Government surveillance bots
- Russian search engine Yandex
- Chinese search engine BaiduYandex/Baidu
------------Unless your website is written in Russian or Chinese, you probably don't
get any traffic from them. They mostly just waste bandwidth and consume resources.Bots Are Liars
--------------Bots try to make themselves look like other software by disguising their
useragent. Their useragents may look harmless, perfectly legitimate even.
For example, "^Java" but according to
[Project Honeypot](https://www.projecthoneypot.org/harvester_useragents.php),
it's actually one of the most dangerous.Setup
-----If you have a bizarre or complicated setup, be sure to look everything
over before using it. But for anyone with something that resembles
a standard Apache installation, this should work without any issues.