Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/neysofu/qetesh
Web scraper to train profanity detectors. NSFW!
https://github.com/neysofu/qetesh
bot nsfw python scrapy web-scraping
Last synced: 26 days ago
JSON representation
Web scraper to train profanity detectors. NSFW!
- Host: GitHub
- URL: https://github.com/neysofu/qetesh
- Owner: neysofu
- License: mit
- Created: 2016-06-11T00:26:31.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2019-01-03T10:16:31.000Z (about 6 years ago)
- Last Synced: 2024-12-14T01:03:54.980Z (about 1 month ago)
- Topics: bot, nsfw, python, scrapy, web-scraping
- Language: Python
- Homepage:
- Size: 16.6 KB
- Stars: 9
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# Qetesh ![Build status](https://travis-ci.org/neysofu/qetesh.svg?branch=master) ![NSFW](https://img.shields.io/badge/warning-NSFW-red.svg)
Qetesh is a tiny Python project built on the top of [Scrapy](http://scrapy.org); when executed, the script will start scraping the website youporn.com for users' comments. I haven't explored the possibilities just yet, but it seems like valuable data for profanity detectors and such.