Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dickmao/cl-housing-cars
Craigslist Despamming
https://github.com/dickmao/cl-housing-cars
nlp-parsing spam-filtering
Last synced: 9 days ago
JSON representation
Craigslist Despamming
- Host: GitHub
- URL: https://github.com/dickmao/cl-housing-cars
- Owner: dickmao
- Created: 2017-07-20T00:13:48.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-08-20T14:49:48.000Z (over 6 years ago)
- Last Synced: 2024-11-06T08:12:51.072Z (about 2 months ago)
- Topics: nlp-parsing, spam-filtering
- Language: Python
- Homepage:
- Size: 4 MB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# scrapy
Send daily email of filtered listings for broker-spammed categories, i.e., housing and cars.
Top-level invocations in `crontab`.
Python Scrapy crawling software in `tutorial/spiders`, followed by text processing logic in `dedupe.ipynb`.