https://github.com/onetail/crawler-with-kafka-docker
Homework project: a crawler with analysis
- Host: GitHub
- URL: https://github.com/onetail/crawler-with-kafka-docker
- Owner: Onetail
- Created: 2018-04-10T07:14:14.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-04-12T07:38:15.000Z (about 7 years ago)
- Last Synced: 2025-01-24T10:47:04.136Z (4 months ago)
- Topics: analysis, crawler, kafka-docker
- Language: Python
- Homepage:
- Size: 5.77 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Crawler-with-kafka-docker
> Crawls Yahoo News and analyzes the crawled data.
* ```python main.py```
> 
> Runs the crawler together with the Kafka producer and consumer.
* ``` python main.py consumer ```
> 
> Shows the data the consumer receives.
> This project uses kafka-docker as the message queue and a Python crawler to fetch the data.
> The analysis computes cosine similarity for the top 5 items in the
> CSV content.
> 
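The README does not say how the cosine similarity is computed, so the following is only a minimal sketch of one common approach: each text becomes a raw term-frequency vector and the five highest-scoring documents are returned. The function names (`tokenize`, `cosine_similarity`, `top_similar`) and the simple word tokenizer are assumptions made for illustration, not code from this repository.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        # An empty text has no direction, so treat it as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)


def top_similar(query, documents, k=5):
    """Return the k (score, document) pairs most similar to the query."""
    query_vec = Counter(tokenize(query))
    scored = [
        (cosine_similarity(query_vec, Counter(tokenize(doc))), doc)
        for doc in documents
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]
```

Calling `top_similar(some_headline, list_of_texts)` yields up to five `(score, text)` pairs, best match first. In practice the texts would come from the crawled CSV content; a weighting scheme such as TF-IDF could replace the raw counts without changing the rest of the sketch.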