https://github.com/henryhaohao/xiecheng_comment
:rainbow:Xiecheng_Comment: multithreaded (Threading) scraper for Ctrip reviews of the Lijiang Old Town attraction, with word cloud generation
- Host: GitHub
- URL: https://github.com/henryhaohao/xiecheng_comment
- Owner: Henryhaohao
- Created: 2018-10-05T08:12:10.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-10-20T02:40:32.000Z (almost 7 years ago)
- Last Synced: 2025-04-09T00:30:04.326Z (6 months ago)
- Topics: comment, mongodb, python, threading, wordcloud, xiecheng
- Language: Python
- Homepage: http://you.ctrip.com/
- Size: 4.9 MB
- Stars: 25
- Watchers: 2
- Forks: 9
- Open Issues: 1
Metadata Files:
- Readme: README.md
README
Multithreaded scraping of Ctrip reviews for the Lijiang Old Town attraction, with word cloud generation
===========================
       
### Ctrip official site - http://you.ctrip.com/
|Author|:sunglasses:Henryhaohao:sunglasses:|
|---|---
|Email|:hearts:1073064953@qq.com:hearts:
****
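## :dolphin:Disclaimer
### This software is for learning and exchange only. Please do not use it for any commercial purpose. Thank you!
## :dolphin:Introduction
### This project uses multithreading (Threading) to scrape Ctrip reviews of the Lijiang Old Town attraction and generate a word cloud
- Project overview: scrapes the reviews (3010 in total) of the [Lijiang Old Town](http://you.ctrip.com/sight/lijiang32/3056.html#jieshao) attraction from [Ctrip](http://you.ctrip.com/), runs Chinese word segmentation (Jieba library) and preprocessing on them, and finally generates a word cloud image for "Lijiang Old Town" (Wordcloud library)
- Spider file: xiecheng_comment.py in the Spiders directory
- Word cloud generation: run comment_wordcloud.py in the Spiders directory
- Database configuration: before running, edit config.py in the Spiders directory, which holds the MongoDB settings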
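
The pipeline described above (threaded fetching, then tokenizing and counting words) can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in xiecheng_comment.py or comment_wordcloud.py. `fetch_page`, `crawl`, and `word_frequencies` are hypothetical names, the sample reviews are made-up placeholder data, and an in-memory list stands in for MongoDB.

```python
import queue
import threading
from collections import Counter


def fetch_page(page):
    """Hypothetical stand-in for one HTTP request to Ctrip.

    The real endpoint and response format are not described in this README,
    so this stub returns placeholder, pre-segmented reviews instead.
    """
    return [f"古城 夜景 很 美 page{page}", "古城 人 很 多"]


def crawl(pages, fetch=fetch_page, num_threads=4):
    """Fetch every page in `pages` with a fixed number of worker threads."""
    tasks = queue.Queue()
    for page in pages:
        tasks.put(page)

    comments = []            # in-memory stand-in for the MongoDB collection
    lock = threading.Lock()  # protects `comments` across threads

    def worker():
        while True:
            try:
                page = tasks.get_nowait()
            except queue.Empty:
                return  # no pages left, let the thread finish
            result = fetch(page)
            with lock:
                comments.extend(result)

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return comments


def word_frequencies(comments, tokenize=str.split, stop_words=frozenset()):
    """Count tokens across all comments, skipping stop words.

    `tokenize` defaults to whitespace splitting so the sketch runs without
    third-party packages; a Chinese segmenter would be passed in instead.
    """
    counts = Counter()
    for text in comments:
        counts.update(tok for tok in tokenize(text) if tok not in stop_words)
    return counts


if __name__ == "__main__":
    reviews = crawl(range(1, 6))
    freqs = word_frequencies(reviews, stop_words={"很"})
    print(freqs.most_common(3))
```

With the Jieba and Wordcloud libraries installed, one would presumably pass `tokenize=jieba.lcut` and hand the resulting counts to `WordCloud.generate_from_frequencies` (with a `font_path` pointing at a font that contains Chinese glyphs). How the repository actually wires these steps together is not shown in this README.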
## :dolphin:Runtime environment
Version: Python3
## :dolphin:Install dependencies
```
pip3 install -r requirements.txt
```
## :dolphin:Screenshots
> - **Word cloud**

> - **Ctrip attraction reviews**

> - **Scraping reviews (3010 reviews in 1 second)**

> - **Review content**

> - **Review count**

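## :dolphin:**Summary**
> **Finally, if you think this project is good or it has helped you, please give it a Star. It would be an encouragement on my learning journey!
Haha, thank you all! Sending hearts~**:cupid::cupid: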