https://github.com/henryhaohao/xiecheng_comment

Multithreaded scraping of Ctrip reviews for Lijiang Old Town, with word cloud generation ![enter image description here](Pic/logo.png)
===========================
![](https://img.shields.io/badge/Python-3.6.3-green.svg) ![](https://img.shields.io/badge/requests-2.18.4-green.svg) ![](https://img.shields.io/badge/pymongo-3.6.1-green.svg) ![](https://img.shields.io/badge/wordcloud-1.4.1-green.svg) ![](https://img.shields.io/badge/numpy-1.15.0-green.svg) ![](https://img.shields.io/badge/jieba-0.39-green.svg) ![](https://img.shields.io/badge/matplotlib-2.2.2-green.svg) ![](https://img.shields.io/badge/Pillow-5.3.0-green.svg)
### Ctrip official site - http://you.ctrip.com/
|Author|:sunglasses:Henryhaohao:sunglasses:|
|---|---
|Email|:hearts:1073064953@qq.com:hearts:


****
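## :dolphin:Disclaimer
### This software is for learning and exchange only. Please do not use it for any commercial purpose. Thank you!
## :dolphin:Introduction
### This project uses multithreading (Python threading) to scrape Ctrip reviews of the Lijiang Old Town attraction and generate a word cloud
- Overview: the project scrapes the reviews (3,010 in total) of the [Lijiang Old Town](http://you.ctrip.com/sight/lijiang32/3056.html#jieshao) attraction on [Ctrip](http://you.ctrip.com/), runs Chinese word segmentation (Jieba) and preprocessing on them, and finally generates a word cloud image for "Lijiang Old Town" (Wordcloud library)
- Spider file: xiecheng_comment.py in the Spiders directory
- Word cloud generation: run comment_wordcloud.py in the Spiders directory
- Database configuration: before running, edit config.py in the Spiders directory, which holds the MongoDB settings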
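
The multithreaded scraping step described above can be sketched roughly as follows. This is a minimal illustration of the worker-thread pattern, not the code in xiecheng_comment.py: the `crawl_pages` function, the `fetch_page` callable, the worker count, and the page numbering are all assumptions. A real fetcher would issue the HTTP request (for example with requests) and could store results in MongoDB instead of returning a list.

```python
import queue
import threading


def crawl_pages(fetch_page, page_numbers, num_workers=8):
    """Fetch every page in page_numbers with a pool of worker threads.

    fetch_page is a hypothetical callable: page number -> list of review strings.
    """
    tasks = queue.Queue()
    for page in page_numbers:
        tasks.put(page)

    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            try:
                page = tasks.get_nowait()
            except queue.Empty:
                return  # no pages left, so this thread exits
            try:
                reviews = fetch_page(page)
            except Exception as exc:
                # One failing page should not stop the worker.
                print("page %s failed: %s" % (page, exc))
                reviews = []
            with lock:
                results[page] = reviews

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # Flatten in page order so the output does not depend on thread timing.
    return [r for page in sorted(results) for r in results[page]]


if __name__ == "__main__":
    # Stand-in fetcher for demonstration only; it performs no network access.
    def fake_fetch(page):
        return ["review %d-%d" % (page, i) for i in range(2)]

    print(len(crawl_pages(fake_fetch, range(1, 6))))
```

Because the work is I/O-bound (waiting on HTTP responses), threads can overlap the waiting time even under the GIL, which is why a thread pool suits this kind of scraper.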
## :dolphin:Runtime environment
Version: Python3
## :dolphin:Install dependencies
```
pip3 install -r requirements.txt
```
## :dolphin:Screenshots
> - **Word cloud**


![enter image description here](Spiders/wordcloud.jpg)
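
A word cloud like the one above is driven by token frequencies. The following is a minimal sketch of the counting and preprocessing step, under stated assumptions: `word_frequencies`, the stopword set, and the minimum token length are hypothetical, and the input is assumed to be already segmented (the project uses Jieba for that, e.g. `jieba.lcut(text)`). It is not the code in comment_wordcloud.py.

```python
from collections import Counter


def word_frequencies(token_lists, stopwords=frozenset(), min_length=2):
    """Count tokens across segmented reviews, skipping noise.

    token_lists: iterable of token lists, one list per review.
    """
    counts = Counter()
    for tokens in token_lists:
        for token in tokens:
            token = token.strip()
            # Drop very short tokens (often particles or punctuation) and stopwords.
            if len(token) < min_length or token in stopwords:
                continue
            counts[token] += 1
    return counts


if __name__ == "__main__":
    # Hand-segmented sample data, standing in for Jieba output.
    segmented = [
        ["丽江", "古城", "很", "漂亮"],
        ["古城", "夜景", "非常", "漂亮"],
    ]
    freqs = word_frequencies(segmented, stopwords={"非常"})
    print(freqs.most_common(3))
```

The resulting mapping can be passed to `WordCloud.generate_from_frequencies` from the wordcloud package. Rendering Chinese text also requires a `font_path` that points to a font with CJK glyphs.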
> - **Ctrip attraction reviews**


![enter image description here](Pic/comment.png)
> - **Scraping the reviews (3,010 reviews in 1 second)**


![enter image description here](Pic/run.gif)
> - **Review content**


![enter image description here](Pic/data.png)
> - **Review count**


![enter image description here](Pic/count.png)
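## :dolphin:**Summary**
> **Finally, if you like this project or find it helpful, please give it a Star. It would be real encouragement on my learning journey!**
>
> **Thanks, everyone!**:cupid::cupid: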