https://github.com/adamdad/douban_book_spider

A Beautiful Soup based scraper for Douban books


Host: GitHub
URL: https://github.com/adamdad/douban_book_spider
Owner: Adamdad
Created: 2020-03-14T00:15:26.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2020-03-14T01:03:26.000Z (over 6 years ago)
Last Synced: 2025-01-17T01:27:00.620Z (over 1 year ago)
Language: Python
Size: 5.94 MB
Stars: 1
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# DouBan_book_spider
This is a Beautiful Soup based scraper for Douban books.
It mainly scrapes the list of books with a score above 9.0.

# Environment
- matplotlib
- pandas
- numpy
- wordcloud
- bs4

# How to use
1. Scrape the Douban list and save it to a CSV file (you can add any fields you need in the code)

python Douban_annul_book_spider.py

2. Count words and transform them into a word cloud

python analysis.py
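
For readers who want a feel for step 1 before opening the script, here is a minimal sketch of parsing a book list with Beautiful Soup, keeping books scored above 9.0, and writing them to a CSV file. It is not the code in `Douban_annul_book_spider.py`: the HTML snippet, its class names, and the names `SAMPLE_HTML`, `FIELDS`, `parse_books`, and `save_csv` are all assumptions made for illustration, and the real Douban pages use different markup.

```python
import csv

from bs4 import BeautifulSoup

# Hypothetical markup for illustration only; real Douban pages differ.
SAMPLE_HTML = """
<ul>
  <li class="book"><span class="title">Book A</span><span class="score">9.3</span></li>
  <li class="book"><span class="title">Book B</span><span class="score">8.7</span></li>
  <li class="book"><span class="title">Book C</span><span class="score">9.1</span></li>
</ul>
"""

# Columns written to the CSV; extra fields would be added here and in parse_books.
FIELDS = ["title", "score"]


def parse_books(html, min_score=9.0):
    """Return one dict per book whose score is strictly above min_score."""
    soup = BeautifulSoup(html, "html.parser")
    books = []
    for item in soup.find_all("li", class_="book"):
        title = item.find(class_="title").get_text(strip=True)
        score = float(item.find(class_="score").get_text(strip=True))
        if score > min_score:
            books.append({"title": title, "score": score})
    return books


def save_csv(books, path):
    """Write the parsed rows to a CSV file with a header line."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(books)
```

Calling `save_csv(parse_books(SAMPLE_HTML), "books.csv")` would write the two books that pass the score filter. In a real run the HTML would come from an HTTP response rather than an inline string.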

![wordcloud](wordcloud.png)
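
The word counting behind step 2 can be sketched roughly as follows. This is an illustrative outline, not the contents of `analysis.py`: the function names `count_words` and `save_word_cloud`, the `min_length` parameter, and the choice to count words in book titles are assumptions. The rendering helper relies on the `wordcloud` package listed under Environment and imports it only when called.

```python
import re
from collections import Counter


def count_words(texts, min_length=2):
    """Count lower-cased word occurrences across a list of strings."""
    counts = Counter()
    for text in texts:
        words = re.findall(r"\w+", text.lower())
        # Skip very short tokens such as single letters.
        counts.update(w for w in words if len(w) >= min_length)
    return counts


def save_word_cloud(counts, path="wordcloud.png"):
    """Render word frequencies to an image file (needs the wordcloud package)."""
    from wordcloud import WordCloud  # imported lazily; only needed for rendering

    cloud = WordCloud(width=800, height=400, background_color="white")
    cloud.generate_from_frequencies(dict(counts))
    cloud.to_file(path)
```

The regular expression only splits on non-word characters, so it suits space-separated text. Chinese titles would additionally need a word segmenter before counting, and a font covering CJK glyphs passed to `WordCloud` through its `font_path` argument.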
