https://github.com/adamdad/douban_book_spider
beautiful soup based scraper for douban book
https://github.com/adamdad/douban_book_spider
Last synced: about 1 year ago
JSON representation
beautiful soup based scraper for douban book
- Host: GitHub
- URL: https://github.com/adamdad/douban_book_spider
- Owner: Adamdad
- Created: 2020-03-14T00:15:26.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2020-03-14T01:03:26.000Z (over 6 years ago)
- Last Synced: 2025-01-17T01:27:00.620Z (over 1 year ago)
- Language: Python
- Size: 5.94 MB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# DouBan_book_spider
This is beautiful-soup based scraper for douban book
We mainly scrap the book list with score>9.0.
# Environment
matplotlib
pandas
numpy
wordcloud
bs4
# How to use
1. Scrap douban list and save to a csv(you can add your needed
Field into the code)
python Douban_annul_book_spider.py
2. Cound word and tranferm it into word cloud
python analysis.py
