https://github.com/rmlzy/itpanda_spider
一个IT类电子书网站的爬虫程序
https://github.com/rmlzy/itpanda_spider
ebook nodejs pdf spider
Last synced: about 2 months ago
JSON representation
一个IT类电子书网站的爬虫程序
- Host: GitHub
- URL: https://github.com/rmlzy/itpanda_spider
- Owner: rmlzy
- Created: 2020-04-20T09:47:14.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2021-05-11T11:22:05.000Z (about 4 years ago)
- Last Synced: 2025-03-21T21:51:11.464Z (2 months ago)
- Topics: ebook, nodejs, pdf, spider
- Language: JavaScript
- Homepage:
- Size: 6.13 MB
- Stars: 5
- Watchers: 1
- Forks: 4
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
一个[IT类电子书网站](https://www.itpanda.net)的爬虫程序, 此网站大约有400多本高清的电子书, 包括 PDF、epub、mobi、azw3等格式.
抓取后的结果参见: [output.json](./output.json)
## 如何使用
```shell script
# 下载代码
git clone [email protected]:rmlzy/itpanda_spider.git
cd itpanda_spider# 安装依赖
npm install# 开始抓取程序, 会在 itpanda_spider 目录下生成 output.json 文件
npm run start
```推荐一个 Mac 平台的 epub 阅读器: [Clearview](./docs/Clearview+for+Mac+2.3.2.dmg), 解压密码: `www.ifunmac.com`
一份下载好的前端方向的 zip: 链接:https://pan.baidu.com/s/1K2qbnDlvsCwIsYDWzNXXwA 密码:sfbs