https://github.com/zfb132/python_practice
自己写的一些脚本,主要是网络数据采集
https://github.com/zfb132/python_practice
Last synced: about 1 month ago
JSON representation
自己写的一些脚本,主要是网络数据采集
- Host: GitHub
- URL: https://github.com/zfb132/python_practice
- Owner: zfb132
- License: gpl-3.0
- Created: 2018-08-30T14:27:05.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-08-31T02:04:10.000Z (almost 8 years ago)
- Last Synced: 2025-01-15T07:57:27.363Z (over 1 year ago)
- Language: Python
- Size: 31.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Python_Practice
## [parseMD](https://github.com/zfb132/Python_Practice/blob/master/parseMD/parseMD.py "点击查看代码")
把`Jekyll`应用发的`markdown`格式的博客进行分词:
* 使用方法为在控制台窗口输入 `python parseMD.py demo_article.md`
* 程序首先将格式控制字符去除,再调用`jieba`分词
## [demo_baike](https://github.com/zfb132/Python_Practice/blob/master/demo_baike.py "点击查看代码")
根据输入的关键词,搜索到对应的百科页面,下载`HTML`源码和图片
## [download_pdf_tencent](https://github.com/zfb132/Python_Practice/blob/master/download_pdf_tencent.py "点击查看代码")
下载腾讯云的所有文档,按照网页对应结构建立目录
## [download_student_img](https://github.com/zfb132/Python_Practice/blob/master/download_student_img.py "点击查看代码")
下载入学时拍的照片
## [download_txt_charle](https://github.com/zfb132/Python_Practice/blob/master/download_txt_charle.py "点击查看代码")
在小说网站下载一部书籍