https://github.com/yachuuuuu/crawler
crawling-python file-format pandas
- Host: GitHub
- URL: https://github.com/yachuuuuu/crawler
- Owner: YaChuuuuu
- Created: 2022-07-01T02:25:51.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-07-12T13:41:44.000Z (6 months ago)
- Last Synced: 2024-07-12T15:42:18.724Z (6 months ago)
- Topics: crawling-python, file-format, pandas
- Language: Jupyter Notebook
- Homepage:
- Size: 6.45 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
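# Class Assignment: Web Crawler Implementation
Class assignments from the AI Cross-Domain Data Science Talent Training Program.

**Applied skills**
1. Crawling techniques (requests and BeautifulSoup modules)
2. Data processing (pandas module)
3. File storage (csv, openpyxl, sqlite3, pymysql, and gspread modules)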
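
As a rough illustration of the file storage skill, the sketch below writes a few records with the standard-library `csv` and `sqlite3` modules. The records, the `items` table, and the helper names `to_csv_text` and `to_sqlite` are hypothetical and are not taken from the notebooks.

```python
import csv
import io
import sqlite3

# Hypothetical records standing in for scraped data (not from the notebooks).
rows = [
    {"name": "item A", "price": 120},
    {"name": "item B", "price": 80},
]


def to_csv_text(records):
    """Serialize a list of dicts to CSV text using the csv module."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["name", "price"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_sqlite(records, conn):
    """Insert the records into a SQLite table and return the row count."""
    conn.execute("CREATE TABLE IF NOT EXISTS items (name TEXT, price INTEGER)")
    conn.executemany(
        "INSERT INTO items (name, price) VALUES (:name, :price)", records
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM items").fetchone()[0]


csv_text = to_csv_text(rows)
conn = sqlite3.connect(":memory:")  # in-memory database; pass a file path to persist
row_count = to_sqlite(rows, conn)
print(csv_text)
print(row_count)
```

An in-memory buffer and database keep the sketch free of side effects. Replacing them with `open("items.csv", "w", newline="")` and a database file path would write to disk instead.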
---
## File Descriptions
### 20220513_184莊雅竹_台灣彩券作業.ipynb
**Workflow**

Scrape the key data from the web page and display it.
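
A minimal sketch of this kind of extraction with BeautifulSoup follows. The HTML snippet, the class names `draw`, `game`, and `ball`, and the function `extract_draw` are all hypothetical. The real lottery page uses its own markup, which has to be inspected in the browser first.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a lottery results page; the real page's
# tags and class names will differ.
SAMPLE_HTML = """
<div class="draw">
  <span class="game">Sample Game</span>
  <span class="ball">05</span>
  <span class="ball">12</span>
  <span class="ball">27</span>
</div>
"""


def extract_draw(html):
    """Return the game name and the drawn numbers found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    game = soup.select_one("div.draw span.game").get_text(strip=True)
    numbers = [
        int(tag.get_text(strip=True))
        for tag in soup.select("div.draw span.ball")
    ]
    return game, numbers


game, numbers = extract_draw(SAMPLE_HTML)
print(f"{game}: {numbers}")
```

In a live run, the HTML string would typically come from something like `requests.get(url).text` rather than from an inline sample.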
### 20220519_184_莊雅竹_電商平台資料分析實作.ipynb
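**Workflow**
1. Analyze the scope of the requirements
2. Analyze the tags in the page's HTML source
3. Perform statistical calculations
4. Save the files to each storage target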
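
Steps 3 and 4 above might look roughly like the following pandas sketch. The product records and the column names `category` and `price` are assumptions made for illustration. The notebook's actual fields and statistics may differ.

```python
import io

import pandas as pd

# Hypothetical product records, as if already parsed from listing pages.
products = pd.DataFrame(
    {
        "category": ["books", "books", "toys"],
        "price": [300, 500, 250],
    }
)

# Step 3: per-category statistics (item count and mean price).
summary = (
    products.groupby("category")["price"]
    .agg(["count", "mean"])
    .reset_index()
)

# Step 4: write the summary as CSV; a file path can replace the buffer.
buffer = io.StringIO()
summary.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
print(csv_text)
```

The same `summary` frame could also be written with `summary.to_excel(...)` (which relies on openpyxl) or `summary.to_sql(...)` with a `sqlite3` connection, depending on the storage target.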