{"id":18412071,"url":"https://github.com/elliotxx/readnewspaper","last_synced_at":"2025-04-12T23:44:55.783Z","repository":{"id":86319252,"uuid":"125957738","full_name":"elliotxx/readNewspaper","owner":"elliotxx","description":"自动获取电子版报纸，方便每天阅读","archived":false,"fork":false,"pushed_at":"2019-11-11T07:55:35.000Z","size":18,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-04-12T23:44:30.447Z","etag":null,"topics":["crawler","lxml","newspaper","pypdf2","python","requests"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/elliotxx.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-03-20T03:50:29.000Z","updated_at":"2024-08-20T14:29:58.000Z","dependencies_parsed_at":"2023-07-07T06:45:59.482Z","dependency_job_id":null,"html_url":"https://github.com/elliotxx/readNewspaper","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elliotxx%2FreadNewspaper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elliotxx%2FreadNewspaper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elliotxx%2FreadNewspaper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elliotxx%2FreadNewspaper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/elliotxx","download_url":"https://codeload.github.com/elliotxx/readNewspaper/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248647257,"owners_count":21139081,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["crawler","lxml","newspaper","pypdf2","python","requests"],"created_at":"2024-11-06T03:39:37.532Z","updated_at":"2025-04-12T23:44:55.761Z","avatar_url":"https://github.com/elliotxx.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"## readNewspaper\n自动获取电子版报纸，方便每天阅读\n\n## 使用方式\n```python\npython BandaoNewspaper.py\n```\n\n## 目前可以获取的报纸\n* 《半岛都市报》  \n报纸首页：http://bddsb.bandao.cn/\n\n## 特性\n* [x] 自动合并PDF\n* [x] 代理IP池\n* [ ] 运行脚本时自动获取代理IP池\n* [ ] 通过参数选择对应日期的报纸\n* [ ] 每天定时检查是否有最新报纸，如果有，生成pdf发邮件提醒\n\n## 依赖\n* PyPDF2\n* requests\n* lxml\n\n## 参考资料\n* 在windows下安装PyPdf2，将文件夹中的pdf文件合成为一个pdf文件  \nhttp://blog.csdn.net/andy_blogs/article/details/78041679\n\n* Python SMTP 发送带附件电子邮件  \nhttps://blog.csdn.net/zm2714/article/details/7993732\n\n* python+smtp发送邮件附件问题  \nhttps://segmentfault.com/q/1010000009102883\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Felliotxx%2Freadnewspaper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Felliotxx%2Freadnewspaper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Felliotxx%2Freadnewspaper/lists"}