{"id":15576225,"url":"https://github.com/ct83/pyscrapperserver","last_synced_at":"2026-06-01T01:31:31.494Z","repository":{"id":54914630,"uuid":"102807665","full_name":"CT83/PyScrapperServer","owner":"CT83","description":"This script is a Python Scrapper controlled via a Web Interface which uses Bottle; BeautifulSoup 4 is used for scrapping EBooks off Websites which host them for free.","archived":false,"fork":false,"pushed_at":"2017-12-17T07:16:34.000Z","size":218,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-12-01T03:19:30.567Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/CT83.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2017-09-08T02:27:57.000Z","updated_at":"2022-08-08T23:48:01.000Z","dependencies_parsed_at":"2022-08-14T06:30:55.935Z","dependency_job_id":null,"html_url":"https://github.com/CT83/PyScrapperServer","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/CT83/PyScrapperServer","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CT83%2FPyScrapperServer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CT83%2FPyScrapperServer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CT83%2FPyScrapperServer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CT83%2FPyScrapperServer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/CT83","download_url":"https://codeload.github.com/CT83/PyScrapperServer/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CT83%2FPyScrapperServer/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33756575,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-05-31T02:00:06.040Z","response_time":95,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-02T18:43:58.116Z","updated_at":"2026-06-01T01:31:31.477Z","avatar_url":"https://github.com/CT83.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# PyScrapperServer\nThis script is a Python Scrapper  controlled vai a Web Interface which uses Bottle;  BeautifulSoup 4  is used for scrapping EBooks off Websites which host them for free.\n\n\u003cb\u003ePreface\u003c/b\u003e\nThis project was entirely created on a Friday when I called in sick from College. This project provided  me an  introduction to Web Servers, Web Frameworks, Scrapping in Python and the Bottle Framework. \n\n\u003cb\u003eIntroduction\u003c/b\u003e\nThe code is run on a Raspberry Pi connected to the local WiFi connection, preferably using  a Static IP. The user accesses the Web Interfaces hosted on the Pi and then pastes the link of the EBook that he wishes to download. The Ebook is then scrapped off the Link provided by the user. It is further converted to PDF format for easy reading.\n\u003cem\u003e\u003cb\u003etl;dr\u003c/b\u003e\u0026nbsp;  Web Server which Scrapes the Web.\u003c/em\u003e\n\n \u003cimg src=\"bottle_scrapper.PNG\"\u003e\n\n\u003cb\u003eHow to Install?\u003c/b\u003e\n1. Clone this Repo\n2. Install Dependencies  \u003cbr/\u003e`sudo pip install reportlab requests bs4 python-dev install bottle`\n3. `sudo nano /etc/rc.local`\n4. Add  `sudo python path_of_this_cloned_repo/BottleServer.py`  to the end of the file before  `exit 0` , to allow the server to run at boot.\n5. `sudo reboot`\n\nDone!\nNow visit the IP Address of the Server  example`192.168.1.10:8080`\n\n\n\u003cb\u003eDependencies\u003c/b\u003e\n1. [txt2pdf](https://github.com/baruchel/txt2pdf%22txt2pdf%22)\n2. [bottle](https://github.com/bottlepy/bottle%22bottle%22)\n3. [reportlab](https://github.com/Distrotech/reportlab)\n4. [Beautiful Soup](https://code.launchpad.net/beautifulsoup)\n5. [requests](https://github.com/requests/requests)\n\n\u003cb\u003eConclusion\u003c/b\u003e\nThis project successfully downloaded several Ebooks from Websites and so was a succcess.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fct83%2Fpyscrapperserver","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fct83%2Fpyscrapperserver","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fct83%2Fpyscrapperserver/lists"}