{"id":26910170,"url":"https://github.com/ztf666/web-scraper","last_synced_at":"2025-10-29T01:35:06.314Z","repository":{"id":51616753,"uuid":"258070808","full_name":"ZTF666/web-scraper","owner":"ZTF666","description":"A small page scraper , NO DYNAMIC SCRAPING tho :tired_face: ","archived":false,"fork":false,"pushed_at":"2021-05-11T03:15:09.000Z","size":146,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2023-03-06T16:11:20.666Z","etag":null,"topics":["api-call","axios","axios-rest","axios-restful","cheerio","page","scraper","scraperjs"],"latest_commit_sha":null,"homepage":"","language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ZTF666.png","metadata":{"files":{"readme":"readme.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-04-23T02:13:40.000Z","updated_at":"2021-05-11T02:27:33.000Z","dependencies_parsed_at":"2022-08-22T11:51:19.994Z","dependency_job_id":null,"html_url":"https://github.com/ZTF666/web-scraper","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ZTF666%2Fweb-scraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ZTF666%2Fweb-scraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ZTF666%2Fweb-scraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ZTF666%2Fweb-scraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ZTF666","download_url":"https://codeload.github.com/ZTF666/web-scraper/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246647404,"owners_count":20811301,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api-call","axios","axios-rest","axios-restful","cheerio","page","scraper","scraperjs"],"created_at":"2025-04-01T13:31:34.794Z","updated_at":"2025-10-29T01:35:01.295Z","avatar_url":"https://github.com/ZTF666.png","language":"JavaScript","readme":"# 💩Scrapy💩\n\nA small page scraper , still a WiP .\nNo dynamic scraping ...\nThis script uses :\n\n\u003cdiv align=\"center\"\u003e\n\u003cimg src=\"https://avatars0.githubusercontent.com/u/7230330?s=200\u0026v=4\"  width='60' height='60' alt='Cheerio'\u003e\n\u003cimg src=\"https://external-content.duckduckgo.com/iu/?u=https%3A%2F%2Fupload.wikimedia.org%2Fwikipedia%2Fcommons%2Fthumb%2F9%2F99%2FUnofficial_JavaScript_logo_2.svg%2F1200px-Unofficial_JavaScript_logo_2.svg.png\u0026f=1\u0026nofb=1\"  width='60' height='60' alt=\"Javascript\"\u003e\n\u003ch6\u003e\u003cstrong\u003eA\u003c/strong\u003exios\u003c/h6\u003e\n\u003c/div\u003e\n\n## How to use\n\n- **Install and run**\n\n```javascript\nnpm install\n```\n\n```javascript\nnpm run scrapy\n```\n\n- **Change the website and add yours**\n\n```javascript\naxios.get(\"https://chouftv.ma/press\");\n```\n\n- **Change the elements by the ones you desire**\n\n```javascript\n$(\".description\").each((index, element) =\u003e {\n  const title = $(element).children().first().text();\n  const links = $(element).children(\"a\").attr(\"href\");\n});\n```\n\n![Screenshot](scr/res.png)\n\n```\nIt looks weird because i used it on a local news website.\n```\n\n- **Limitations**\n  \u003cdetails\u003e\n    \n    \u003cp\u003eThis is a shitty scrapper , i'm still learning.\u003c/p\u003e\n    \u003cp\u003eIt doesn't scrap unloaded links. \u003c/p\u003e\n\n  ![Screenshot](scr/lm.png)\n\n\u003cp\u003eIn the screenshot above , the button litteraly translates to : LOAD MORE\u003c/p\u003e\n\u003cp\u003eSince i suck at this, i can't make it load more so i can grab the links\u003c/p\u003e\n\u003cp\u003eSo it only grabs the latest news articles .\u003c/p\u003e\n\u003cp\u003eThat's a blessing and a curse , beacause if clicked , it will load EVERY ARTICLE WRITTEN \u003c/p\u003e\n\u003cp\u003esince the deployement of the website... \u003c/p\u003e\n\u003c/details\u003e\n\n## Contact\n\n```\nyou can contact me at ZTF666@protonmail.ch\n\n```\n\n\u003cdiv align=\"center\"\u003e\n\n\u003ctable\u003e\n  \u003ctr\u003e\n    \u003ctd align=\"center\"\u003e\u003ca href=\"https://ztfportfolio.web.app/\" target='_blank'\u003e\u003cimg src=\"https://avatars1.githubusercontent.com/u/32502988?v=4\" width=\"100px;\" alt=\"\"/\u003e\u003cbr /\u003e\u003csub\u003e\u003cb\u003eZTF666\u003c/b\u003e\u003c/sub\u003e\u003c/a\u003e\u003c/td\u003e\n  \u003c/tr\u003e\n\u003c/table\u003e\n\n\u003c/div\u003e\n\n## License\n\n\u003cdiv align=\"center\"\u003e\n\n**💩Scrapy💩** released under the [MIT](LICENSE) License.\n\u003cbr\u003e\u003cbr\u003e\n\u003cstrong\u003e\u003cp\u003eMade with 💘 by a 👨‍💻 on a 💻 | 2020 | ZTF666 - N.EA\u003c/p\u003e \u003c/strong\u003e\n\n\u003c/div\u003e\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fztf666%2Fweb-scraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fztf666%2Fweb-scraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fztf666%2Fweb-scraper/lists"}