{"id":22303895,"url":"https://github.com/prankshaw/beware-web-scraper","last_synced_at":"2026-03-01T22:37:39.576Z","repository":{"id":40963978,"uuid":"144403969","full_name":"prankshaw/Beware-web-scraper","owner":"prankshaw","description":"Web Scraping project including; C projects scraper from GitHub , ICC rankings scraper, YouTube Trending Scrapper, LinkedIn Profile Scraper, Wikipedia Image Scraper","archived":false,"fork":false,"pushed_at":"2022-06-22T01:47:28.000Z","size":135,"stargazers_count":14,"open_issues_count":2,"forks_count":3,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-10-25T15:14:05.796Z","etag":null,"topics":["batting","c","chrome-webdriver","chromedriver","cricket","github","icc","icc-rankings-scraper","pandas","python","python-3","rankings","scraper","selenium","selenium-webdriver","web-scraping","wikipedia-image-scraper"],"latest_commit_sha":null,"homepage":"https://prankshaw.github.io/Beware-web-scraper/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/prankshaw.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-08-11T17:09:48.000Z","updated_at":"2024-07-02T16:54:09.000Z","dependencies_parsed_at":"2022-09-01T23:11:36.944Z","dependency_job_id":null,"html_url":"https://github.com/prankshaw/Beware-web-scraper","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/prankshaw/Beware-web-scraper","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/prankshaw%2FBeware-web-scraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories
/prankshaw%2FBeware-web-scraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/prankshaw%2FBeware-web-scraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/prankshaw%2FBeware-web-scraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/prankshaw","download_url":"https://codeload.github.com/prankshaw/Beware-web-scraper/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/prankshaw%2FBeware-web-scraper/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29987346,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-01T21:06:37.093Z","status":"ssl_error","status_checked_at":"2026-03-01T21:05:45.052Z","response_time":124,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["batting","c","chrome-webdriver","chromedriver","cricket","github","icc","icc-rankings-scraper","pandas","python","python-3","rankings","scraper","selenium","selenium-webdriver","web-scraping","wikipedia-image-scraper"],"created_at":"2024-12-03T18:48:50.649Z","updated_at":"2026-03-01T22:37:39.558Z","avatar_url":"https://github.com/prankshaw.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"## Visit The project here  \u003ca href=\"../../issues\"\u003e\u003cimg alt=\"Contributions Welcome\" 
src=\"https://img.shields.io/badge/contributions-welcome-blue??style=flat\"\u003e\u003c/a\u003e\nhttps://prankshaw.github.io/Beware-web-scraper/\n\n[![Build Status](https://travis-ci.com/prankshaw/Beware-web-scraper.svg?branch=master)](https://travis-ci.com/prankshaw/Beware-web-scraper)\n[![Documentation Status](https://readthedocs.org/projects/beware-web-scraper/badge/?version=latest)](https://beware-web-scraper.readthedocs.io/en/latest/?badge=latest)\n[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)\n[![codecov](https://codecov.io/gh/prankshaw/Beware-web-scraper/branch/master/graph/badge.svg)](https://codecov.io/gh/prankshaw/Beware-web-scraper)\n[![License: MIT](https://img.shields.io/badge/License-MIT-orange.svg)](https://opensource.org/licenses/MIT)\n\u003ca href=\"../../issues\"\u003e\u003cimg alt=\"Issues Open\" src=\"https://img.shields.io/github/issues/prankshaw/Beware-web-scraper?color=pink\"\u003e\u003c/a\u003e\n\u003ca href=\"../../issues\"\u003e\u003cimg alt=\"Forks\" src=\"https://img.shields.io/github/forks/prankshaw/Beware-web-scraper?color=purple\"\u003e\u003c/a\u003e\n\u003ca href=\"../../issues\"\u003e\u003cimg alt=\"Stars\" src=\"https://img.shields.io/github/stars/prankshaw/Beware-web-scraper?color=yellow\"\u003e\u003c/a\u003e\n[![Twitter URL](https://img.shields.io/twitter/url/https/twitter.com/fold_left.svg?style=social\u0026label=Follow%20%40mepranjal31)](https://twitter.com/mepranjal31)\n\n\u003c!--[![Updates](https://pyup.io/repos/github/prankshaw/Beware-web-scraper/shield.svg)](https://pyup.io/repos/github/prankshaw/Beware-web-scraper)--\u003e\n\n# Scrapers available\n\u003col\u003e\n  \n### C-project-scraper\nScrapes the top projects for 'C' language from github. It can be extended to get projects in any language present on GitHub.\u003cbr\u003e\n### ICC Rankings-Scraper\nTells about top 100 ranked batsmen from all over the world for all 3 formats, i.e. 
Test cricket, One Day International and T20 International.\u003cbr\u003e\n### Youtube Trending-Scraper\nScrapes all the information from the trending section of YouTube, including video name, available description and video links\u003cbr\u003e\n### LinkedIn-Scraper\nAutomatically logs in to the profile and scrapes the relevant information from it, including name, location, title, connections and more\u003cbr\u003e\n### Wikipedia Image-Scraper\nScrapes links of all the images present in the given Wikipedia page and prints them\u003cbr\u003e\n\u003cbr\u003e\n\n\u003c/ol\u003e  \n\n## \u003cstrong\u003eThese projects use the Selenium driver.\u003c/strong\u003e\n#### To use project\n\u003e Just fork the project and then install the prerequisites. \u003cbr\u003e\n\u003e \u003cstrong\u003eSimply run, if present in a Jupyter notebook, else follow the steps mentioned below.\u003c/strong\u003e\u003cbr\u003e\n\u003e Python (I am using Python 3.x). After downloading Python, pip install all the requirements (if any).\u003cbr\u003e\n\u003e Selenium Webdriver for Google Chrome: Chromedriver – Download it and place it anywhere on your machine.\u003cbr\u003e\n\u003e \u003cstrong\u003epip install selenium \u003cbr\u003e\n\u003e pip install pandas\u003c/strong\u003e \u003cbr\u003e\n\u003e Replace the path of 'chromedriver' with your own path.\u003cbr\u003e\n\u003e Just run in IDLE and see the output \u003cbr\u003e\n# License\nLicensed under MIT-license\nhttps://prankshaw.mit-license.org/\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fprankshaw%2Fbeware-web-scraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fprankshaw%2Fbeware-web-scraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fprankshaw%2Fbeware-web-scraper/lists"}