{"id":23114686,"url":"https://github.com/prithivsakthiur/web-data-scraper","last_synced_at":"2025-05-06T21:42:44.111Z","repository":{"id":239739667,"uuid":"800423189","full_name":"PRITHIVSAKTHIUR/Web-Data-Scraper","owner":"PRITHIVSAKTHIUR","description":"Data text successfully scraped! - Put \u0026 Get","archived":false,"fork":false,"pushed_at":"2024-05-31T09:01:41.000Z","size":1089,"stargazers_count":6,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-31T03:12:37.580Z","etag":null,"topics":["data","scraper","streamlit","web"],"latest_commit_sha":null,"homepage":"https://huggingface.co/spaces/prithivMLmods/Web-Data-Scraper","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/PRITHIVSAKTHIUR.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-05-14T09:59:25.000Z","updated_at":"2024-07-27T15:01:56.000Z","dependencies_parsed_at":"2024-05-14T11:26:32.519Z","dependency_job_id":"d40de28c-e77b-46a2-a490-94c4e5ff01d3","html_url":"https://github.com/PRITHIVSAKTHIUR/Web-Data-Scraper","commit_stats":null,"previous_names":["prithivsakthiur/web-data-scraper"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PRITHIVSAKTHIUR%2FWeb-Data-Scraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PRITHIVSAKTHIUR%2FWeb-Data-Scraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PRITHIVSAKTHIUR%2FWeb-Data-Scraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PRITHIVSAKTHIUR%2FWeb-Data-Scraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/PRITHIVSAKTHIUR","download_url":"https://codeload.github.com/PRITHIVSAKTHIUR/Web-Data-Scraper/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252775997,"owners_count":21802457,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","scraper","streamlit","web"],"created_at":"2024-12-17T03:34:18.641Z","updated_at":"2025-05-06T21:42:44.064Z","avatar_url":"https://github.com/PRITHIVSAKTHIUR.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"---\r\ntitle: Web Data Scrapper\r\nemoji: 🐣🔍\r\ncolorFrom: gray\r\ncolorTo: indigo\r\nsdk: streamlit\r\nsdk_version: 1.34.0\r\napp_file: app.py\r\npinned: false\r\nlicense: creativeml-openrail-m\r\n---\r\n\r\n![alt text](assets/33.png)\r\n\r\n🚀Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference\r\n\r\n🚀Huggingface Spaces : https://huggingface.co/spaces/prithivMLmods/Web-Data-Scraper\r\n\r\n🚀Docs for Space clone : git clone https://huggingface.co/spaces/prithivMLmods/Web-Data-Scraper\r\n\r\n## 🔮Entered URL of Microsoft Learn :\r\n\r\n.\r\n\r\n![alt text](assets/wds.png)\r\n\r\n.\r\n\r\n## 🎴The Scraped Result in the Space : \r\n\r\n![alt text](assets/wds2.png)\r\n\r\n.\r\n\r\n.\r\n\r\n.\r\n\r\n## Python Package Index PyPI\r\n\r\n\r\n| Library Name | Version |\r\n| --- | --- |\r\n| aiohttp | 3.8.5 |\r\n| aiosignal | 1.3.1 |\r\n| altair | 5.0.1 |\r\n| async-timeout | 4.0.2 |\r\n| attrs | 23.1.0 |\r\n| beautifulsoup4 | 4.12.2 |\r\n| blinker | 1.6.2 |\r\n| bs4 | 0.0.1 |\r\n| cachetools | 5.3.1 |\r\n| certifi | 2023.7.22 |\r\n| charset-normalizer | 3.2.0 |\r\n| click | 8.1.6 |\r\n| decorator | 5.1.1 |\r\n| frozenlist | 1.4.0 |\r\n| gitdb | 4.0.10 |\r\n| GitPython | 3.1.32 |\r\n| idna | 3.4 |\r\n| importlib-metadata | 6.8.0 |\r\n| Jinja2 | 3.1.2 |\r\n| jsonschema | 4.18.4 |\r\n| jsonschema-specifications | 2023.7.1 |\r\n| markdown-it-py | 3.0.0 |\r\n| MarkupSafe | 2.1.3 |\r\n| mdurl | 0.1.2 |\r\n| multidict | 6.0.4 |\r\n| numpy | 1.25.2 |\r\n| openai | 0.27.8 |\r\n| packaging | 23.1 |\r\n| pandas | 2.0.3 |\r\n| Pillow | 9.5.0 |\r\n| protobuf | 4.23.4 |\r\n| pyarrow | 12.0.1 |\r\n| pydeck | 0.8.0 |\r\n| Pygments | 2.15.1 |\r\n| Pympler | 1.0.1 |\r\n| python-dateutil | 2.8.2 |\r\n| python-dotenv | 1.0.0 |\r\n| pytz | 2023.3 |\r\n| pytz-deprecation-shim | 0.1.0.post0 |\r\n| referencing | 0.30.0 |\r\n| requests | 2.31.0 |\r\n| rich | 13.5.2 |\r\n| rpds-py | 0.9.2 |\r\n| six | 1.16.0 |\r\n| smmap | 5.0.0 |\r\n| soupsieve | 2.4.1 |\r\n| streamlit | 1.25.0 |\r\n| tenacity | 8.2.2 |\r\n| toml | 0.10.2 |\r\n| toolz | 0.12.0 |\r\n| tornado | 6.3.2 |\r\n| tqdm | 4.65.0 |\r\n| typing-extensions | 4.7.1 |\r\n| tzdata | 2023.3 |\r\n| tzlocal | 4.3.1 |\r\n| urllib3 | 2.0.4 |\r\n| validators | 0.20.0 |\r\n| watchdog | 3.0.0 |\r\n| yarl | 1.9.2 |\r\n| zipp | 3.16.2 |\u003c/s\u003e\r\n\r\n\r\n## Make sure about the Lib's \r\n\r\n+ StreamLit\r\n+ Requests \r\n+ BS4\r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fprithivsakthiur%2Fweb-data-scraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fprithivsakthiur%2Fweb-data-scraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fprithivsakthiur%2Fweb-data-scraper/lists"}