Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/prithivsakthiur/web-data-scraper
Data text successfully scraped! - Put & Get
https://github.com/prithivsakthiur/web-data-scraper
data scraper streamlit web
Last synced: about 2 months ago
JSON representation
Data text successfully scraped! - Put & Get
- Host: GitHub
- URL: https://github.com/prithivsakthiur/web-data-scraper
- Owner: PRITHIVSAKTHIUR
- Created: 2024-05-14T09:59:25.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-05-31T09:01:41.000Z (8 months ago)
- Last Synced: 2024-05-31T10:24:32.353Z (8 months ago)
- Topics: data, scraper, streamlit, web
- Language: Python
- Homepage: https://huggingface.co/spaces/prithivMLmods/Web-Data-Scraper
- Size: 1.04 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
---
title: Web Data Scrapper
emoji: 🐣🔍
colorFrom: gray
colorTo: indigo
sdk: streamlit
sdk_version: 1.34.0
app_file: app.py
pinned: false
license: creativeml-openrail-m
---![alt text](assets/33.png)
🚀Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
🚀Huggingface Spaces : https://huggingface.co/spaces/prithivMLmods/Web-Data-Scraper
🚀Docs for Space clone : git clone https://huggingface.co/spaces/prithivMLmods/Web-Data-Scraper
## 🔮Entered URL of Microsoft Learn :
.
![alt text](assets/wds.png)
.
## 🎴The Scraped Result in the Space :
![alt text](assets/wds2.png)
.
.
.
## Python Package Index PyPI
| Library Name | Version |
| --- | --- |
| aiohttp | 3.8.5 |
| aiosignal | 1.3.1 |
| altair | 5.0.1 |
| async-timeout | 4.0.2 |
| attrs | 23.1.0 |
| beautifulsoup4 | 4.12.2 |
| blinker | 1.6.2 |
| bs4 | 0.0.1 |
| cachetools | 5.3.1 |
| certifi | 2023.7.22 |
| charset-normalizer | 3.2.0 |
| click | 8.1.6 |
| decorator | 5.1.1 |
| frozenlist | 1.4.0 |
| gitdb | 4.0.10 |
| GitPython | 3.1.32 |
| idna | 3.4 |
| importlib-metadata | 6.8.0 |
| Jinja2 | 3.1.2 |
| jsonschema | 4.18.4 |
| jsonschema-specifications | 2023.7.1 |
| markdown-it-py | 3.0.0 |
| MarkupSafe | 2.1.3 |
| mdurl | 0.1.2 |
| multidict | 6.0.4 |
| numpy | 1.25.2 |
| openai | 0.27.8 |
| packaging | 23.1 |
| pandas | 2.0.3 |
| Pillow | 9.5.0 |
| protobuf | 4.23.4 |
| pyarrow | 12.0.1 |
| pydeck | 0.8.0 |
| Pygments | 2.15.1 |
| Pympler | 1.0.1 |
| python-dateutil | 2.8.2 |
| python-dotenv | 1.0.0 |
| pytz | 2023.3 |
| pytz-deprecation-shim | 0.1.0.post0 |
| referencing | 0.30.0 |
| requests | 2.31.0 |
| rich | 13.5.2 |
| rpds-py | 0.9.2 |
| six | 1.16.0 |
| smmap | 5.0.0 |
| soupsieve | 2.4.1 |
| streamlit | 1.25.0 |
| tenacity | 8.2.2 |
| toml | 0.10.2 |
| toolz | 0.12.0 |
| tornado | 6.3.2 |
| tqdm | 4.65.0 |
| typing-extensions | 4.7.1 |
| tzdata | 2023.3 |
| tzlocal | 4.3.1 |
| urllib3 | 2.0.4 |
| validators | 0.20.0 |
| watchdog | 3.0.0 |
| yarl | 1.9.2 |
| zipp | 3.16.2 |## Make sure about the Lib's
+ StreamLit
+ Requests
+ BS4