https://github.com/kartikmehta8/htmlaas
Why to use a third party API when you can yourself create one? Extract HTML content by tag name, class, id, links from HTML as a Service.
https://github.com/kartikmehta8/htmlaas
beautifulsoup4 fastapi python requests
Last synced: 3 months ago
JSON representation
Why to use a third party API when you can yourself create one? Extract HTML content by tag name, class, id, links from HTML as a Service.
- Host: GitHub
- URL: https://github.com/kartikmehta8/htmlaas
- Owner: kartikmehta8
- License: mit
- Created: 2023-05-14T06:13:23.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-05-14T06:33:29.000Z (about 2 years ago)
- Last Synced: 2025-01-16T05:55:23.857Z (4 months ago)
- Topics: beautifulsoup4, fastapi, python, requests
- Language: Python
- Homepage:
- Size: 5.86 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README

Why to use a third party API when you can yourself create one?HTMLaaS offers users the ability to query and extract specific information from HTML documents. Users can retrieve the page `title`, extract all `links`, access the `entire HTML content`, and query elements by `tag names`, `classes`, and `IDs`. This simplifies working with HTML, allowing users to extract desired data efficiently.
### Technologies
```py
from fastapi import FastAPI
from pydantic import BaseModel
import requests
from bs4 import BeautifulSoup
```### Run
```
pip install uvicorn
uvicorn app:app --reload
``````
Visit /docs route for Swagger documentation.
```
Made & open-sourced with ❤️ by kartikmehta8