Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yarnaimo/html-content-extractor
Extract content from web page
https://github.com/yarnaimo/html-content-extractor
Last synced: about 1 month ago
JSON representation
Extract content from web page
- Host: GitHub
- URL: https://github.com/yarnaimo/html-content-extractor
- Owner: yarnaimo
- License: mit
- Created: 2020-11-05T03:44:07.000Z (about 4 years ago)
- Default Branch: master
- Last Pushed: 2020-11-10T06:18:09.000Z (about 4 years ago)
- Last Synced: 2024-10-15T15:42:25.742Z (3 months ago)
- Language: TypeScript
- Homepage:
- Size: 104 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# html-content-extractor
Extract main text content from web page
> This library only works on **rendered HTML document** since it extracts contents based on the visual size of the elements.
## Install
```sh
npm i -S html-content-extractor
# or
yarn add html-content-extractor
```## Usage
```ts
import { extractMainContent } from 'html-content-extractor'extractMainContent() // => string | undefined
```