Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yarnaimo/html-content-extractor

Extract content from web page
https://github.com/yarnaimo/html-content-extractor

Last synced: about 1 month ago
JSON representation

Extract content from web page

Host: GitHub
URL: https://github.com/yarnaimo/html-content-extractor
Owner: yarnaimo
License: mit
Created: 2020-11-05T03:44:07.000Z (about 4 years ago)
Default Branch: master
Last Pushed: 2020-11-10T06:18:09.000Z (about 4 years ago)
Last Synced: 2024-10-15T15:42:25.742Z (3 months ago)
Language: TypeScript
Homepage:
Size: 104 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # html-content-extractor

Extract main text content from web page

> This library only works on **rendered HTML document** since it extracts contents based on the visual size of the elements.

## Install

```sh

npm i -S html-content-extractor

# or

yarn add html-content-extractor

```

## Usage

```ts

import { extractMainContent } from 'html-content-extractor'

extractMainContent() // => string | undefined

```