Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/cdk-dev/link-scraper

Extract Preview Data from Websites
https://github.com/cdk-dev/link-scraper

aws-cdk cdk cdk8s cdktf constructs jsii terraform-cdk

Last synced: 3 months ago
JSON representation

Extract Preview Data from Websites

Awesome Lists containing this project

README

        

# Content Preview Scraper

This uses [Playwright](https://github.com/microsoft/playwright) to extract content previews from a givenn url. This includes:

- Generic Metadata from Dom
- Open Graph Metadata
- Twitter Tags Metadata
- Screenshot (viewport / full)

Still to do: Scrape author data from a social media profile such as Twitter, Github, LinkedIn