Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by scrapinghub

A curated list of projects in awesome lists by scrapinghub.

https://github.com/scrapinghub/portia

Visual scraping for Scrapy

Last synced: 30 Jul 2024

https://github.com/scrapinghub/splash

Lightweight, scriptable browser as a service with an HTTP API

Last synced: 31 Jul 2024

https://github.com/scrapinghub/dateparser

Python parser for human-readable dates

hacktoberfest

Last synced: 31 Jul 2024

https://github.com/scrapinghub/frontera

A scalable frontier for web crawlers

Last synced: 04 Aug 2024

https://github.com/scrapinghub/slackbot

A chat bot for Slack (https://slack.com).

Last synced: 01 Aug 2024

https://github.com/scrapinghub/extruct

Extract embedded metadata from HTML markup

hacktoberfest json-ld microdata microformats opengraph rdfa semantic-web

Last synced: 01 Aug 2024

https://github.com/scrapinghub/spidermon

Scrapy extension for monitoring spider execution.

crawling hacktoberfest monitoring monitoring-tool scraping scrapinghub spiders testing

Last synced: 31 Jul 2024

https://github.com/scrapinghub/price-parser

Extract price amount and currency symbol from a raw text string

hacktoberfest

Last synced: 31 Jul 2024

https://github.com/scrapinghub/article-extraction-benchmark

Article extraction benchmark: dataset and evaluation scripts

Last synced: 04 Aug 2024

https://github.com/scrapinghub/skinfer

Skinfer is a tool for inferring and merging JSON schemas

Last synced: 01 Aug 2024

https://github.com/scrapinghub/scrapy-poet

Page Object pattern for Scrapy

Last synced: 31 Jul 2024

https://github.com/scrapinghub/number-parser

Parse numbers written in natural language

hacktoberfest

Last synced: 04 Aug 2024

https://github.com/scrapinghub/learn.scrapinghub.com

Scrapinghub Learning Center. Report issues in Jira: https://scrapinghub.atlassian.net/projects/WEB

crawling learning python scraping scrapy tutorial

Last synced: 30 Jul 2024

https://github.com/scrapinghub/pycon-speakers

Speakers Spider (PyCon 2014 sprint)

Last synced: 07 Aug 2024

https://github.com/scrapinghub/docker-devpi

PyPI caching service using devpi and Docker

Last synced: 01 Aug 2024

https://github.com/scrapinghub/andi

Library for annotation-based dependency injection

Last synced: 31 Jul 2024