Antch
====

[![Build Status](https://travis-ci.org/antchfx/antch.svg?branch=master)](https://travis-ci.org/antchfx/antch)
[![Coverage Status](https://coveralls.io/repos/github/antchfx/antch/badge.svg?branch=master)](https://coveralls.io/github/antchfx/antch?branch=master)
[![Go Report Card](https://goreportcard.com/badge/github.com/antchfx/antch)](https://goreportcard.com/report/github.com/antchfx/antch)
[![GoDoc](https://godoc.org/github.com/antchfx/antch?status.svg)](https://godoc.org/github.com/antchfx/antch)

**Antch** is inspired by [Scrapy](https://scrapy.org/). If you're familiar with Scrapy,
you can get started quickly.

Antch is a fast, powerful and extensible web crawling & scraping framework for Go, used
to crawl websites and extract structured data from their pages.

Get Started
====

[Getting Started](https://github.com/antchfx/antch-getstarted)

Follow the Getting Started instructions to start your first spider.

- [English](https://github.com/antchfx/antch/wiki/Getting-Started)
- [简体中文](https://github.com/antchfx/antch/wiki/%E5%BF%AB%E9%80%9F%E5%85%A5%E9%97%A8)

Features
====

- Polite, highly concurrent web crawler.
- Powerful and customizable HTTP middleware.
- Item data pipeline for the web spider.
- Built-in proxy support (HTTP, HTTPS, SOCKS5).
- Built-in XPath query support for HTML/XML documents.
- Easy to use and integrate with your project.

Examples
====

[BingWallpaper](https://github.com/zhengchun/bingwallpaper) - Bing daily wallpaper.

Documentation
====

See [https://github.com/antchfx/antch/wiki](https://github.com/antchfx/antch/wiki)