https://github.com/intina47/ee_error

Implementation of a web crawler in C++.

cpp crawler curl gumbo libcurl stanford-nlp web


# Web Crawler

This is a C++ program that implements a web crawler. It uses the libcurl library for making HTTP requests and the Gumbo HTML parsing library for extracting information from HTML documents.
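The repository's source is not reproduced here, so the following is only a minimal sketch of how a fetch step and a link-extraction step might fit together in a breadth-first crawl loop. The names `crawl`, `Fetch`, and `ExtractLinks` are hypothetical and not taken from the project. In the real program the fetch callback would presumably be backed by libcurl and the link extraction by Gumbo; here both are injected as plain callbacks so the loop stays independent of either library.

```cpp
#include <cstddef>
#include <functional>
#include <queue>
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical callback types (assumptions, not the repository's API):
// Fetch returns the HTML body for a URL (the role libcurl plays),
// ExtractLinks returns the URLs found in that HTML (the role Gumbo plays).
using Fetch = std::function<std::string(const std::string&)>;
using ExtractLinks = std::function<std::vector<std::string>(const std::string&)>;

// Breadth-first crawl starting at `seed`, fetching at most `max_pages` pages.
// Returns the URLs in the order they were fetched.
std::vector<std::string> crawl(const std::string& seed, std::size_t max_pages,
                               const Fetch& fetch,
                               const ExtractLinks& extract_links) {
    std::vector<std::string> visited_order;
    std::unordered_set<std::string> seen{seed};  // URLs already queued
    std::queue<std::string> frontier;            // URLs waiting to be fetched
    frontier.push(seed);

    while (!frontier.empty() && visited_order.size() < max_pages) {
        const std::string url = frontier.front();
        frontier.pop();
        visited_order.push_back(url);

        const std::string html = fetch(url);
        for (const std::string& link : extract_links(html)) {
            // insert().second is true only the first time a URL is seen,
            // so each page is queued at most once.
            if (seen.insert(link).second) {
                frontier.push(link);
            }
        }
    }
    return visited_order;
}
```

The `seen` set prevents the crawler from looping forever on pages that link back to each other, and `max_pages` bounds the crawl on sites with many links. Passing the two steps in as callbacks also makes the loop easy to exercise with an in-memory fake site before any network code is wired in.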

## Installation

1. Clone the repository:

```
git clone https://github.com/Intina47/EE_error.git
```

2. Install the required dependencies: libcurl and the Gumbo HTML parser, including their development headers, must be available on your system.

3. Build the project with `make` (a C++ compiler is required):

```
make
```

4. Run the executable:

```
./crawler
```