https://github.com/maxhalford/web-crawler-l3sid



# Perl Web Crawler

This was one of my first university assignments (``tp5.pdf``) in my junior year. The idea was to crawl through a list of links, extract the relevant data, and output it in a clean format (i.e. JSON, SQL and HTML). I coded a simple terminal interface so that the user can change how the script runs (sorry, it's in French!).

![Terminal](example1.png)

If the user runs the script with ``CTRL+D``, the sample output is the following.

![Sample](example2.png)

A full explanation is available in the PDF file named ``Rapport.pdf``.
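
To make the crawl-and-export idea above concrete, here is a minimal sketch in Python. It is not the Perl code from this repository: the names (``PageParser``, ``extract``, ``to_json``, ``to_sql``, ``to_html``), the extracted fields (page title and outgoing links), the ``pages`` table and the sample page are all assumptions made for illustration. To stay self-contained it parses an in-memory HTML string instead of fetching real URLs.

```python
import json
from html import escape
from html.parser import HTMLParser

# Hypothetical stand-in for fetched pages (url -> HTML). A real crawler
# would download each URL from its list of links instead.
SAMPLE_PAGES = {
    "http://example.com/": (
        "<html><head><title>Example</title></head>"
        "<body><a href='/about'>About</a></body></html>"
    ),
}


class PageParser(HTMLParser):
    """Collects the <title> text and every href found in <a> tags."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data.strip()


def extract(url, html_text):
    """Return one record (a dict) describing a page."""
    parser = PageParser()
    parser.feed(html_text)
    parser.close()
    return {"url": url, "title": parser.title, "links": parser.links}


def to_json(records):
    """Serialize the records as an indented JSON array."""
    return json.dumps(records, indent=2, ensure_ascii=False)


def to_sql(records, table="pages"):
    """Emit one INSERT per record; the table and columns are hypothetical."""
    def quote(value):
        # Double single quotes so the value is a valid SQL string literal.
        return "'" + value.replace("'", "''") + "'"

    return "\n".join(
        f"INSERT INTO {table} (url, title) VALUES "
        f"({quote(r['url'])}, {quote(r['title'])});"
        for r in records
    )


def to_html(records):
    """Render the records as an HTML list of links."""
    items = "\n".join(
        f'  <li><a href="{escape(r["url"], quote=True)}">'
        f'{escape(r["title"])}</a></li>'
        for r in records
    )
    return f"<ul>\n{items}\n</ul>"


if __name__ == "__main__":
    records = [extract(url, page) for url, page in SAMPLE_PAGES.items()]
    print(to_json(records))
    print(to_sql(records))
    print(to_html(records))
```

Keeping extraction separate from the three exporters means every output format is built from the same list of records, so adding another format only requires one more function.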