https://github.com/surbhitt/qaahl

A crawler that can scrape web pages and visualize the path it crawled.

beautifulsoup4 pygame webcrawler webscraper

# Objective

To develop an application capable of scraping data from web pages, with a GUI that provides visuals to assist the process. The application is meant to be lightweight, multithreaded, and interactive.
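
One common way to keep a GUI responsive while pages are being fetched is to do the fetching on a worker thread and pass results to the main loop through a queue. The sketch below illustrates that pattern only; it is not taken from Qaahl's source. The names `crawl_worker`, `run`, `fetch_page`, and `on_result` are hypothetical.

```python
import queue
import threading


def crawl_worker(urls, fetch_page, results):
    """Fetch each URL on a background thread and queue the outcome."""
    for url in urls:
        try:
            results.put((url, fetch_page(url)))
        except Exception as exc:  # report the failure instead of killing the thread
            results.put((url, exc))
    results.put(None)  # sentinel: no more results will arrive


def run(urls, fetch_page, on_result):
    """Main loop: poll the queue so the caller's thread never blocks for long."""
    results = queue.Queue()
    worker = threading.Thread(
        target=crawl_worker, args=(urls, fetch_page, results), daemon=True
    )
    worker.start()
    while True:
        # A GUI loop would also process input events and redraw at this point.
        try:
            item = results.get(timeout=0.05)
        except queue.Empty:
            continue
        if item is None:
            break
        on_result(*item)
    worker.join()
```

In a Pygame application, the body of the `while` loop would sit inside the existing frame loop, so each frame drains whatever results are ready and then redraws.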

# Introduction

Qaahl is a minimalist web crawler that provides the basic functionality of a scraper, with a lightweight graphical interface built using Pygame.



# Libraries utilised

- Pygame
- Requests
- Beautiful Soup (bs4)

# Run

```console
pip install -r requirements.txt
python3 main.py $URL $flags
```

| Placeholder | Replace with |
| ----------- | ------------ |
| `$URL` | the link to use as the base URL |
| `$flags` | optional flags, e.g. `-d 3` performs a depth=3 traversal |
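
To illustrate what a depth-limited traversal from a base URL can look like, here is a minimal breadth-first sketch. It is not Qaahl's implementation. It uses only the standard library (`html.parser` and `urllib`) in place of Requests and Beautiful Soup so that it stays self-contained. It also assumes that the base URL is depth 0 and that pages at the depth limit are recorded but not expanded. The names `LinkParser`, `fetch`, and `crawl` are hypothetical.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch(url):
    """Download a page and return its HTML as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(base_url, max_depth, fetch_page=fetch):
    """Breadth-first crawl; returns the (parent, child) edges that were followed."""
    visited = {base_url}
    edges = []
    pending = deque([(base_url, 0)])
    while pending:
        url, depth = pending.popleft()
        if depth >= max_depth:
            continue  # assumption: pages at the depth limit are not expanded
        try:
            html = fetch_page(url)
        except (OSError, ValueError):
            continue  # skip pages that fail to load
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" suffixes.
            child = urldefrag(urljoin(url, href)).url
            if not child.startswith(("http://", "https://")):
                continue  # ignore mailto:, javascript:, and similar schemes
            if child not in visited:
                visited.add(child)
                edges.append((url, child))
                pending.append((child, depth + 1))
    return edges
```

The returned edge list is the path crawled, in the order links were discovered, which is the kind of data a visualizer could draw as a graph. Passing `fetch_page` as a parameter lets the traversal be exercised against canned HTML without touching the network.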