An open API service indexing awesome lists of open source software.

https://github.com/kalebu/link-scraper-in-python

A Python script to scrap all links in a given website using requests and Beautiful soup
https://github.com/kalebu/link-scraper-in-python

link-scraper-python python python-bs4 python-requests python-script python-webscraping-application

Last synced: 7 months ago
JSON representation

A Python script to scrap all links in a given website using requests and Beautiful soup

Awesome Lists containing this project

README

          

# Link-scraper-in-python
A Python script to scrap all links in a given website using requests and Beautiful soup

Detailed article
-----------------
The detailed article on how to scrap all links in a given website can be found on [my blog](kalebujordan.com) with an article titled [How to extract all website link in Python](https://kalebujordan.com/learn-how-to-extract-all-links-from-a-website-in-python/).

Getting started
----------------

To get started exploring this code you might have to clone or download the repository just as shown
below;

```bash
-> git clone https://github.com/Kalebu/Link-scraper-in-python
-> cd Link-scraper-in-python
```

Dependencies
------------
To successfully run this code you're supposed to have requests and BeautifulSoup libary installed on your machine

```bash
-> pip install requests
-> pip install beautifulsoup4
```

Running
--------
Now that we have everything already set up , lets run our code just as shown below;

```bash

Link-scraper-in-python -> python link_spider.py
Enter URL of the site : https://kalebujordan.com/
['#content', 'https://kalebujordan.com/', 'https://kalebujordan.com/', 'https://kalebujordan.com/category/projects/...]

```

Explore it
-----------
Now keep explore it by testing it with various input links to see what links it will scrap

Give it a star
--------------
Did you find this information useful, then give it a star

Credits
-----------
All the credits to [kalebu](github.com/kalebu)