https://github.com/m15kh/web_scraping
A collection of Python scripts for web scraping using Beautiful Soup, starting with a Hacker News scraper.
https://github.com/m15kh/web_scraping
python-scripts web-scraping
Last synced: about 1 month ago
JSON representation
A collection of Python scripts for web scraping using Beautiful Soup, starting with a Hacker News scraper.
- Host: GitHub
- URL: https://github.com/m15kh/web_scraping
- Owner: m15kh
- Created: 2023-10-10T13:48:22.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-04-10T07:37:40.000Z (about 1 year ago)
- Last Synced: 2025-04-10T08:38:07.099Z (about 1 year ago)
- Topics: python-scripts, web-scraping
- Language: Python
- Homepage:
- Size: 5.86 KB
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Web-Scraping
A collection of web scraping projects to extract useful information from various websites
· Documentation · Report Bug · Request Feature
# :notebook_with_decorative_cover: Table of Contents
- [About the Project](#star2-about-the-project)
- [Projects](#rocket-projects)
- [Hacker News Scraper](#computer-hacker-news-scraper)
- [Getting Started](#toolbox-getting-started)
- [Prerequisites](#bangbang-prerequisites)
- [Installation](#gear-installation)
- [Usage](#eyes-usage)
- [Contributing](#wave-contributing)
- [Contact](#handshake-contact)
## :star2: About the Project
This repository contains a collection of web scraping scripts to extract information from various websites. Web scraping is the process of automatically collecting data from websites, which can be useful for data analysis, monitoring, and research purposes.
## :rocket: Projects
### :computer: Hacker News Scraper
The `hackers_news.py` script scrapes the front page of [Hacker News](https://news.ycombinator.com/news) and extracts the following information for each article:
- Title
- Link
- Score (points)
- Number of comments
The data is collected using Beautiful Soup and printed to the console in an easily readable format.
## :toolbox: Getting Started
### :bangbang: Prerequisites
- Python 3.6 or higher
- pip (Python package installer)
### :gear: Installation
1. Clone the repository:
```
```