An open API service indexing awesome lists of open source software.

https://github.com/comsavvy/punch-scraping-engine

Scraping the top Punch news
https://github.com/comsavvy/punch-scraping-engine

news newsfeed punch python3 scrapy web-scraping

Last synced: 4 months ago
JSON representation

Scraping the top Punch news

Awesome Lists containing this project

README

          

# News
This code is for scraping the latest Punch News (here) by crawling through different NEWS url.


End product:
- The URL of the News
- Title of the news
- News content

*All in one file!*
This project has three branches:
1. main: For storing the NEWS into a text file.
2. CSV: For storing the NEWS into a csv file.
3. deployment:This can be deployed in SCRAPYHUB platform
# Requirement
*scrapy_engine.py* module will handle the installation of the necessary libraries,
are you scared if the libraries is too much?

Don't be!
Because we are only installing one library called **SCRAPY**

But to install it manually,

copy and paste this **pip install scrapy** to your console.

You can visit the **SCRAPY** documentation if you are curious about how it works.