https://github.com/aduda-boaz/web-scrapper
This project is all about web scraping, getting data or information from a website.
- Host: GitHub
- URL: https://github.com/aduda-boaz/web-scrapper
- Owner: Aduda-Boaz
- License: mit
- Created: 2021-05-27T04:23:29.000Z (over 3 years ago)
- Default Branch: development
- Last Pushed: 2021-06-14T14:27:30.000Z (over 3 years ago)
- Last Synced: 2024-10-02T12:41:14.614Z (4 months ago)
- Language: Ruby
- Homepage:
- Size: 599 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Web Scraping using Nokogiri and HTTParty
## Definition of The Project
This project is all about web scraping: getting data or information from a website.
## About Project
The purpose of this project is to build a web scraping tool. It was built with Ruby and the HTTParty and Nokogiri gems.
- Nokogiri is an HTML, XML, SAX, and Reader parser.
- These gems can be sourced from [RubyGems](https://rubygems.org/).
In this project, I created a scraper which extracts available remote jobs at ()
## Preview
![screenshot](./scraping.png)
![screenshot](./indeed_scrap.png)
## Built With
- Rubocop
- Ruby
- HTTParty
- Nokogiri
- RSpec
- PP
## Getting Started
To get started, first clone this repository to your local machine.
```
git clone https://github.com/Aduda-Boaz/Capstone2-web-scrapper
```
## Prerequisites
- Ruby installed on your local machine
- A text editor (preferably VSCode, Atom, or Sublime)
- Git
- Chrome browser
## Setup
If you have `Ruby` installed on your machine:
1. Clone the project to your local machine using the `git clone` command, or download the zip file.
2. Go into the project directory using the `cd directory name` command.
3. Install the required gems by using `bundle`
   - `gem install httparty`
   - `gem install nokogiri`
   - `gem install rspec`
   - `gem install pp`
4. From the root directory, run the `ruby bin/main.rb` command.
5. Give the project some time to load. The results will then appear in your terminal and in the .csv file created in the root folder. Enjoy the scraping!
6. Run the `rspec` command to test the various methods in the classes.
## Contributing
Contributions, issues and feature requests are welcome!
You can do the following:
1. Fork the project
2. Clone the project to your local machine
3. cd into the project directory
4. Run git checkout -b your-branch-name
5. Make your contributions
6. Push your branch up to your forked repository
7. Open a Pull Request with a detailed description to the development branch of the original project for a review

Please feel free to contribute to any of these!
Feel free to check the [issues page](https://github.com/Aduda-Boaz/Capstone2-web-scrapper/issues).
## Author
👤 **Boaz Aduda**
- Github: [@Aduda-Boaz](https://github.com/Aduda-Boaz)
- Twitter: [@BoazAduda](https://twitter.com/BoazAduda)
## Show your support
Give a :star2: if you like this project! :blush: