https://github.com/tomtom828/mongodb-web-scraper
A Node.js & MongoDB webapp that web-scrapes news data and allows users to comment about it.
https://github.com/tomtom828/mongodb-web-scraper
mongodb nodejs webscraper
Last synced: 4 months ago
JSON representation
A Node.js & MongoDB webapp that web-scrapes news data and allows users to comment about it.
- Host: GitHub
- URL: https://github.com/tomtom828/mongodb-web-scraper
- Owner: tomtom828
- Created: 2016-12-04T23:36:23.000Z (about 8 years ago)
- Default Branch: master
- Last Pushed: 2016-12-08T03:57:20.000Z (about 8 years ago)
- Last Synced: 2024-11-04T18:45:58.914Z (4 months ago)
- Topics: mongodb, nodejs, webscraper
- Language: HTML
- Homepage: http://paper-scraper.herokuapp.com/
- Size: 2.4 MB
- Stars: 3
- Watchers: 2
- Forks: 6
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# :newspaper: Paper Scraper (aka Onion Peeler) :trollface:
A `Node.js` & `MongoDB` webapp that web-scrapes news data from [The Onion](http://www.theonion.com/) and allows users to comment about what they have read. Users can also delete unwanted comments.Please check out the deployed version in Heroku [here](http://paper-scraper.herokuapp.com/)!
## Functionality
On the backend, the app uses `express` to serve routes and `mongoose` to interact with a `MongoDB` database.On the frontend, the app uses `handlebars` for templating each article and `materialize` as a styling framework. The app also uses `jQuery` and `AJAX` to help with making post requests.
And for webscraping, the app uses the `request` and `cheerio` node packages. All webscrapping code can be found in the `controllers.js` file.
## Cloning down the repo
If you wish to clone the app down to your local machine...
1. Ensure that you have MongoDB set up on your laptop
* An amazing repo to get you started with that can be found [here](https://github.com/dannyvassallo/mongo_lesson).
2. Once you are set up, `cd` into this repo and run `npm install`.
3. Then open another bash or terminal window and run `mongod`
4. Run the script with `node server.js`.
5. Navigate to `localhost:3000` in your browser.## Screenshots
#### The `/articles` route renders all the news articles
data:image/s3,"s3://crabby-images/3e9c1/3e9c16419d8419b2a560541b50e93145dd4baece" alt="All Articles"#### Click on the globe icon to view the content.
data:image/s3,"s3://crabby-images/986eb/986eb727edeebdce9160ea8e87f8c036fb41a354" alt="Article Content"#### Click the Chat Bubble icon to add a comment via the `/add/comment/:id` post route
data:image/s3,"s3://crabby-images/74e9f/74e9ff55c4f9f0b2cb0ad51a7799487a1f95d0b9" alt="Add Comment"#### Click the Thumbs up/down icon to view comments
data:image/s3,"s3://crabby-images/b2943/b2943929d42dc563da3b63660a611920a5831787" alt="View Comment"#### Click the Delete button to remove rude comments via the `remove/comment/:id` route
data:image/s3,"s3://crabby-images/bd834/bd83434f9300a71680c644effb9226e26c847d85" alt="Delete Comment"#### Note that the web scraping occurs on the `/scrape` route.
#### On visiting the index route, `/`, express redirects to `/scrape` and then `/articles` routes