https://github.com/rrmn/crowdr

Scrape g2crowd reviews with R

r web-scraping

Last synced: about 1 year ago

Host: GitHub
URL: https://github.com/rrmn/crowdr
Owner: rrmn
License: MIT
Created: 2019-01-04T09:32:13.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2019-01-12T15:32:24.000Z (over 7 years ago)
Last Synced: 2025-01-21T00:49:49.857Z (over 1 year ago)
Topics: r, web-scraping
Language: R
Size: 16.6 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# crowdr
Scrape g2crowd reviews with R

# Usage

## Scrape URLs from category pages

`scrape_urls(category)`

Example usage:

`scrape_urls("crm")` returns a tibble with 330 rows and 4 columns. The first 10 rows are listed here; long values are truncated as in the printed output.

| name | no_reviews | avg_rating | url |
| --- | ---: | ---: | --- |
| Salesforce CRM | 7177 | 4.2 | https://www.g2crow… |
| HubSpot CRM | 2241 | 4.2 | https://www.g2crow… |
| Microsoft Dynamics 365 for Sal… | 868 | 3.6 | https://www.g2crow… |
| Freshsales | 444 | 4.7 | https://www.g2crow… |
| Pipedrive | 787 | 4.3 | https://www.g2crow… |
| Zoho CRM | 890 | 3.8 | https://www.g2crow… |
| Zendesk Sell | 323 | 4.2 | https://www.g2crow… |
| Copper | 388 | 4.6 | https://www.g2crow… |
| PipelineDeals | 525 | 4.4 | https://www.g2crow… |
| Nimble | 550 | 4.5 | https://www.g2crow… |

The remaining 320 rows are omitted.
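The result can be filtered and sorted like any other data frame. The following is a minimal sketch, not part of crowdr itself: it rebuilds the ten example rows by hand so that it runs without network access, leaves out the `url` column because it is truncated in the printed output, and uses the illustrative names `products` and `top_rated`.

```r
# First ten rows of the example output, rebuilt by hand for illustration.
# The url column is omitted because it is truncated in the printed output.
products <- data.frame(
  name = c("Salesforce CRM", "HubSpot CRM",
           "Microsoft Dynamics 365 for Sal\u2026",  # truncated as printed
           "Freshsales", "Pipedrive", "Zoho CRM", "Zendesk Sell",
           "Copper", "PipelineDeals", "Nimble"),
  no_reviews = c(7177L, 2241L, 868L, 444L, 787L, 890L, 323L, 388L, 525L, 550L),
  avg_rating = c(4.2, 4.2, 3.6, 4.7, 4.3, 3.8, 4.2, 4.6, 4.4, 4.5),
  stringsAsFactors = FALSE
)

# Keep products rated 4.5 or higher, most-reviewed first.
top_rated <- products[products$avg_rating >= 4.5, ]
top_rated <- top_rated[order(-top_rated$no_reviews), ]

print(top_rated$name)
```

With real output from `scrape_urls()`, the same two subsetting steps should apply unchanged, assuming the column names match those in the example.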

## Scrape review texts from URLs (not yet functional)

`scrape_reviews(urls)`

# Changelog

## 2019-01-12 v0.2.1

Updates

* Fixed the output data type bug (output was returned as "factor")

## 2019-01-12 v0.2.0

Updates

* scrape_urls() works as intended

Known Bugs

* Output data type is "factor"
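To illustrate why a "factor" output type is a problem, here is a small base R sketch. It assumes the numeric columns were among those returned as factors, which the changelog does not state explicitly; the variable names are illustrative.

```r
# Review counts stored as a factor (assumed shape of the v0.2.0 bug).
no_reviews <- factor(c("7177", "2241", "868"))

# Direct conversion returns the internal level codes, not the counts.
codes <- as.integer(no_reviews)

# Converting via character first recovers the original numbers.
counts <- as.integer(as.character(no_reviews))

print(codes)
print(counts)
```

Arithmetic or sorting on `codes` would silently give wrong results, which is why character or numeric output is preferable for scraped values.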
