https://github.com/rrmn/crowdr
Scrape g2crowd reviews with R
- Host: GitHub
- URL: https://github.com/rrmn/crowdr
- Owner: rrmn
- License: mit
- Created: 2019-01-04T09:32:13.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-01-12T15:32:24.000Z (over 7 years ago)
- Last Synced: 2025-01-21T00:49:49.857Z (over 1 year ago)
- Topics: r, web-scraping
- Language: R
- Size: 16.6 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# crowdr
Scrape g2crowd reviews with R
# Usage
## Scrape urls from category pages

    scrape_urls(category)

Example usage:

    > scrape_urls("crm")
    # A tibble: 330 x 4
       name                            no_reviews avg_rating url
     1 Salesforce CRM                        7177        4.2 https://www.g2crow…
     2 HubSpot CRM                           2241        4.2 https://www.g2crow…
     3 Microsoft Dynamics 365 for Sal…        868        3.6 https://www.g2crow…
     4 Freshsales                             444        4.7 https://www.g2crow…
     5 Pipedrive                              787        4.3 https://www.g2crow…
     6 Zoho CRM                               890        3.8 https://www.g2crow…
     7 Zendesk Sell                           323        4.2 https://www.g2crow…
     8 Copper                                 388        4.6 https://www.g2crow…
     9 PipelineDeals                          525        4.4 https://www.g2crow…
    10 Nimble                                 550        4.5 https://www.g2crow…
    # … with 320 more rows

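
Once the result is in hand, it can be filtered and sorted like any other data frame. The sketch below does not call `scrape_urls()` (which needs network access); instead it hand-builds a small sample from a few of the rows printed above. The `url` column is left out because it is truncated in the printout, and the 500-review threshold is an arbitrary assumption for illustration.

```r
# Hand-built sample mirroring a few rows of the printed output above.
# The url column is omitted because it is truncated in the printout.
crm <- data.frame(
  name       = c("Salesforce CRM", "HubSpot CRM", "Freshsales", "Pipedrive", "Nimble"),
  no_reviews = c(7177, 2241, 444, 787, 550),
  avg_rating = c(4.2, 4.2, 4.7, 4.3, 4.5),
  stringsAsFactors = FALSE
)

# Keep products with at least 500 reviews (arbitrary threshold).
popular <- crm[crm$no_reviews >= 500, ]

# Sort by average rating, highest first.
popular <- popular[order(-popular$avg_rating), ]

print(popular[, c("name", "no_reviews", "avg_rating")])
```

The same subsetting should work on the tibble returned by `scrape_urls()`, assuming its columns are numeric as in the printout.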
## Scrape review texts from urls (not yet functional)

    scrape_reviews(urls)
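
Since `scrape_reviews()` is not implemented yet, the following is only a rough sketch of how review text extraction could look with the `rvest` package. It is not the author's method: the HTML snippet, the CSS class names, and the helper `extract_review_texts()` are all hypothetical placeholders, and the real page markup would need to be inspected first.

```r
library(rvest)

# Hypothetical markup: the real page structure is not documented in this
# README, so the class names below are placeholders.
page <- minimal_html('
  <div class="review"><p class="review-text">Easy to set up.</p></div>
  <div class="review"><p class="review-text">Reporting could be better.</p></div>
')

# Hypothetical helper, not part of crowdr: returns the text of every node
# matching the given CSS selector.
extract_review_texts <- function(page, selector = ".review .review-text") {
  nodes <- html_elements(page, selector)
  html_text2(nodes)
}

texts <- extract_review_texts(page)
print(texts)
```

For live pages, `read_html(url)` would take the place of `minimal_html()`, applied to each entry of the `url` column returned by `scrape_urls()`.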
# Changelog
## 2019-01-12 v0.2.1
Updates
* Fixed the output data type bug (output was returned as "factor")
## 2019-01-12 v0.2.0
Updates
* scrape_urls() works as intended
Known Bugs
* Output data type is "factor"
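
The changelog does not say what caused the factor output. One common source in R versions before 4.0.0 is `data.frame()` defaulting to `stringsAsFactors = TRUE`. Whatever the cause was here, the sketch below shows why factor-typed columns are a problem for numeric data such as review counts: converting a factor directly to numeric yields the internal level codes, not the original values.

```r
# Review counts stored as a factor, as can happen when strings are
# converted to factors on the way into a data frame.
no_reviews <- factor(c("7177", "2241", "868"))

# as.numeric() on a factor returns the internal level codes, not the values.
codes <- as.numeric(no_reviews)

# Converting through character first recovers the actual numbers.
values <- as.numeric(as.character(no_reviews))

print(codes)
print(values)
```

Here `codes` comes out as 2, 1, 3 (the positions of each value among the sorted levels), while `values` holds 7177, 2241, 868.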