{"id":21455738,"url":"https://github.com/luminati-io/g2-dataset-sample","last_synced_at":"2026-01-03T14:32:51.065Z","repository":{"id":261614679,"uuid":"884818409","full_name":"luminati-io/G2-dataset-sample","owner":"luminati-io","description":"A sample dataset of over 1000 G2 software product reviews, extracted using the Bright Data API, ideal for product analysis, competitor analysis, and customer insights. ","archived":false,"fork":false,"pushed_at":"2024-11-07T13:19:24.000Z","size":461,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-01-23T13:13:51.443Z","etag":null,"topics":["datasets","g2","g2-data","g2-dataset","g2-reviews"],"latest_commit_sha":null,"homepage":"https://brightdata.com/products/datasets/g2","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/luminati-io.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-11-07T12:53:27.000Z","updated_at":"2024-11-07T13:19:28.000Z","dependencies_parsed_at":"2024-11-07T14:26:17.533Z","dependency_job_id":"1dd0272a-79d9-4bd2-827e-0866df853c5c","html_url":"https://github.com/luminati-io/G2-dataset-sample","commit_stats":null,"previous_names":["luminati-io/g2-dataset-sample"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FG2-dataset-sample","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FG2-dataset-sample/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FG2-dataset-sample/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FG2-dataset-sample/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/luminati-io","download_url":"https://codeload.github.com/luminati-io/G2-dataset-sample/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243966283,"owners_count":20376041,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["datasets","g2","g2-data","g2-dataset","g2-reviews"],"created_at":"2024-11-23T05:13:16.406Z","updated_at":"2026-01-03T14:32:51.040Z","avatar_url":"https://github.com/luminati-io.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# G2-dataset-samples\n\n\u003ch2\u003eA sample dataset of 1001 G2 software product reviews\u003c/h2\u003e\n\n![G2 dataset header](https://github.com/luminati-io/G2-dataset-sample/blob/main/G2-datasets.png)\n\nA G2 sofware product reviews dataset sample of over 1000 records. Dataset was extracted using the \u003cb\u003eBright Data API\u003c/b\u003e.\n\n\u003ch2\u003eSome of the data points that are included in the dataset:\u003c/h2\u003e\n\n* ```review_id```: The unique identifier for the review\n* ```author_id```: The unique identifier for the author of the review\n* ```author```: The name of the author who wrote the review\n* ```position```: The position or role of the author\n* ```company_size```: The size or type of the author's company\n* ```stars```: The star rating given in the review, indicating the overall satisfaction\n* ```date```: The date when the review was posted\n* ```title```: The title or headline of the review\n* ```text```: The main content or body of the review, providing detailed information about the user's experience\n* ```tags```: Tags associated with the review, indicating key topics or categories\n* ```review_url```: The URL or link to the specific review\n* ```url```: Another URL associated with the review or related content\n* ```product_url```: The URL or link to the software product being reviewed\n* ```page```: The page or location of the review within the G2 website or platform\n* ```product_name```: The name of the product being reviewed\n* ```vendor_name```: The name of the vendor or provider of the product\n* ```pages```: Number of pages in the input\n* ```sort_filter```: Sort or filter option from input\n\nAnd a lot more.\n\nThis is a sample subset which is derived from the \"G2 software product review (public data)\"\ndataset which includes more than \u003cb\u003e580,000 records\u003c/b\u003e.\n\nAvailable dataset file formats: \u003cb\u003eJSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz\u003c/b\u003e.\n\nDataset delivery type options: \u003cb\u003eEmail, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP\u003c/b\u003e.\n\nUpdate frequency: \u003cb\u003eOnce, Daily, Weekly, Monthly, Quarterly, or Custom basis\u003c/b\u003e.\n\nData enrichment available as an addition to the data points extracted: \u003cb\u003eBased on request.\u003c/b\u003e\n\n\u003cb\u003e[Get the full G2 dataset](https://brightdata.com/products/datasets/g2)\u003c/b\u003e.\n\n\u003ch2\u003eWhat are the G2 datasets use cases?\u003c/h2\u003e\n\n\u003ch3\u003e1. Product Analysis and Competitive Benchmarking\u003c/h3\u003e\nLeverage the G2 dataset to evaluate competitor products, customer feedback, and overall market positioning. This analysis helps identify market gaps, refine product features, and adjust your offerings to better meet customer needs and gain a competitive edge.\n\n\u003ch3\u003e2. Customer Insight and Experience Enhancement\u003c/h3\u003e\nUse customer feedback and ratings from the G2 dataset to gain valuable insights into user satisfaction, preferences, and pain points. These insights enable you to tailor marketing strategies, refine product development, and enhance customer experience to better align with customer expectations and needs.\n\n\u003ch3\u003e3. Trend Identification and Market Adaptation\u003c/h3\u003e\nAnalyze the G2 dataset to identify emerging trends in the software industry, helping you anticipate market shifts and align product development with future demands. Staying ahead of these trends enables your business to adapt quickly and maintain a competitive advantage.\n\n\u003ch2\u003eFree access to web scraping tools and datasets for academic researchers and NGOs\u003c/h2\u003e\n\nThe Bright Initiative offers access to Bright Data's \u003cb\u003e[Web Scraper APIs](https://brightdata.com/products/web-scraper)\u003c/b\u003e and \u003cb\u003e[ready-to-use datasets](https://brightdata.com/products/datasets)\u003c/b\u003e to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application \u003cb\u003e[here](https://brightinitiative.com)\u003c/b\u003e.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fluminati-io%2Fg2-dataset-sample","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fluminati-io%2Fg2-dataset-sample","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fluminati-io%2Fg2-dataset-sample/lists"}