Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/luminati-io/instagram-dataset-samples

Sample datasets of over 400 Instagram coding influencers
https://github.com/luminati-io/instagram-dataset-samples

dataset influencers instagram instagram-dataset marketing

Last synced: 9 days ago
JSON representation

Sample datasets of over 400 Instagram coding influencers

Awesome Lists containing this project

README

        

# instagram-dataset-samples

A sample dataset of 2180 Instagram coding influencers

![instagram dataset header](https://github.com/luminati-io/Instagram-dataset-samples/blob/main/instagram-datasets.PNG)

A github dataset sample of over 2000 leading Instagram [Github](https://www.instagram.com/explore/tags/github/) coding influencers. Dataset was extracted using the Bright Data Collector.

Data points included in this free dataset:

* followers count
* profile type
* account type
* engagement score
* categories
* location
* external/bio links
* hashtags used
* brand affiliation
* bio
* highlights
* posts

This is a sample subset which is derived from the "All Instagram account, business & nonbusiness (public data)"
dataset which includes 614,000,000 Instagram profiles.

In this example, the large dataset was filtered down into a smaller subset using smart filter queries available on the Bright Data control panel.

Queries used for filtering this subset:

* $or: [{"post_hashtags":"github"},{"bio_hashtags":"github"}]
* followers: {"$gt":100}

Additional filter query values include: Posts count, cuntry, verified account, multiple hashtag combinations and more.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet..

Dataset delivery type options: API download, Amazon S3, Google cloud, Microsoft Azure, SFTP.

Data enrichment available as an addition to the data points extracted: Avg. post engagement rate, brand affiliation and more.

Get the full [Instagram dataset](https://brightdata.com/products/datasets/instagram).

Additional Instagram datasets available:

* 635,000,000 "Instagram profiles dataset"
* 89,000,000 "Instagram posts dataset"
* 12,490,000 "Instagram reels dataset"
* 206,000 "Instagram comments dataset"

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's [Web Scraper APIs](https://brightdata.com/products/web-scraper) to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application [here](https://brightinitiative.com).