Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/luminati-io/instagram-dataset-samples
Sample datasets of over 400 Instagram coding influencers
dataset influencers instagram instagram-dataset marketing
Last synced: 9 days ago
- Host: GitHub
- URL: https://github.com/luminati-io/instagram-dataset-samples
- Owner: luminati-io
- Created: 2022-03-27T13:34:03.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2024-12-19T09:44:56.000Z (about 1 month ago)
- Last Synced: 2024-12-19T10:39:01.133Z (about 1 month ago)
- Topics: dataset, influencers, instagram, instagram-dataset, marketing
- Homepage:
- Size: 32 MB
- Stars: 11
- Watchers: 3
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# instagram-dataset-samples
A sample dataset of 2180 Instagram coding influencers
![instagram dataset header](https://github.com/luminati-io/Instagram-dataset-samples/blob/main/instagram-datasets.PNG)
A dataset sample of over 2,000 leading Instagram [GitHub](https://www.instagram.com/explore/tags/github/) coding influencers. The dataset was extracted using the Bright Data Collector.
Data points included in this free dataset:
* followers count
* profile type
* account type
* engagement score
* categories
* location
* external/bio links
* hashtags used
* brand affiliation
* bio
* highlights
* posts

This is a sample subset derived from the "All Instagram account, business & nonbusiness (public data)" dataset, which includes 614,000,000 Instagram profiles. In this example, the large dataset was filtered down into a smaller subset using smart filter queries available on the Bright Data control panel.
Queries used for filtering this subset:
* $or: [{"post_hashtags":"github"},{"bio_hashtags":"github"}]
* followers: {"$gt":100}

Additional filter query values include posts count, country, verified account, multiple hashtag combinations, and more.
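
To illustrate what the two filter queries above select, here is a minimal Python sketch that applies the same conditions to records already loaded in memory. It is not the Bright Data control panel's implementation. It assumes that the two queries are combined with AND, that `post_hashtags` and `bio_hashtags` hold either a single string or a list of strings, and that `followers` is a number. The `account` field and the sample records are invented for illustration.

```python
from typing import Any


def field_matches(value: Any, target: str) -> bool:
    """Mimic MongoDB-style equality: match a scalar or any element of a list."""
    if isinstance(value, list):
        return target in value
    return value == target


def keep_profile(profile: dict[str, Any]) -> bool:
    """Apply both filters from the list above to one profile record."""
    # $or: [{"post_hashtags":"github"},{"bio_hashtags":"github"}]
    has_github_tag = (
        field_matches(profile.get("post_hashtags"), "github")
        or field_matches(profile.get("bio_hashtags"), "github")
    )
    # followers: {"$gt":100}; a missing or null count is treated as 0 (assumption)
    followers = profile.get("followers") or 0
    return has_github_tag and followers > 100


# Hypothetical records, invented for illustration only.
profiles = [
    {"account": "a", "followers": 2500, "post_hashtags": ["github", "python"], "bio_hashtags": []},
    {"account": "b", "followers": 50, "post_hashtags": ["github"], "bio_hashtags": None},
    {"account": "c", "followers": 900, "post_hashtags": ["travel"], "bio_hashtags": ["github"]},
    {"account": "d", "followers": 101, "post_hashtags": None, "bio_hashtags": None},
]

subset = [p["account"] for p in profiles if keep_profile(p)]
print(subset)  # ['a', 'c']
```

Profile `b` is dropped because its follower count is not greater than 100, and `d` is dropped because neither hashtag field contains `github`.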
Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet.
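
For the NDJSON / JSON Lines formats, a file can be read one record per line with the Python standard library alone. The sketch below is only an illustration. The helper name `read_json_lines` and the `account` field are hypothetical, and the in-memory stream stands in for a downloaded sample file.

```python
import io
import json
from typing import Any, Iterator, TextIO


def read_json_lines(stream: TextIO) -> Iterator[dict[str, Any]]:
    """Yield one record per non-empty line of an NDJSON / JSON Lines stream."""
    for line in stream:
        line = line.strip()
        if line:  # skip blank lines between records
            yield json.loads(line)


# In-memory stand-in for a downloaded file; the records are invented.
sample = io.StringIO(
    '{"account": "a", "followers": 2500}\n'
    '\n'
    '{"account": "b", "followers": 50}\n'
)
records = list(read_json_lines(sample))
print(len(records), records[0]["followers"])  # 2 2500
```

With a real file, the same function can be given an open file handle, for example `open(path, encoding="utf-8")`, where `path` points at the downloaded sample.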
Dataset delivery type options: API download, Amazon S3, Google Cloud, Microsoft Azure, SFTP.
Data enrichment available as an addition to the data points extracted: Avg. post engagement rate, brand affiliation and more.
Get the full [Instagram dataset](https://brightdata.com/products/datasets/instagram).
Additional Instagram datasets available:
* 635,000,000 "Instagram profiles dataset"
* 89,000,000 "Instagram posts dataset"
* 12,490,000 "Instagram reels dataset"
* 206,000 "Instagram comments dataset"

## Free access to web scraping tools and datasets for academic researchers and NGOs
The Bright Initiative offers access to Bright Data's [Web Scraper APIs](https://brightdata.com/products/web-scraper) to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application [here](https://brightinitiative.com).