{"id":31980112,"url":"https://github.com/instagram-automations/scrape-data-from-instagram","last_synced_at":"2025-10-14T23:27:21.201Z","repository":{"id":318739667,"uuid":"1073874662","full_name":"Instagram-Automations/scrape-data-from-instagram","owner":"Instagram-Automations","description":"scrape data from instagram and automation toolkit","archived":false,"fork":false,"pushed_at":"2025-10-10T19:13:10.000Z","size":2929,"stargazers_count":0,"open_issues_count":2,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-10-14T19:04:42.198Z","etag":null,"topics":["api","automation","bot","data","doker","instagram","nodejs","playwright","procy","scrape","selenium","toolkit"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Instagram-Automations.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-10-10T18:55:24.000Z","updated_at":"2025-10-10T19:15:34.000Z","dependencies_parsed_at":"2025-10-14T19:04:46.716Z","dependency_job_id":"2c418a0a-3be7-4f0d-8c41-2e0c0ba38eee","html_url":"https://github.com/Instagram-Automations/scrape-data-from-instagram","commit_stats":null,"previous_names":["instagram-automations/scrape-data-from-instagram"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/Instagram-Automations/scrape-data-from-instagram","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Instagram-Automations%2Fscrape-data-from-instagram","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Instagram-Automations%2Fscrape-data-from-instagram/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Instagram-Automations%2Fscrape-data-from-instagram/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Instagram-Automations%2Fscrape-data-from-instagram/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Instagram-Automations","download_url":"https://codeload.github.com/Instagram-Automations/scrape-data-from-instagram/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Instagram-Automations%2Fscrape-data-from-instagram/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279024800,"owners_count":26087837,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-14T02:00:06.444Z","response_time":60,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api","automation","bot","data","doker","instagram","nodejs","playwright","procy","scrape","selenium","toolkit"],"created_at":"2025-10-14T23:27:16.316Z","updated_at":"2025-10-14T23:27:21.196Z","avatar_url":"https://github.com/Instagram-Automations.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# scrape data from instagram\n\nA ready-to-use boilerplate for building safe, scalable pipelines to **scrape data from Instagram** with rotating proxies, rate-limit guards, and multi-run orchestration. Perfect for agencies, researchers, and growth teams who need structured exports without the headaches.\n\n\u003cp align=\"center\"\u003e\n  \u003ca href=\"https://t.me/devpilot1\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/Chat%20on-Telegram-2CA5E0?style=for-the-badge\u0026logo=telegram\u0026logoColor=white\" alt=\"Telegram\"\u003e\n  \u003c/a\u003e\n  \u003ca href=\"https://discord.gg/vBu9huKBvy\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/Join-Discord-5865F2?style=for-the-badge\u0026logo=discord\u0026logoColor=white\" alt=\"Discord\"\u003e\n  \u003c/a\u003e\n  \u003ca href=\"https://wa.me/447723343390?text=Hi%20Zeeshan%2C%20I%27m%20interested%20in%20automation.\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/Chat-WhatsApp-25D366?style=for-the-badge\u0026logo=whatsapp\u0026logoColor=white\" alt=\"WhatsApp\"\u003e\n  \u003c/a\u003e\n  \u003ca href=\"mailto:support@appilot.app\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/badge/Email-support@appilot.app-EA4335?style=for-the-badge\u0026logo=gmail\u0026logoColor=white\" alt=\"Gmail\"\u003e\n  \u003c/a\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cstrong\u003eFor discussion, queries, and freelance work — reach out 👆\u003c/strong\u003e\n\u003c/p\u003e\n\n\n---\n\n##  Introduction\n\u003e A developer-friendly template to collect public Instagram data (profiles, posts, comments, followers) with modular drivers (Playwright/Selenium or headless API wrappers), resilience against blocks, and structured JSON/CSV exports. Built for teams who value compliance-aware, rate-limited scraping.\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"scrape-data-from-instagram.png\" alt=\"scrape-data-from-instagram.png\" width=\"80%\"\u003e\n\u003c/p\u003e\n\n###  Key Benefits\n1. Saves time and automates setup.  \n2. Scalable for multiple use cases.  \n3. Safer with anti-detect and proxy logic.  \n\n---\n\n## Features must be in table \n\n| Feature | Description |\n|---|---|\n| Configurable Drivers | Choose Playwright or Selenium with stealth options. |\n| Proxy \u0026 Rotation | Supports residential/mobile proxies with per-task rotation. |\n| Rate-Limit Guard | Backoff + jitter + human-like delays to reduce blocks. |\n| Data Pipelines | Export to JSON/CSV/SQLite; schema-first mapping. |\n| Session Vault | Persist cookies/sessions; auto-refresh flows. |\n\n---\n\n##  Use Cases\n- Competitive research and market analysis  \n- Creator/brand discovery and lead enrichment  \n- Social listening and hashtag trend tracking  \n- Content cataloging and performance benchmarking  \n\n---\n\n##  FAQs\n\n**Q:** How do you protect from scraping?  \n**A:** This repo includes layered protections: request pacing with randomized backoff, user-agent and viewport variance, proxy rotation per job, and session reuse to lower anomaly spikes. It also supports selective field fetching (only what you need) to minimize request volume and exposure.\n\n**Q:** Can screen scraping be detected?  \n**A:** Yes. Platforms flag patterns like high-frequency requests, identical fingerprints, and repeated navigation flows. Mitigation includes human-like timings, realistic mouse/scroll events (in browser mode), diversified fingerprints, and strict concurrency caps.\n\n**Q:** What data can you scrape from Instagram?  \n**A:** Publicly available items such as profile metadata (bio, external URL, followers/following counts), public posts (captions, media URLs, like/comment counts, timestamps), comments (text, author, time), and hashtag/top-post summaries. Private or gated data is out of scope.\n\n---\n\n## Results\n----------------------------------- \n\u003e 10x faster posting schedules  \n\u003e 80% engagement increase on group campaigns  \n\u003e Fully automated lead response system  \n\n##  Performance Metrics\n-----------------------------------\nAverage Performance Benchmarks:  \n- **Speed:** 2x faster than manual posting  \n- **Stability:** 99.2% uptime  \n- **Ban Rate:** \u003c0.5% with safe automation mode  \n- **Throughput:** 100+ posts/hour per session\n\n---\n\n##Do you have a customize project for us ?\nContact Us\n\n\u003cdiv align=\"center\"\u003e\n  \u003ca href=\"https://mail.google.com/mail/u/?authuser=ahmadzee26@gmail.com\"\u003e\n    \u003cimg alt=\"Gmail\" width=\"30px\" src=\"https://edent.github.io/SuperTinyIcons/images/svg/gmail.svg\" /\u003e\n    \u003ccode\u003esupport@appilot.app\u003c/code\u003e\n  \u003c/a\u003e\n  \u003cspan\u003e ┃ \u003c/span\u003e\n  \u003ca href=\"https://t.me/devpilot1\"\u003e\n    \u003cimg alt=\"Telegram\" width=\"30px\" src=\"https://edent.github.io/SuperTinyIcons/images/svg/telegram.svg\" /\u003e\n    \u003ccode\u003epilot\u003c/code\u003e\n  \u003c/a\u003e\n  \u003cspan\u003e ┃ \u003c/span\u003e\n  \u003ca href=\"https://discord.com\"\u003e\n    \u003cimg alt=\"Discord\" width=\"30px\" src=\"https://github.com/Zeeshanahmad4/RealEstateMate-WhatsApp-Group-Management-Bot/blob/main/discord-icon-svgrepo-com.svg\" /\u003e\n    \u003ccode\u003ezee#2655\u003c/code\u003e\n  \u003c/a\u003e\n  \u003cspan\u003e ┃ \u003c/span\u003e\n  \u003ca href=\"https://wa.me/447723343390?text=Hi%20Zeeshan%2C%20I%27m%20interested%20in%20automation.\" target=\"_blank\"\u003e\n    \u003cimg alt=\"WhatsApp\" width=\"30px\" src=\"https://cdn.jsdelivr.net/npm/simple-icons@v11/icons/whatsapp.svg\" /\u003e\n    \u003ccode\u003ewhatsapp\u003c/code\u003e\n  \u003c/a\u003e\n  \u003cbr /\u003e\n\u003c/div\u003e\n\n---\n\n##  Installation\n\n###  Pre-requisites\n- Node.js or Python  \n- Git  \n- Docker (optional)  \n\n###  Steps\n```bash\n# Clone the repo\ngit clone https://github.com/yourusername/scrape-data-from-instagram.git\ncd scrape-data-from-instagram\n\n# Install dependencies\nnpm install\n# or\npip install -r requirements.txt\n\n# Setup environment\ncp .env.example .env\n\n# Run\nnpm start\n# or\npython main.py\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Finstagram-automations%2Fscrape-data-from-instagram","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Finstagram-automations%2Fscrape-data-from-instagram","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Finstagram-automations%2Fscrape-data-from-instagram/lists"}