Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Syn3rman/instaScrape
Scrape n latest posts from a user's instagram profile
https://github.com/Syn3rman/instaScrape
docker go instagram instagram-scraper multiprocessing node-js puppeteer python3
Last synced: 2 months ago
JSON representation
Scrape n latest posts from a user's instagram profile
- Host: GitHub
- URL: https://github.com/Syn3rman/instaScrape
- Owner: Syn3rman
- Created: 2019-08-02T08:47:10.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2022-12-11T00:57:14.000Z (about 2 years ago)
- Last Synced: 2024-08-03T23:29:21.464Z (6 months ago)
- Topics: docker, go, instagram, instagram-scraper, multiprocessing, node-js, puppeteer, python3
- Language: JavaScript
- Size: 1.88 MB
- Stars: 3
- Watchers: 1
- Forks: 1
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-golang-repositories - instaScrape
README
InstaScrape
[![made-with-puppeteer](https://img.shields.io/badge/made%20with-puppeteer-red)](https://pptr.dev/)
![GitHub language count](https://img.shields.io/github/languages/count/Syn3rman/instaScrape)
![GitHub top language](https://img.shields.io/github/languages/top/Syn3rman/instaScrape?color=green)------------------------------------------
> Uses headless chrome to scrape specified number of images from instagram for a particular user. Also provides blazing fast download scripts to download the images to your system by leveraging [python's multiprocessing](./download_scripts/dwn.py) and [go routines](download_scripts/main.go).
------------------------------------------
### Features
- [x] Support public profiles
- [x] Support private profiles
- [x] Run with docker
- [x] Go and python scripts to download images### To-do's
- [ ] Implement image download in rust to compare performance.
- [ ] Try using pyO3 to integrate rust and python to see if there is a significant boost in performance.### Demo
Try it out
### Set up locally using git
```
$ git clone https://github.com/Syn3rman/instaScrape.git && cd instaScrape$ npm install
$ node run.js
```### Or using docker:
```
$ docker pull syn3rman/instascrape:latest$ docker run --rm -it syn3rman/instascrape
```Navigate to [localhost](http://localhost:8001/public?handle=9gag&limit=20) and change the get request parameters as required.
### Downloading images to filesystem
```
$ cd download_scriptsUsing python:
$ python3 dwn.pyUsing go:
$ go run main.go
```### Performance
With ~500 image url's, the go script takes around 6-7s to complete while python takes around 12-15s.