Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/hehpollon/Instagram-Crawler

Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags
https://github.com/hehpollon/Instagram-Crawler

Last synced: about 2 months ago
JSON representation

Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags

Awesome Lists containing this project

README

        

# Instagram-Crawler
Non API. Crawling post (photo, likes, comments, date ...) by username, hashtags

## Installation
1. Make sure you have Chrome browser installed.
2. Download [chromedriver](https://sites.google.com/a/chromium.org/chromedriver/) and put it into driver folder: `./driver/chromedriver`
3. Install requirements `pip install -r requirements.txt`

## Examples:
> Results: under the ./data folder

Download the first 10 photos and information from username "instagram"
```
$ python3 crawl.py -q 'instagram' -n 10
```
Download the first 7 photos and information(all comments) from hashtags #hello, #hi
```
$ python3 crawl.py -q '#hello, #hi' --a -n 7
```
###### you can enter multiple username or hashtags by separating them with commas
### Example of a files data
```
likes:
5,326

comments:
923

caption:
Art of @kendricklamar by @illestration
Bold, bright and colorful.

commentMessages:
tttt: Amazing 😉
this_is_t.rs: my name says it all

dateTime:
2018-05-30T19:42:03.000Z
```
###### photo of post will download in ./data folder

## Usage
```
Usage:
crawl.py [-q QUERY] [-n NUMBER] [--a] [-h HELP]

Options:
-q QUERY username, add '#' to search for hashtags, e.g. 'username' or '#hashtag'
For multiple query seperate with comma, e.g. 'username1, username2, #hashtag'

-n NUM number of returned posts [default: 10000]

--a collect all comments

-h HELP show this help message and exit
```

## Contributing

This is an open source project so feel free to contribute by

- Opening an [issue](https://github.com/hehpollon/Instagram-Crawler/issues/new)
- Sending me feedback via [email](mailto://[email protected])