Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/kimyk0120/instagram_scraper

This project is an Instagram scraper that uses Selenium to extract data directly from Instagram without relying on its API. It also provides a Streamlit-powered dashboard for visualizing the scraped data.
https://github.com/kimyk0120/instagram_scraper

instagram python selenium streamlit

Last synced: 2 days ago
JSON representation

Host: GitHub
URL: https://github.com/kimyk0120/instagram_scraper
Owner: kimyk0120
Created: 2025-01-15T06:38:30.000Z (about 1 month ago)
Default Branch: main
Last Pushed: 2025-02-08T06:22:18.000Z (6 days ago)
Last Synced: 2025-02-08T07:23:33.315Z (6 days ago)
Topics: instagram, python, selenium, streamlit
Language: Python
Homepage:
Size: 85 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        
[//]: # ([![forthebadge](https://forthebadge.com/images/badges/open-source.svg)](https://forthebadge.com))

[![forthebadge](https://forthebadge.com/images/badges/built-with-love.svg)](https://forthebadge.com)

[![pythonbadge](https://forthebadge.com/images/badges/made-with-python.svg)](https://forthebadge.com)

Instagram Scraper 

=======================

This tool is designed to scrape data from Instagram. It utilizes the Selenium library to perform web scraping and extract information directly from Instagram without relying on the Instagram API.

Additionally, it provides a Streamlit-powered dashboard for visualizing the scraped data in a user-friendly interface.

![main.png](img/main.png)

## Prerequisite

- python >= 3.10 

## Installation

1. Clone this repository

2. Move to the project directory

   

3. Install the required Python packages:

    ```bash

    pip install -r requirements.txt

   ```

## How to Use:

To use this script, follow these steps:

### 1. Set up Configuration (config.ini)

- You need to configure the necessary settings in the {project}/config/config.ini file.

- Set "timout_sec" appropriately to prevent infinite loading.

- Set the maximum number of video to scrape with the "video_limit_cnt" setting.

- If you need proxy settings, put them in "proxy_server"

- For instagram, you need to put account info in INSTAGRAM section

### 2. Run the script with Python:

  ```bash 

    python main.py --keyword "Search Keyword"  --output 

    # If you want to include the dashboard additionally:      

    python main.py --keyword "Search Keyword"  --output  --dashboard

  ```

## Caution

- If you use a proxy, you may be required to provide an authentication code after logging in. This code will typically be sent via email or phone.

- There is a risk of the account being blocked.

- As a result, this script may not be suitable for full automation and is better used for single transactions.

## Update

- 2025.01.15 : Initial implementation started.

## Contact

For any feedback or queries, please reach out to me at [[email protected]]([email protected]).

[![Buy Me A Coffee](https://img.buymeacoffee.com/button-api/?slug=zubdata&button_colour=FFDD00&font_colour=000000&font_family=Lato&outline_colour=000000&coffee_colour=ffffff)](https://www.buymeacoffee.com/kimyk0120)