Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/kimyk0120/instagram_scraper
This project is an Instagram scraper that uses Selenium to extract data directly from Instagram without relying on its API. It also provides a Streamlit-powered dashboard for visualizing the scraped data.
instagram python selenium streamlit
- Host: GitHub
- URL: https://github.com/kimyk0120/instagram_scraper
- Owner: kimyk0120
- Created: 2025-01-15T06:38:30.000Z (about 1 month ago)
- Default Branch: main
- Last Pushed: 2025-02-08T06:22:18.000Z (6 days ago)
- Last Synced: 2025-02-08T07:23:33.315Z (6 days ago)
- Topics: instagram, python, selenium, streamlit
- Language: Python
- Homepage:
- Size: 85 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
[//]: # ([![forthebadge](https://forthebadge.com/images/badges/open-source.svg)](https://forthebadge.com))
[![forthebadge](https://forthebadge.com/images/badges/built-with-love.svg)](https://forthebadge.com)
[![pythonbadge](https://forthebadge.com/images/badges/made-with-python.svg)](https://forthebadge.com)

Instagram Scraper
=======================
This tool is designed to scrape data from Instagram. It utilizes the Selenium library to perform web scraping and extract information directly from Instagram without relying on the Instagram API.
Additionally, it provides a Streamlit-powered dashboard for visualizing the scraped data in a user-friendly interface.

![main.png](img/main.png)
## Prerequisite
- Python >= 3.10

## Installation
1. Clone this repository
2. Move to the project directory
3. Install the required Python packages:
```bash
pip install -r requirements.txt
```

## How to Use:
To use this script, follow these steps:
### 1. Set up Configuration (config.ini)
- Configure the necessary settings in the {project}/config/config.ini file.
- Set "timout_sec" (the timeout, in seconds) so that page loads cannot hang indefinitely.
- Set the maximum number of videos to scrape with the "video_limit_cnt" setting.
- If you need a proxy, put its address in "proxy_server".
- For Instagram, put your account information in the INSTAGRAM section.

### 2. Run the script with Python:
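Before running, it may help to confirm that the settings from step 1 parse as expected. The following is a minimal sketch using Python's standard `configparser`, not the project's own loading code. Only the option names `timout_sec`, `video_limit_cnt`, `proxy_server` and the `INSTAGRAM` section come from this README; the `[SETTINGS]` section name, the `username`/`password` keys, and the `load_settings` helper are assumptions made for illustration.

```python
import configparser

# Hypothetical config.ini contents. The [SETTINGS] section name and the
# username/password keys are assumptions; check the real config.ini for
# the actual layout.
SAMPLE_INI = """
[SETTINGS]
timout_sec = 30
video_limit_cnt = 10
proxy_server =

[INSTAGRAM]
username = your_account
password = your_password
"""


def load_settings(ini_text: str) -> dict:
    """Parse INI text and return the values mentioned in this README."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    general = parser["SETTINGS"]  # assumed section name
    return {
        "timeout_sec": general.getint("timout_sec"),
        "video_limit_cnt": general.getint("video_limit_cnt"),
        # An empty proxy_server value is treated as "no proxy".
        "proxy_server": general.get("proxy_server") or None,
        "has_account": parser.has_section("INSTAGRAM"),
    }


settings = load_settings(SAMPLE_INI)
print(settings)
```

To check a real file instead of the sample string, `parser.read("config/config.ini")` could replace `read_string`, assuming the file follows the same layout.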
```bash
python main.py --keyword "Search Keyword" --output
# If you want to include the dashboard additionally:
python main.py --keyword "Search Keyword" --output --dashboard
```

## Caution
- If you use a proxy, you may be required to provide an authentication code after logging in. This code will typically be sent via email or phone.
- There is a risk of the account being blocked.
- As a result, this script may not be suitable for full automation and is better suited to one-off runs.

## Update
- 2025.01.15 : Initial implementation started.

## Contact
For any feedback or queries, please reach out to me at [[email protected]]([email protected]).
[![Buy Me A Coffee](https://img.buymeacoffee.com/button-api/?slug=zubdata&button_colour=FFDD00&font_colour=000000&font_family=Lato&outline_colour=000000&coffee_colour=ffffff)](https://www.buymeacoffee.com/kimyk0120)