Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sarthak-0-sach/selenium_web-scraping_project

Automation and web scraping of Instagram just became easier!! This repository is a collection of notebooks related to different purposes in process of Instagram web scraping and automation with well documented code
https://github.com/sarthak-0-sach/selenium_web-scraping_project

automation-testing instagram-scraping-bot jupyter-notebook python selenium-python selenium-webdriver web-scraping-python

Last synced: about 1 month ago
JSON representation

Automation and web scraping of Instagram just became easier!! This repository is a collection of notebooks related to different purposes in process of Instagram web scraping and automation with well documented code

Awesome Lists containing this project

README

        

# WebscrapingInstagram with Selenium


This Repository contains a collection of notebooks related to Instagram web scraping and automation.




## CURRENT VERSION (THUMBNAIL EXTRACTION)
Please refer to WebscrapingInstagram_completeUpdated_DEC2022.ipynb generated and tested on December 22th, 2022.


This file showcases the updated Selenium commands, which have changed from the moment of filming my YouTube tutorial and now.


FYI, in the new version of Selenium, commands of this syntax:

```driver.find_elements_by_tag_name("input")```

were replaced with commands of that syntax:

```driver.find_elements(By.TAG_NAME, "input")```

## CURRENT VERSION (IMAGE EXTRACTION)

Please refer to ImageExtracting_Updated-DEC2022.ipynb generated and tested on December 22th, 2022.


This file includes new Selenium syntax, Fixes to scrolling issues and a more efficient keyword search.

## OLD VERSIONS

PLEASE NOTE: the notebooks below were not updated to the current Selenium syntax!!!

- WebscrapingInstagram_completeNotebook:
contained 90% automated code for extracting Instagram Thumbnails


it was working great 2 years ago, now it must be adjusted to the new Selenium syntax.

- WebscrapingInstagram_starterNotebook:
contains the starter files for the Python Simplified tutorial on Youtube:


https://youtu.be/iJGvYBH9mcY

- ImageExtracting_bot:
contains a 100% automated code for extracting Instagram Images


as well as ERROR FIXES and WIDER FUNCTIONALITY


must be adjusted to the new Selenium syntax.

- Commenting_bot:
contains a 100% automated code for commenting on all photos from a certain hashtag presented live with We Are Growth Hackers:


https://youtu.be/XnEgVZsZgco