Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/20101301-alina-hasan/google-autosuggestion-scraper
Just as the name suggests! This scraper utilizes Python Selenium, openpyxl, Pandas libraries and Firefox as its default browser.
https://github.com/20101301-alina-hasan/google-autosuggestion-scraper
firefox-browser geckodriver google-autosuggestion-scraper jupyter-notebook openxyl python selenium
Last synced: 8 days ago
JSON representation
Just as the name suggests! This scraper utilizes Python Selenium, openpyxl, Pandas libraries and Firefox as its default browser.
- Host: GitHub
- URL: https://github.com/20101301-alina-hasan/google-autosuggestion-scraper
- Owner: 20101301-Alina-Hasan
- Created: 2024-08-30T00:07:49.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-09-10T03:54:28.000Z (5 months ago)
- Last Synced: 2024-10-11T18:22:18.297Z (4 months ago)
- Topics: firefox-browser, geckodriver, google-autosuggestion-scraper, jupyter-notebook, openxyl, python, selenium
- Language: Jupyter Notebook
- Homepage:
- Size: 1.98 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Google Autosuggestion Scraper
## Overview
This repository contains a Python script for scraping Google Autosuggestions.**Author:** Alina Hasan
## Prerequisites
- **Python**: Ensure you have Python installed on your system.
- **Firefox Browser**: The script uses Firefox as the browser for automation.
- **Geckodriver**: The script requires `geckodriver` to interface with Firefox. Ensure `geckodriver` is located in the same directory as the script or is in your system PATH.## Libraries
- **Selenium**: For web automation and scraping.
- **openpyxl**: For handling Excel files.
- **Pandas**: For handling tabular data and performing operations on Excel sheets.## Setup
1. **Install Dependencies**: Install the required Python libraries using pip:
```bash
pip install selenium openpyxl pandas
```
2. **Download Geckodriver**:
- Download the appropriate version of geckodriver for your operating system from [Geckodriver Releases](https://sourceforge.net/projects/geckodriver.mirror/).*A [Gecko release](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/tree/0319d4374cc121e51ffea0b689ef56ae293f4a92/Setup) is available in the 'Setup' folder of this repository.*
- Place geckodriver in the same directory as the script or add it to your system PATH.3. **Firefox Installation**:
- Ensure Firefox is installed on your system.*A [Firefox Installer](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/tree/0319d4374cc121e51ffea0b689ef56ae293f4a92/Setup) is available in the 'Setup' folder of this repository.*
4. **Run Script**:
- **Either**, open the provided [Jupyter Notebook file](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/blob/ae53400528ec1f394e6f16f2f8adc0c391182aba/4BeatsQ1_Solution.ipynb) in a Jupyter environment to run the script. The notebook demonstrates the functionality and provides instructions for use.
- **Or**, run the [Python file](4BeatsQ1_Solution.py) in a Python environment.
## Sample
*A [Sample Output](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/blob/131cd0b2fee6d5ab10569d1cb81dad33a8116a42/Sample%20Output/Sample_Output.xlsx) is provided in the 'Sample Output' folder.*