https://github.com/20101301-alina-hasan/google-autosuggestion-scraper

Just as the name suggests! This scraper uses the Python libraries Selenium, openpyxl, and Pandas, with Firefox as its default browser.

firefox-browser geckodriver google-autosuggestion-scraper jupyter-notebook openxyl python selenium


# Google Autosuggestion Scraper

## Overview
This repository contains a Python script for scraping Google Autosuggestions.

**Author:** Alina Hasan

## Prerequisites

- **Python**: Ensure you have Python installed on your system.
- **Firefox Browser**: The script uses Firefox as the browser for automation.
- **Geckodriver**: The script requires `geckodriver` to interface with Firefox. Ensure `geckodriver` is located in the same directory as the script or is in your system PATH.
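The `geckodriver` requirement above can be checked before running anything. The following is a minimal sketch, not part of the repository: the helper name `find_geckodriver` is hypothetical, and it assumes the Windows binary is named `geckodriver.exe` and the binary on other platforms is named `geckodriver`.

```python
import shutil
import sys
from pathlib import Path
from typing import Optional


def find_geckodriver(script_dir: Path) -> Optional[str]:
    """Return a path to geckodriver from script_dir or PATH, or None if absent.

    Hypothetical helper for illustration; the repository's script may locate
    the driver differently.
    """
    # Assumption: Windows builds are named geckodriver.exe, others geckodriver.
    name = "geckodriver.exe" if sys.platform.startswith("win") else "geckodriver"

    # First look next to the script, as the prerequisites suggest.
    local = script_dir / name
    if local.is_file():
        return str(local)

    # Otherwise fall back to a PATH lookup.
    return shutil.which(name)


if __name__ == "__main__":
    found = find_geckodriver(Path.cwd())
    print(found or "geckodriver not found next to the script or on PATH")
```

If the function returns `None`, place the driver beside the script or add its folder to PATH before continuing.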

## Libraries

- **Selenium**: For web automation and scraping.
- **openpyxl**: For handling Excel files.
- **Pandas**: For handling tabular data and performing operations on Excel sheets.
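To confirm that these three libraries are importable in the current environment, something like the sketch below may help. It uses only the standard library; the function name `missing_packages` is hypothetical and not taken from the repository.

```python
from importlib.util import find_spec
from typing import Iterable, List

# The three libraries listed above, by their import names.
REQUIRED = ("selenium", "openpyxl", "pandas")


def missing_packages(names: Iterable[str] = REQUIRED) -> List[str]:
    """Return the names that cannot be found by the import system."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

Any name reported as missing can be installed with pip.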

## Setup

1. **Install Dependencies**: Install the required Python libraries using pip:
```bash
pip install selenium openpyxl pandas
```

2. **Download Geckodriver**:
- Download the appropriate version of geckodriver for your operating system from [Geckodriver Releases](https://sourceforge.net/projects/geckodriver.mirror/).

*A [geckodriver release](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/tree/0319d4374cc121e51ffea0b689ef56ae293f4a92/Setup) is available in the 'Setup' folder of this repository.*
- Place geckodriver in the same directory as the script or add it to your system PATH.

3. **Firefox Installation**:
- Ensure Firefox is installed on your system.

*A [Firefox Installer](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/tree/0319d4374cc121e51ffea0b689ef56ae293f4a92/Setup) is available in the 'Setup' folder of this repository.*

4. **Run Script**:

- **Either**, open the provided [Jupyter Notebook file](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/blob/ae53400528ec1f394e6f16f2f8adc0c391182aba/4BeatsQ1_Solution.ipynb) in a Jupyter environment to run the script. The notebook demonstrates the functionality and provides instructions for use.

- **Or**, run the [Python file](4BeatsQ1_Solution.py) in a Python environment.
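For orientation, here is a rough sketch of how Selenium, Firefox, and geckodriver might fit together when collecting suggestions. It is not the repository's script. The function names, the CSS selector for the suggestion list, and the use of headless mode are all assumptions made for illustration; Google's markup changes over time and a consent page may appear first, so the selector will likely need adjusting.

```python
from typing import Iterable, List, Optional


def clean_suggestions(raw: Iterable[str]) -> List[str]:
    """Strip whitespace, drop blanks, and de-duplicate while keeping order."""
    seen = set()
    cleaned = []
    for text in raw:
        text = text.strip()
        if text and text not in seen:
            seen.add(text)
            cleaned.append(text)
    return cleaned


def fetch_suggestions(keyword: str, geckodriver_path: Optional[str] = None) -> List[str]:
    """Type a keyword into Google's search box and return the suggestion texts."""
    # Imported here so clean_suggestions stays usable without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.firefox.options import Options
    from selenium.webdriver.firefox.service import Service
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    options = Options()
    options.add_argument("-headless")  # assumption: no visible window needed

    # Use an explicit driver path if given, otherwise let Selenium look it up.
    service = Service(executable_path=geckodriver_path) if geckodriver_path else Service()
    driver = webdriver.Firefox(service=service, options=options)
    try:
        driver.get("https://www.google.com")
        box = driver.find_element(By.NAME, "q")
        box.send_keys(keyword)

        # ASSUMPTION: suggestions render as list items inside a listbox.
        items = WebDriverWait(driver, 10).until(
            EC.presence_of_all_elements_located(
                (By.CSS_SELECTOR, "ul[role='listbox'] li")
            )
        )
        return clean_suggestions(item.text for item in items)
    finally:
        # Always close the browser, even if the lookup fails.
        driver.quit()
```

Calling `fetch_suggestions("python selenium")` would open Firefox in the background and return a list of strings, provided Firefox and geckodriver are installed as described in the steps above.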

## Sample

*A [Sample Output](https://github.com/20101301-Alina-Hasan/Google-Autosuggestion-Scraper/blob/131cd0b2fee6d5ab10569d1cb81dad33a8116a42/Sample%20Output/Sample_Output.xlsx) is provided in the 'Sample Output' folder.*