https://github.com/nothingnothings/mlscraper
Mercado Livre Website Scraper Built with Python
docker mercado-livre python scraper scrapy sql
- Host: GitHub
- URL: https://github.com/nothingnothings/mlscraper
- Owner: nothingnothings
- License: mit
- Created: 2024-08-26T20:50:08.000Z (almost 2 years ago)
- Default Branch: master
- Last Pushed: 2024-10-03T15:18:58.000Z (over 1 year ago)
- Last Synced: 2025-02-14T17:30:03.313Z (over 1 year ago)
- Topics: docker, mercado-livre, python, scraper, scrapy, sql
- Language: Python
- Homepage:
- Size: 24.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# Mlscraper - A Web Scraper Built with Python
This script is an example of a web scraper built with Python.
## Introduction
[English](https://github.com/nothingnothings/mlscraper) | [Português (BR)](https://github.com/nothingnothings/mlscraper/blob/master/README.pt-br.md)
A Python script that extracts product data from the Mercado Livre website and stores it in a containerized SQL database.
The script uses Scrapy to scrape the website.
For more information about its usage, read the instructions below.
## Technologies Used
- **Python**: The primary programming language used.
- **Scrapy**: Framework that handles the scraping.
- **PyMySQL**: Library for connecting to and interacting with MySQL databases.
- **MySQL**: Database management system for storing product data.
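
As a rough illustration of how the PyMySQL and MySQL pieces could fit together, the sketch below builds a parameterized `INSERT` for one scraped item and runs it on a PyMySQL connection. It is not the repository's actual code: the `products` table name, the item fields, and the helper names `build_insert` and `save_product` are all hypothetical.

```python
from typing import Any, Mapping, Tuple

# Hypothetical table name; the repository's actual schema is not shown here.
TABLE = "products"


def build_insert(table: str, item: Mapping[str, Any]) -> Tuple[str, tuple]:
    """Build a parameterized INSERT statement for one scraped item."""
    columns = list(item)
    if not columns:
        raise ValueError("item has no fields")
    # Column and table names cannot be passed as parameters, so they must
    # come from trusted code, never from scraped text.
    column_list = ", ".join(f"`{name}`" for name in columns)
    # PyMySQL uses %s placeholders and escapes the values itself.
    placeholders = ", ".join(["%s"] * len(columns))
    sql = f"INSERT INTO `{table}` ({column_list}) VALUES ({placeholders})"
    return sql, tuple(item[name] for name in columns)


def save_product(connection, item: Mapping[str, Any], table: str = TABLE) -> None:
    """Insert one item using an open PyMySQL connection and commit."""
    sql, params = build_insert(table, item)
    with connection.cursor() as cursor:
        cursor.execute(sql, params)
    connection.commit()
```

With a connection returned by `pymysql.connect(...)`, a call such as `save_product(connection, {"title": "...", "price": 0})` would write one row. In a Scrapy project, logic like this usually lives in an item pipeline's `process_item` method.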
## Installation
1. Run `git clone https://github.com/nothingnothings/mlscraper` to clone the project to your local machine.
2. **Install Requirements**: Make sure you have Python installed and run:
```bash
pip install scrapy pymysql
```
3. The `docker-compose.yml` file in the `docker` directory defines a ready-to-use MySQL database. To start it, make sure Docker is installed and running, then run:
```bash
cd docker
docker-compose up -d
```
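
Because `docker-compose up -d` returns before the database has finished starting, it can help to wait for the port before running the scraper. The sketch below is one way to do that with the standard library only. It assumes the container publishes MySQL's default port 3306 on localhost, which this README does not confirm, and `wait_for_port` is a hypothetical helper name.

```python
import socket
import time


def wait_for_port(host: str = "127.0.0.1", port: int = 3306,
                  timeout: float = 30.0) -> bool:
    """Return True once a TCP connection to host:port succeeds, else False."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            # Connection refused or timed out; retry after a short pause.
            time.sleep(0.5)
    return False
```

An open port only shows that the server is listening. MySQL may still need a few more seconds before it accepts queries.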
## Usage
To run the scraper, execute the following commands, passing the search term as the value of the `s` argument:
```bash
cd mlscraper
scrapy crawl mlscraper -a s=
```
## Example
```bash
scrapy crawl mlscraper -a s=smartphone
```
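
Scrapy passes each `-a name=value` pair to the spider as a keyword argument, so `s=smartphone` reaches the spider as `s="smartphone"`. The sketch below shows one way such a value could be turned into a listing URL. The base URL and the hyphen-joined slug format are assumptions for illustration, and `build_search_url` is a hypothetical helper, not a function from this repository.

```python
from urllib.parse import quote

# Assumption: listing pages live under this host; the repository may differ.
BASE_URL = "https://lista.mercadolivre.com.br"


def build_search_url(s: str) -> str:
    """Turn the value given to `-a s=...` into a listing URL."""
    term = (s or "").strip()
    if not term:
        raise ValueError("a search term is required, e.g. -a s=smartphone")
    # Join words with hyphens, then percent-encode anything not URL-safe.
    slug = "-".join(term.split())
    return f"{BASE_URL}/{quote(slug)}"
```

A multi-word term has to be quoted on the command line, for example `-a s="smart tv"`, so that the shell passes it to Scrapy as a single value.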