Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/passivebot/facebook-marketplace-scraper
This repository contains a script to scrape Facebook Marketplace data using Playwright, BeautifulSoup and Streamlit.
https://github.com/passivebot/facebook-marketplace-scraper
database facebook facebook-marketing-automation facebook-marketplace playwright playwright-python python sqlite3 web-automation web-scraper web-scraping
Last synced: about 1 month ago
JSON representation
This repository contains a script to scrape Facebook Marketplace data using Playwright, BeautifulSoup and Streamlit.
- Host: GitHub
- URL: https://github.com/passivebot/facebook-marketplace-scraper
- Owner: passivebot
- Created: 2023-03-03T06:43:55.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-01-16T20:37:02.000Z (12 months ago)
- Last Synced: 2024-01-17T04:46:38.938Z (12 months ago)
- Topics: database, facebook, facebook-marketing-automation, facebook-marketplace, playwright, playwright-python, python, sqlite3, web-automation, web-scraper, web-scraping
- Language: Python
- Homepage:
- Size: 664 KB
- Stars: 99
- Watchers: 5
- Forks: 27
- Open Issues: 5
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# facebook-marketplace-scraper
An open-source Python program to scrape Facebook Marketplace using Playwright, BeautifulSoup, and FastAPI with a Streamlit GUI.
```diff
You use the software provided at your own risk. I cannot be held responsible for any potential consequences, including potential bans from Meta.
```
### Overview
This open-source program uses Python to scrape data from Facebook Marketplace. The program uses Playwright to navigate the Facebook Marketplace website and BeautifulSoup to parse the HTML and extract relevant data. It then displays the results in a Streamlit GUI.### Customization
This program can be customized to your personal/organizational needs. For more information, please get in touch with me at [email protected]
- Streamlit
- Playwright
- BeautifulSoup
### Language:
- [Python](https://www.python.org/)
### Flow diagrams:### Requirements:
- Python 3.x
- Playwright
- Streamlit
- BeautifulSoup
### Modules:
- Playwright for web crawling
- BeautifulSoup for HTML parsing
- FastAPI for API creation
- JSON for data formatting
- Uvicorn for running the server
### API:
- Root: Displays a welcome message
- Data scraping: Parameters include city, query, and max price
- IP information retrieval
### Implementation
- Browser automation and data scraping using Playwright
- HTML content parsing with BeautifulSoup
- Data returned in JSON format
- Application server run using Uvicorn### Features:
- List of supported cities for scraping.
- User inputs for city, search query, and maximum price.
- Submission button to start scraping.
- Display of scraping results including number of results, images, prices, locations, and item URLs.