Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ersinaksar/dockerized-firefox-for-web-scraping
A repository containing a Dockerfile and sample Python code to run Firefox in a Docker container for web scraping.
docker-container docker-image dockerfile driver firefox python python-script scraper scraping-websites selenium selenium-webdriver web-driver webdriver webscraping
- Host: GitHub
- URL: https://github.com/ersinaksar/dockerized-firefox-for-web-scraping
- Owner: ersinaksar
- Created: 2023-03-12T21:35:21.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-09-08T13:35:12.000Z (5 months ago)
- Last Synced: 2024-09-08T15:07:30.713Z (5 months ago)
- Topics: docker-container, docker-image, dockerfile, driver, firefox, python, python-script, scraper, scraping-websites, selenium, selenium-webdriver, web-driver, webdriver, webscraping
- Language: Python
- Homepage:
- Size: 9.77 KB
- Stars: 5
- Watchers: 1
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Dockerized Firefox for Web Scraping
*Read this in other languages: [English](README.md), [Turkish](README.tr.md).*
A repository containing a Dockerfile and sample Python code to run Firefox in a Docker container for web scraping.
This Dockerized Python script uses Selenium and Firefox to fetch the page source of a given URL. Firefox runs without a physical display inside the container, using Xvfb as a virtual display.
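The flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: the function names (`xvfb_command`, `start_virtual_display`, `fetch_page_source`), the display number `:99`, and the screen resolution are all assumptions made for the example. It also assumes that Xvfb, Firefox, geckodriver, and the `selenium` package are installed in the container.

```python
import os
import shutil
import subprocess
import time


def xvfb_command(display=":99", resolution="1920x1080x24"):
    """Build the Xvfb command line (display and resolution are assumed values)."""
    return ["Xvfb", display, "-screen", "0", resolution]


def start_virtual_display(display=":99", resolution="1920x1080x24"):
    """Start Xvfb in the background and point DISPLAY at it.

    Returns the Popen handle so the caller can terminate Xvfb later.
    """
    if shutil.which("Xvfb") is None:
        raise RuntimeError("Xvfb was not found on PATH")
    proc = subprocess.Popen(
        xvfb_command(display, resolution),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    time.sleep(1)  # crude wait so the X server is up before Firefox starts
    os.environ["DISPLAY"] = display
    return proc


def fetch_page_source(url):
    """Open the URL in Firefox through Selenium and return the page source."""
    # Imported lazily so the helpers above work even without selenium installed.
    from selenium import webdriver

    driver = webdriver.Firefox()
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()  # always close the browser, even if loading fails
```

Inside the container, one would call `start_virtual_display()` once, then `print(fetch_page_source("https://example.com"))`, and finally call `terminate()` on the returned Xvfb process.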
Features
Here are some of the project's best features:
* Scrapes the page source of a given URL using Firefox and Selenium.
* Runs in headless mode using Xvfb.

Installation Steps:
1. Install Docker on your system.
2. Clone the repository.
3. Build the Docker image using the command:
```
sudo docker build -t my-firefox .
```
4. Run the Docker container using the command:
```
sudo docker run my-firefox
```
5. The page source of the URL will be printed to the console.
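If the build and run steps need to be driven from Python, for example to capture the printed page source in a variable, a sketch along these lines may help. The helper names (`docker_build_command`, `docker_run_command`, `build_and_run`) and the `use_sudo` switch are hypothetical and not part of the repository. The image tag `my-firefox` and the two Docker commands are taken from the steps above.

```python
import subprocess


def docker_build_command(tag="my-firefox", context="."):
    """Return the argument list for `docker build -t <tag> <context>`."""
    return ["docker", "build", "-t", tag, context]


def docker_run_command(tag="my-firefox"):
    """Return the argument list for `docker run <tag>`."""
    return ["docker", "run", tag]


def build_and_run(tag="my-firefox", use_sudo=True):
    """Build the image, run the container, and return what it printed."""
    prefix = ["sudo"] if use_sudo else []
    # check=True raises CalledProcessError if either Docker command fails.
    subprocess.run(prefix + docker_build_command(tag), check=True)
    result = subprocess.run(
        prefix + docker_run_command(tag),
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout  # the page source printed by the container
```

Calling `build_and_run()` from the cloned repository directory would then return the page source as a string instead of only printing it to the console.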
Contribution Guidelines:
Contributions to the project are welcome. If you find a bug or have a suggestion for a new feature, please open an issue or a pull request.
Built with
Technologies used in the project:
* Python
* Docker
* Selenium
* Firefox
* Xvfb

Like my work?
If you have any questions or need help with the project, please feel free to open an issue on GitHub.
- :beer: Support me on **[Patreon]**
- :coffee: Support me on **[buymeacoffee]**

[buymeacoffee]: https://www.buymeacoffee.com/in/ersinaksar "buymeacoffee"
[Patreon]: https://www.buymeacoffee.com/ersinaksar "Patreon"