Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/omkarcloud/baidu-scraper
Baidu Scraper helps you scrape search results from Baidu.
https://github.com/omkarcloud/baidu-scraper
baidu baidu-api baidu-python-scraper baidu-scraper baidu-scraper-api baidu-scraper-github baidu-scraper-python baidu-scraping baidu-scrapper baidu-search-results baidu-search-results-python baidu-serp baidu-serp-api baidu-serpapi scrape-baidu scrapebaidu scraping-web serp-api serp-scraper serpapi
Last synced: 3 days ago
JSON representation
Baidu Scraper helps you scrape search results from Baidu.
- Host: GitHub
- URL: https://github.com/omkarcloud/baidu-scraper
- Owner: omkarcloud
- License: mit
- Created: 2023-12-30T06:44:37.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2024-01-24T14:32:49.000Z (12 months ago)
- Last Synced: 2024-12-30T03:44:29.365Z (6 days ago)
- Language: Python
- Homepage: https://www.omkar.cloud/
- Size: 2.58 MB
- Stars: 2
- Watchers: 0
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
![Baidu Scraper Featured Image](https://raw.githubusercontent.com/omkarcloud/baidu-scraper/master/images/baidu-scraper-featured-image.png)
✨ Baidu Scraper 🚀
💦 Baidu Scraper helps you scrape search results from Baidu. 💦
(Programming Language - Python 3)
---## Disclaimer for Baidu Scraper Project
> By using Baidu Scraper, you agree to comply with all applicable local and international laws related to data scraping, copyright, and privacy. The developers of Baidu Scraper will not be held liable for any misuse of this software. It is the user's sole responsibility to ensure adherence to all relevant laws regarding data scraping, copyright, and privacy, and to use Baidu Scraper in an ethical and legal manner, in line with both local and international regulations.
We take concerns related to the Baidu Scraper Project very seriously. If you have any inquiries or issues, please contact Chetan Jain at [[email protected]](mailto:[email protected]). We will take prompt and necessary action in response to your emails.
## 👉 Explore Our Other Awesome Products
- ✅ [Botasaurus](https://github.com/omkarcloud/botasaurus): The All-in-One Web Scraping Framework with Anti-Detection, Parallelization, Asynchronous, and Caching Superpowers.
---
Baidu Scraper helps you scrape search results from Baidu.
## 🚀 Getting Started
1️⃣ **Clone the Magic 🧙♀:**
```shell
git clone https://github.com/omkarcloud/baidu-scraper
cd baidu-scraper
```
2️⃣ **Install Dependencies 📦:**
```shell
python -m pip install -r requirements.txt
```
3️⃣ **Let the Scraping Begin 😎**:
```shell
python main.py
```Find your data in the `output` directory.
![Baidu Scraper CSV Result](https://raw.githubusercontent.com/omkarcloud/baidu-scraper/master/images/baidu-scraper-csv-result.png)
*Note: If you don't have Python installed. Follow this Simple FAQ [here](https://github.com/omkarcloud/baidu-scraper/blob/master/advanced.md#-i-dont-have-python-installed-how-can-i-run-the-scraper) and you will have your Baidu data in next 5 Minutes*
## 🤔 FAQs
### ❓ How to Scrape Baidu?1. Open the `main.py` file.
2. Update the `queries` list with the queries you are interested in. For example:```python
queries = [
"Mango",
"Watermelon",
]Baidu.search(queries, max=10)
```3. Run it.
```bash
python main.py
```Then find your data in the `output` directory.
### ❓ How to Scrape More Baidu Search Results Using Your Baidu API?
To scrape additional data, follow these steps to use our Baidu SERP API. You can make 50 Search requests for free:
1. Sign up on RapidAPI by visiting [this link](https://rapidapi.com/auth/sign-up).
![Sign Up on RapidAPI](https://raw.githubusercontent.com/omkarcloud/assets/master/images/sign-up.png)
2. Then, subscribe to our Free Plan by visiting [this link](https://rapidapi.com/Chetan11dev/api/baidu-scraper/pricing).
![Subscribe to Free Plan](https://raw.githubusercontent.com/omkarcloud/assets/master/images/free-subscription.png)
3. Now, copy the API key.
![Copy the API Key](https://raw.githubusercontent.com/omkarcloud/assets/master/images/api-key.png)
4. Use it in the scraper as follows:
```python
Baidu.search("Orange", max=10, key="YOUR_API_KEY")
```5. Run the script, and you'll find your data in the `output` folder.
```bash
python main.py
```The first 50 requests are free. After that, you can upgrade to the Pro Plan, which will get you 1000 search requests for just $9.
### ❓ How did you build it?
We used Botasaurus, It's an All-in-One Web Scraping Framework with Anti-Detection, Parallelization, Asynchronous, and Caching Superpowers.
Botasaurus helped us cut down the development time by 50% and helped us focus only on the core extraction logic of the scraper.
If you are a Web Scraper, you should learn about Botasaurus [here](https://github.com/omkarcloud/botasaurus), because Botasaurus will save you countless hours in your life as a Web Scraper.
### ❓ Need More Help or Have Additional Questions?
For further help, contact us on WhatsApp. We'll be happy to help you out.
[![Contact Us on WhatsApp about Baidu Scraper](https://raw.githubusercontent.com/omkarcloud/assets/master/images/whatsapp-us.png)](https://api.whatsapp.com/send?phone=918295042963&text=Hi,%20I%20would%20like%20to%20learn%20more%20about%20your%20products.)
## Love It? [Star It! ⭐](https://github.com/omkarcloud/baidu-scraper/stargazers)
## Made with ❤️ using [Botasaurus Web Scraping Framework](https://github.com/omkarcloud/botasaurus)