https://github.com/oxylabs/craigslist-scraper

Gather public Craigslist data like jobs, items, services, and ads with Craigslist Scraper.

craigslist craigslist-api craigslist-javascript craigslist-scraper craigslist-search craigslist-web-scraper scrape-craigslist scraper scraping-craigslist


# Craigslist Scraper
[![Oxylabs promo code](https://user-images.githubusercontent.com/129506779/250792357-8289e25e-9c36-4dc0-a5e2-2706db797bb5.png)](https://oxylabs.go2cloud.org/aff_c?offer_id=7&aff_id=877&url_id=112)

[![](https://dcbadge.vercel.app/api/server/eWsVUJrnG5)](https://discord.gg/GbxmdGhZjq)

[Craigslist
Scraper](https://oxylabs.io/products/scraper-api/web/craigslist) is
a scraping tool that overcomes advanced anti-bot systems and helps you
gather public data from Craigslist at any scale. This guide
shows you how to scrape Craigslist using Oxylabs’ [Scraper
API](https://oxylabs.io/products/scraper-api).

## How it works

You can get Craigslist data by sending a request to our API with the
URLs you want to access and scrape. The API will return the HTML of any
public Craigslist page.

### Python code example

The code sample below sends a request to our service, which uses a
headless browser to execute JavaScript and sends back the HTML of a
Craigslist page:

```python
import requests
from pprint import pprint

# Structure payload.
payload = {
    'source': 'universal',
    'url': 'https://berlin.craigslist.org/search/ela#search=1~gallery~0~0',
    'geo_location': 'Germany',
    'render': 'html'
}

# Get a response.
response = requests.request(
    'POST',
    'https://realtime.oxylabs.io/v1/queries',
    auth=('USERNAME', 'PASSWORD'),  # Your credentials go here
    json=payload
)

# The realtime endpoint returns the JSON response with results directly,
# rather than a response with a job status and results URL.
pprint(response.json())
```

Visit the
[documentation](https://developers.oxylabs.io/scraper-apis/web-scraper-api)
to find more payload parameters and other details.
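
If you need to scrape several URLs, the request above can be wrapped in small helpers. The following is a minimal sketch, not an official client: the function names `build_payload` and `fetch` are hypothetical, the 180-second timeout is an assumed value, and only the payload parameters already shown above (`source`, `url`, `geo_location`, `render`) are used.

```python
def build_payload(url, geo_location=None, render="html"):
    """Build a payload with the parameters used in the example above."""
    payload = {"source": "universal", "url": url}
    # Optional parameters are only included when they are set.
    if geo_location:
        payload["geo_location"] = geo_location
    if render:
        payload["render"] = render
    return payload


def fetch(payload, username, password, timeout=180):
    """POST the payload to the realtime endpoint and return the parsed JSON."""
    import requests  # imported here so build_payload works without it

    response = requests.post(
        "https://realtime.oxylabs.io/v1/queries",
        auth=(username, password),
        json=payload,
        timeout=timeout,  # assumed value; adjust to your needs
    )
    # Raise an exception on 4xx/5xx responses instead of parsing an error body.
    response.raise_for_status()
    return response.json()


urls = [
    "https://berlin.craigslist.org/search/ela#search=1~gallery~0~0",
]
payloads = [build_payload(u, geo_location="Germany") for u in urls]
```

Each payload can then be sent with `fetch(payload, 'USERNAME', 'PASSWORD')`.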

### Output sample

```json
{
    "results": [
        {
            "content": "\n\n...\n\n",
            "created_at": "2023-09-21 14:26:52",
            "updated_at": "2023-09-21 14:27:10",
            "page": 1,
            "url": "https://berlin.craigslist.org/search/ela#search=1~gallery~0~0",
            "job_id": "7110630468831163393",
            "status_code": 200
        }
    ]
}
```
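
To work with the returned HTML, you can read the `content` field of each entry in `results`. The sketch below assumes the response has the shape shown in the output sample; the helper name `extract_pages` and the placeholder HTML in `sample` are illustrative only.

```python
def extract_pages(response_json):
    """Return (url, html) pairs for every result with status code 200."""
    pages = []
    for result in response_json.get("results", []):
        # Skip entries where the target page was not fetched successfully.
        if result.get("status_code") != 200:
            continue
        pages.append((result.get("url"), result.get("content", "")))
    return pages


# Placeholder data mirroring the output sample; the HTML is made up.
sample = {
    "results": [
        {
            "content": "<html><body>placeholder</body></html>",
            "url": "https://berlin.craigslist.org/search/ela#search=1~gallery~0~0",
            "status_code": 200,
        },
        {"content": "", "url": "https://example.org/missing", "status_code": 404},
    ]
}

for url, html in extract_pages(sample):
    print(url, len(html))
```

The extracted HTML can then be passed to the parser of your choice.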

Oxylabs Craigslist Scraper API can simplify gathering public
Craigslist data, such as jobs, items, services, and ads. If you have
any questions, feel free to get in touch with us via
[live chat](https://oxylabs.io/) or email.

Also, check out this tutorial on [PyPI](https://pypi.org/project/craiglist-scraper-api/).