https://github.com/luminati-io/Indeed-dataset-samples

A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping

# Indeed-dataset-samples

A sample dataset of 1001 Indeed job listings

![Indeed dataset header](https://github.com/luminati-io/Indeed-dataset-samples/blob/main/indeed-datasets.PNG)

An Indeed dataset sample of over 1,000 job listings. The dataset was extracted using the Bright Data API.

Data points included in this free dataset:

* ```jobid```: Unique job identifier
* ```job_title```: Title of the job position
* ```location```: Job location
* ```country```: Country of job location
* ```date_posted```: Date of job posting
* ```description_text```: Detailed job description
* ```description```: Additional job description
* ```url```: URL of the job listing
* ```company_name```: Name of the hiring company
* ```domain```: Company website domain
* ```job_type```: Type of employment (full-time, part-time, etc.)
* ```company_link```: Link to company profile

And a lot more.
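As a minimal sketch of working with these data points, the snippet below reads CSV rows and keeps only the fields listed above. The inline `SAMPLE_CSV` text, the `extra_column` name, and the helper names are made up for illustration; in practice you would pass an open handle to whichever sample file you downloaded from this repository.

```python
import csv
import io

# Field names taken from the list above; the real file may carry more columns.
LISTED_FIELDS = [
    "jobid", "job_title", "location", "country", "date_posted",
    "description_text", "description", "url", "company_name",
    "domain", "job_type", "company_link",
]


def read_csv_rows(handle):
    """Read rows from an open CSV text handle as dictionaries."""
    return list(csv.DictReader(handle))


def select_listed_fields(rows):
    """Keep only the documented fields, filling absent ones with None."""
    return [{name: row.get(name) for name in LISTED_FIELDS} for row in rows]


# Hypothetical stand-in for the real sample file (values are invented).
SAMPLE_CSV = (
    "jobid,job_title,company_name,extra_column\n"
    "1,Data Analyst,Example Corp,ignored\n"
)

rows = select_listed_fields(read_csv_rows(io.StringIO(SAMPLE_CSV)))
print(rows[0]["job_title"])
```

To use it on a real file, replace `io.StringIO(SAMPLE_CSV)` with `open(path, newline="", encoding="utf-8")`, where `path` points at your local copy.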

This sample is a subset of the "Indeed Job Listings Information (public data)" dataset, which includes more than 26,500,000 companies.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.
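For the line-delimited formats (NDJSON / JSON Lines), a reader can be sketched with the Python standard library alone. This assumes one JSON object per line and a `.gz` suffix on compressed files; the file name `indeed-sample.ndjson.gz` and the two records in `demo_round_trip` are hypothetical and exist only to make the example runnable.

```python
import gzip
import json
import os
import tempfile


def read_json_lines(path):
    """Yield one record per non-empty line; opens .gz files with gzip."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def demo_round_trip():
    """Write two invented records to a temporary .gz file and read them back."""
    records = [
        {"jobid": "a1", "job_title": "Nurse"},
        {"jobid": "b2", "job_title": "Driver"},
    ]
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "indeed-sample.ndjson.gz")  # hypothetical name
        with gzip.open(path, "wt", encoding="utf-8") as handle:
            for record in records:
                handle.write(json.dumps(record) + "\n")
        return list(read_json_lines(path))


loaded = demo_round_trip()
print(len(loaded))
```

Because `read_json_lines` is a generator, it can stream a large delivery file without loading it into memory at once.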

Dataset delivery options: Email, API download, Webhook, Amazon S3, Google Cloud Storage, Google Cloud Pub/Sub, Microsoft Azure, Snowflake, SFTP.

Update frequency: once, daily, weekly, monthly, quarterly, or on a custom basis.

Data enrichment beyond the extracted data points: available on request.

[Get the full Indeed dataset](https://brightdata.com/products/datasets/indeed).

![Indeed dataset visual](https://github.com/luminati-io/Indeed-dataset-samples/blob/main/indeed-datasets-image.PNG)

## What are the Indeed dataset use cases?

1. Market Analysis

Discover leading companies, professionals, and trends in employee transitions between organizations, enabling more targeted competitive intelligence and analysis. Uncover prevalent skill sets within specific areas by analyzing data based on zip codes.

2. Market Growth

Monitor hiring trends over time to predict a company's growth or decline. Analyze job demand, emerging roles, strategic shifts, market expansion, and recruitment trends across various job categories, demographics, and locations.

3. Talent Tracking

Enhance machine learning models for candidate recommendations by analyzing job descriptions and in-demand skills. Gain insights into required qualifications, track top talent, and keep databases current with the latest company data from Indeed.
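Two of the use cases above, monitoring hiring trends over time and finding in-demand skills, can be sketched as simple counts over the `company_name`, `date_posted`, and `description_text` fields. This is only an illustration under stated assumptions: it assumes `date_posted` starts with an ISO-style `YYYY-MM` prefix (check the actual format in the sample first), and the records and skill list are invented.

```python
import re
from collections import Counter


def postings_per_month(records):
    """Count listings per (company_name, 'YYYY-MM') pair."""
    counts = Counter()
    for record in records:
        date_posted = str(record.get("date_posted") or "")
        company = record.get("company_name") or "unknown"
        # Assumption: date_posted begins with 'YYYY-MM'; other formats are skipped.
        if re.match(r"\d{4}-\d{2}", date_posted):
            counts[(company, date_posted[:7])] += 1
    return counts


def count_skill_mentions(records, skills):
    """Count how many listings mention each skill as a whole word."""
    counts = Counter()
    for record in records:
        text = (record.get("description_text") or "").lower()
        for skill in skills:
            # Lookarounds keep 'java' from matching inside 'javascript'.
            pattern = r"(?<!\w)" + re.escape(skill.lower()) + r"(?!\w)"
            if re.search(pattern, text):
                counts[skill] += 1
    return counts


# Invented records for illustration only.
SAMPLE_RECORDS = [
    {"company_name": "Example Corp", "date_posted": "2024-01-15",
     "description_text": "Python and SQL required."},
    {"company_name": "Example Corp", "date_posted": "2024-01-20",
     "description_text": "JavaScript developer, SQL a plus."},
    {"company_name": "Example Corp", "date_posted": "2024-02-02",
     "description_text": None},
]

monthly = postings_per_month(SAMPLE_RECORDS)
skill_counts = count_skill_mentions(SAMPLE_RECORDS, ["python", "sql", "java"])
print(monthly)
print(skill_counts)
```

Keyword matching like this is deliberately crude; it is a starting point for exploring the sample, not a substitute for proper skill extraction.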

## Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's [Web Scraper APIs](https://brightdata.com/products/web-scraper) and [ready-to-use datasets](https://brightdata.com/products/datasets) to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application [here](https://brightinitiative.com).