Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
https://github.com/luminati-io/Indeed-dataset-samples
api data-analysis datasets indeed jobs web-scraping
Last synced: 1 day ago
JSON representation
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
- Host: GitHub
- URL: https://github.com/luminati-io/Indeed-dataset-samples
- Owner: luminati-io
- Created: 2024-08-25T11:06:14.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-08-25T11:30:39.000Z (3 months ago)
- Last Synced: 2024-08-26T16:21:29.324Z (2 months ago)
- Topics: api, data-analysis, datasets, indeed, jobs, web-scraping
- Homepage: https://brightdata.com/products/datasets/indeed
- Size: 2.7 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-web-scraping - Indeed data
- awesome-web-scraping - Indeed data
README
# Indeed-dataset-samples
A sample dataset of 1001 Indeed job listings
![Indeed dataset header](https://github.com/luminati-io/Indeed-dataset-samples/blob/main/indeed-datasets.PNG)
A Indeed dataset sample of over 1000 job listings. Dataset was extracted using the Bright Data API.
Data points included in this free dataset:
* ```jobid```: Unique job identifier
* ```job_title```: Title of the job position
* ```location```: Job location
* ```country```: Country of job location
* ```date_posted```: Date of job posting
* ```description_text```: Detailed job description
* ```description```: Additional job description
* ```url```: URL of the job listing
* ```company_name```: Name of the hiring company
* ```domain```: Company website domain
* ```job_type```: Type of employment (full-time, part-time, etc.)
* ```company_link```: Link to company profileAnd a lot more.
This is a sample subset which is derived from the "Indeed Job Listings Information (public data)"
dataset which includes more than 26,500,000 companies.Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.
Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.
Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.
Data enrichment available as an addition to the data points extracted: Based on request.
[Get the full Indeed dataset](https://brightdata.com/products/datasets/indeed).
![Indeed dataset visual](https://github.com/luminati-io/Indeed-dataset-samples/blob/main/indeed-datasets-image.PNG)
What are the Indeed datasets use cases?
1. Market Analysis
Discover leading companies, professionals, and trends in employee transitions between organizations, enabling more targeted competitive intelligence and analysis. Uncover prevalent skill sets within specific areas by analyzing data based on zip codes.
2. Market Growth
Monitor hiring trends over time to predict a company's growth or decline. Analyze job demand, emerging roles, strategic shifts, market expansion, and recruitment trends across various job categories, demographics, and locations.
3. Talent Tracking
Enhance machine learning models for candidate recommendations by analyzing job descriptions and in-demand skills. Gain insights into required qualifications, track top talent, and keep databases current with the latest company data from Indeed.
Free access to web scraping tools and datasets for academic researchers and NGOs
The Bright Initiative offers access to Bright Data's [Web Scraper APIs](https://brightdata.com/products/web-scraper) and [ready-to-use datasets](https://brightdata.com/products/datasets) to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application [here](https://brightinitiative.com).