{"id":18550070,"url":"https://github.com/luminati-io/Indeed-dataset-samples","last_synced_at":"2025-04-09T21:32:50.450Z","repository":{"id":254699909,"uuid":"847275201","full_name":"luminati-io/Indeed-dataset-samples","owner":"luminati-io","description":"A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.","archived":false,"fork":false,"pushed_at":"2024-08-25T11:30:39.000Z","size":2826,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-03-17T03:25:54.496Z","etag":null,"topics":["api","data-analysis","datasets","indeed","jobs","web-scraping"],"latest_commit_sha":null,"homepage":"https://brightdata.com/products/datasets/indeed","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/luminati-io.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-25T11:06:14.000Z","updated_at":"2024-08-25T11:32:52.000Z","dependencies_parsed_at":"2024-08-25T13:44:10.399Z","dependency_job_id":"90f67298-55c2-4083-b20c-ed11d8dc6e2f","html_url":"https://github.com/luminati-io/Indeed-dataset-samples","commit_stats":null,"previous_names":["luminati-io/indeed-dataset-samples"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FIndeed-dataset-samples","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FIndeed-dataset-samples/tags","releases_url":"https://repos.ecosyste.ms/api/v
1/hosts/GitHub/repositories/luminati-io%2FIndeed-dataset-samples/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luminati-io%2FIndeed-dataset-samples/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/luminati-io","download_url":"https://codeload.github.com/luminati-io/Indeed-dataset-samples/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248114905,"owners_count":21050137,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api","data-analysis","datasets","indeed","jobs","web-scraping"],"created_at":"2024-11-06T21:03:23.271Z","updated_at":"2025-04-09T21:32:48.534Z","avatar_url":"https://github.com/luminati-io.png","language":null,"funding_links":[],"categories":["Free Dataset Samples"],"sub_categories":[],"readme":"# Indeed-dataset-samples\n\n\u003ch2\u003eA sample dataset of 1001 Indeed job listings\u003c/h2\u003e\n\n![Indeed dataset header](https://github.com/luminati-io/Indeed-dataset-samples/blob/main/indeed-datasets.PNG)\n\nAn Indeed dataset sample of over 1000 job listings. 
Dataset was extracted using the \u003cb\u003eBright Data API\u003c/b\u003e.\n\n\u003ch2\u003eData points included in this free dataset:\u003c/h2\u003e\n\n* ```jobid```: Unique job identifier\n* ```job_title```: Title of the job position\n* ```location```: Job location\n* ```country```: Country of job location\n* ```date_posted```: Date of job posting\n* ```description_text```: Detailed job description\n* ```description```: Additional job description\n* ```url```: URL of the job listing\n* ```company_name```: Name of the hiring company\n* ```domain```: Company website domain\n* ```job_type```: Type of employment (full-time, part-time, etc.)\n* ```company_link```: Link to company profile\n\nAnd a lot more.\n\nThis is a sample subset which is derived from the \"Indeed Job Listings Information (public data)\"\ndataset which includes more than \u003cb\u003e26,500,000 companies\u003c/b\u003e.\n\nAvailable dataset file formats: \u003cb\u003eJSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz\u003c/b\u003e.\n\nDataset delivery type options: \u003cb\u003eEmail, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP\u003c/b\u003e.\n\nUpdate frequency: \u003cb\u003eOnce, Daily, Weekly, Monthly, Quarterly, or Custom basis\u003c/b\u003e.\n\nData enrichment available as an addition to the data points extracted: \u003cb\u003eBased on request.\u003c/b\u003e\n\n\u003cb\u003e[Get the full Indeed dataset](https://brightdata.com/products/datasets/indeed)\u003c/b\u003e.\n\n\n![Indeed dataset visual](https://github.com/luminati-io/Indeed-dataset-samples/blob/main/indeed-datasets-image.PNG)\n\n\u003ch2\u003eWhat are the Indeed datasets use cases?\u003c/h2\u003e\n\n\u003ch3\u003e1. Market Analysis\u003c/h3\u003e\n\nDiscover leading companies, professionals, and trends in employee transitions between organizations, enabling more targeted competitive intelligence and analysis. 
Uncover prevalent skill sets within specific areas by analyzing data based on zip codes.\n\n\u003ch3\u003e2. Market Growth\u003c/h3\u003e\n\nMonitor hiring trends over time to predict a company's growth or decline. Analyze job demand, emerging roles, strategic shifts, market expansion, and recruitment trends across various job categories, demographics, and locations.\n\n\u003ch3\u003e3. Talent Tracking\u003c/h3\u003e\n\nEnhance machine learning models for candidate recommendations by analyzing job descriptions and in-demand skills. Gain insights into required qualifications, track top talent, and keep databases current with the latest company data from Indeed.\n\n\u003ch2\u003eFree access to web scraping tools and datasets for academic researchers and NGOs\u003c/h2\u003e\n\nThe Bright Initiative offers access to Bright Data's \u003cb\u003e[Web Scraper APIs](https://brightdata.com/products/web-scraper)\u003c/b\u003e and \u003cb\u003e[ready-to-use datasets](https://brightdata.com/products/datasets)\u003c/b\u003e to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application \u003cb\u003e[here](https://brightinitiative.com)\u003c/b\u003e.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fluminati-io%2FIndeed-dataset-samples","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fluminati-io%2FIndeed-dataset-samples","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fluminati-io%2FIndeed-dataset-samples/lists"}