Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jneidel/job-titles
Normalized dataset of 70k job titles
https://github.com/jneidel/job-titles
data dataset jobs json opendata
Last synced: 2 months ago
JSON representation
Normalized dataset of 70k job titles
- Host: GitHub
- URL: https://github.com/jneidel/job-titles
- Owner: jneidel
- Created: 2019-03-09T03:01:39.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2024-03-01T15:40:19.000Z (12 months ago)
- Last Synced: 2024-11-26T10:46:52.018Z (3 months ago)
- Topics: data, dataset, jobs, json, opendata
- Language: Shell
- Homepage:
- Size: 1.06 MB
- Stars: 64
- Watchers: 5
- Forks: 16
- Open Issues: 1
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
# job-titles
> Normalized dataset of 70k job titles
[![](https://img.shields.io/badge/job_titles-73380-brightgreen.svg?style=flat-square)](job-titles.txt)
## Data Normalizations
The data is normalized in the following ways:
- lowercase
- `-` replaced with a \
- `,` removed## Caveats
- Duplicates such as `a and p mechanic` and `a&p mechanic`
- Non-English titles such as `ab initio etl developer`## See also
- [animal names](https://github.com/jneidel/animal-names)
- [nationalities](https://github.com/jneidel/nationalities)## Contribute
Feel free to open a pull request fixing above listed caveats or any other enhancements.
Only edit `job-titles.txt`. After doing so run `./format.sh`.
## Attribution
This dataset is a collection of the following sources:
- [johnpcarty/Thesaurus-of-Job-Titles](https://github.com/johnpcarty/Thesaurus-of-Job-Titles/blob/master/synonym_job_titles_for_search.txt) (GPLv3)
- [onurdegerli/job-titles](https://github.com/onurdegerli/job-titles/blob/master/job_titles.sql)
- [fluquid/find_job_titles](https://github.com/fluquid/find_job_titles/blob/master/src/find_job_titles/data/titles_combined.txt.gz) (MIT)