{"id":25862731,"url":"https://github.com/romaklimenko/imdb-mongo","last_synced_at":"2026-06-09T03:04:27.835Z","repository":{"id":152408919,"uuid":"433595615","full_name":"romaklimenko/imdb-mongo","owner":"romaklimenko","description":null,"archived":false,"fork":false,"pushed_at":"2021-11-30T21:45:44.000Z","size":4,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2024-09-15T05:01:09.777Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/romaklimenko.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-11-30T21:44:43.000Z","updated_at":"2024-09-10T19:31:18.000Z","dependencies_parsed_at":null,"dependency_job_id":"9bc2e102-98fa-4a20-bcf8-c84afec87641","html_url":"https://github.com/romaklimenko/imdb-mongo","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/romaklimenko%2Fimdb-mongo","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/romaklimenko%2Fimdb-mongo/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/romaklimenko%2Fimdb-mongo/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/romaklimenko%2Fimdb-mongo/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/romaklimenko","download_url":"https://codeload.github.com/romaklimenko/imdb-mongo/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241439768,"owners_count":19963100,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-03-01T23:56:24.440Z","updated_at":"2026-06-09T03:04:22.807Z","avatar_url":"https://github.com/romaklimenko.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# IMDB\n\nhttps://www.imdb.com/interfaces/\n\n## IMDb Datasets\nSubsets of IMDb data are available for access to customers for personal and non-commercial use. You can hold local copies of this data, and it is subject to our terms and conditions. Please refer to the Non-Commercial Licensing and copyright/license and verify compliance.\n\n## Data Location\n\nThe dataset files can be accessed and downloaded from https://datasets.imdbws.com/. The data is refreshed daily.\n\n## IMDb Dataset Details\n\nEach dataset is contained in a gzipped, tab-separated-values (TSV) formatted file in the UTF-8 character set. The first line in each file contains headers that describe what is in each column. A ‘\\N’ is used to denote that a particular field is missing or null for that title/name. The available datasets are as follows:\n\n**title.akas.tsv.gz** - Contains the following information for titles:\n\n* titleId (string) - a tconst, an alphanumeric unique identifier of the title\n* ordering (integer) – a number to uniquely identify rows for a given titleId\n* title (string) – the localized title\n* region (string) - the region for this version of the title\n* language (string) - the language of the title\n* types (array) - Enumerated set of attributes for this alternative title. One or more of the following: \"alternative\", \"dvd\", \"festival\", \"tv\", \"video\", \"working\", \"original\", \"imdbDisplay\". New values may be added in the future without warning\n* attributes (array) - Additional terms to describe this alternative title, not enumerated\n* isOriginalTitle (boolean) – 0: not original title; 1: original title\n\n**title.basics.tsv.gz** - Contains the following information for titles:\n\n* tconst (string) - alphanumeric unique identifier of the title\n* titleType (string) – the type/format of the title (e.g. movie, short, tvseries, tvepisode, video, etc)\n* primaryTitle (string) – the more popular title / the title used by the filmmakers on promotional materials at the point of release\n* originalTitle (string) - original title, in the original language\n* isAdult (boolean) - 0: non-adult title; 1: adult title\n* startYear (YYYY) – represents the release year of a title. In the case of TV Series, it is the series start year\n* endYear (YYYY) – TV Series end year. ‘\\N’ for all other title types\n* runtimeMinutes – primary runtime of the title, in minutes\n* genres (string array) – includes up to three genres associated with the title\n\n**title.crew.tsv.gz** – Contains the director and writer information for all the titles in IMDb. Fields include:\n\n* tconst (string) - alphanumeric unique identifier of the title\n* directors (array of nconsts) - director(s) of the given title\n* writers (array of nconsts) – writer(s) of the given title\n\n**title.episode.tsv.gz** – Contains the tv episode information. Fields include:\n\n* tconst (string) - alphanumeric identifier of episode\n* parentTconst (string) - alphanumeric identifier of the parent TV Series\n* seasonNumber (integer) – season number the episode belongs to\n* episodeNumber (integer) – episode number of the tconst in the TV series\n\n**title.principals.tsv.gz** – Contains the principal cast/crew for titles\n\n* tconst (string) - alphanumeric unique identifier of the title\n* ordering (integer) – a number to uniquely identify rows for a given titleId\n* nconst (string) - alphanumeric unique identifier of the name/person\n* category (string) - the category of job that person was in\n* job (string) - the specific job title if applicable, else '\\N'\n* characters (string) - the name of the character played if applicable, else '\\N'\n\n**title.ratings.tsv.gz** – Contains the IMDb rating and votes information for titles\n\n* tconst (string) - alphanumeric unique identifier of the title\n* averageRating – weighted average of all the individual user ratings\n* numVotes - number of votes the title has received\n\n**name.basics.tsv.gz** – Contains the following information for names:\n\n* nconst (string) - alphanumeric unique identifier of the name/person\n* primaryName (string)– name by which the person is most often credited\n* birthYear – in YYYY format\n* deathYear – in YYYY format if applicable, else '\\N'\n* primaryProfession (array of strings)– the top-3 professions of the person\n* knownForTitles (array of tconsts) – titles the person is known for","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fromaklimenko%2Fimdb-mongo","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fromaklimenko%2Fimdb-mongo","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fromaklimenko%2Fimdb-mongo/lists"}