{"id":34086130,"url":"https://github.com/zembrodt/pymdb","last_synced_at":"2026-04-07T06:31:19.560Z","repository":{"id":50184695,"uuid":"211527115","full_name":"zembrodt/pymdb","owner":"zembrodt","description":"Python package to both parse datsets provided by IMDb and scrape information from imdb.com","archived":false,"fork":false,"pushed_at":"2022-12-08T07:43:52.000Z","size":386,"stargazers_count":6,"open_issues_count":5,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-08-19T05:23:19.232Z","etag":null,"topics":["actor","actress","api","cinema","composer","director","film","imdb","imdb-api","imdb-dataset","imdb-movies","movie-database","moviedb-api","movies","movies-api","pymdb","tvdb","webscraper","webscrapping","writer"],"latest_commit_sha":null,"homepage":"https://pymdb.com","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zembrodt.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-09-28T16:11:23.000Z","updated_at":"2024-01-30T20:33:05.000Z","dependencies_parsed_at":"2022-09-05T16:41:20.066Z","dependency_job_id":null,"html_url":"https://github.com/zembrodt/pymdb","commit_stats":null,"previous_names":[],"tags_count":5,"template":false,"template_full_name":null,"purl":"pkg:github/zembrodt/pymdb","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zembrodt%2Fpymdb","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zembrodt%2Fpymdb/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zembrodt%2Fpymdb/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zembrodt%2Fpymdb/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zembrodt","download_url":"https://codeload.github.com/zembrodt/pymdb/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zembrodt%2Fpymdb/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31503380,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-07T03:10:19.677Z","status":"ssl_error","status_checked_at":"2026-04-07T03:10:13.982Z","response_time":105,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["actor","actress","api","cinema","composer","director","film","imdb","imdb-api","imdb-dataset","imdb-movies","movie-database","moviedb-api","movies","movies-api","pymdb","tvdb","webscraper","webscrapping","writer"],"created_at":"2025-12-14T13:23:09.434Z","updated_at":"2026-04-07T06:31:19.556Z","avatar_url":"https://github.com/zembrodt.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# PyMDb\n\n[![PyPI](https://img.shields.io/pypi/v/py-mdb.svg)](https://pypi.org/project/py-mdb/)\n[![Python Versions](https://img.shields.io/pypi/pyversions/py-mdb.svg)](https://pypi.org/project/py-mdb/)\n[![License](https://img.shields.io/pypi/l/py-mdb.svg)](https://github.com/zembrodt/pymdb/blob/master/LICENSE)\n[![Build Status](https://travis-ci.com/zembrodt/pymdb.svg?branch=master)](https://travis-ci.com/zembrodt/pymdb)\n\nPyMDb is a package for both parsing the [datasets provided by IMDb](https://datasets.imdbws.com/) and scraping information from their web pages.\n\nThis package is able to gather information on people, titles, and companies provided by IMDb and is split into two separate modules: one for parsing the IMDb datasets, and one for scraping webpages on [imdb.com](http://imdb.com/).\n\n## Installation\n\nThe latest release of PyMDb can be installed from PyPI with:\n\n```pip install py-mdb```\n\nIf downloading the source from GitHub, PyMDb requires the following packages:\n\n- [requests](https://github.com/psf/requests)\n- [selectolax](https://github.com/rushter/selectolax)\n\n## Usage\n\n```python\n\u003e\u003e\u003e import pymdb\n\u003e\u003e\u003e from collections import defaultdict\n\u003e\u003e\u003e\n\u003e\u003e\u003e parser = pymdb.PyMDbParser(gunzip_files=True)\n\u003e\u003e\u003e genre_count = defaultdict(int)\n\u003e\u003e\u003e for title in parser.get_title_basics(\"path/to/files\"):\n...     for genre in title.genres:\n...             genre_count[genre] += 1\n...\n\u003e\u003e\u003e for genre in genre_count:\n...     print(f\"{genre}: {genre_count[genre]}\")\n...\nDocumentary: 600184\nShort: 837912\nAnimation: 312227\n    ...\nTalk-Show: 584252\nReality-TV: 307037\nAdult: 178493\n\u003e\u003e\u003e\n\u003e\u003e\u003e scraper = pymdb.PyMDbScraper(rate_limit=500)\n\u003e\u003e\u003e title = scraper.get_title(\"tt0076759\")\n\u003e\u003e\u003e print(f\"{title.display_title} came out in {title.release_date.year}!\")\nStar Wars: Episode IV - A New Hope came out in 1977!\n```  \n\n## Documentation\n\nFull documentation can be found at the [PyMDb Read the Docs](https://pymdb.readthedocs.io/) page.\n\n## Disclaimer\n\nPyMDb is still in a pre-release state and has only been tested with a small amount of data found on [imdb.com](http://imdb.com/).\nThe web scraper portion of the code does have a rate limiter value you can customize, please be kind to IMDb.\nIf any bugs or issues are found, please do not hesitate to create an issue or make a pull request on [GitHub](https://github.com/zembrodt/pymdb).\nSuggestions for features to be added to PyMDb in future releases are also welcome!\n\n## License\n\nThis project is licensed under the MIT License. Please see the [LICENSE](https://github.com/zembrodt/pymdb/blob/master/LICENSE) file for details.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzembrodt%2Fpymdb","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzembrodt%2Fpymdb","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzembrodt%2Fpymdb/lists"}