{"id":14128539,"url":"https://github.com/sethmlarson/pypi-data","last_synced_at":"2025-04-05T12:05:35.207Z","repository":{"id":42865523,"uuid":"312117924","full_name":"sethmlarson/pypi-data","owner":"sethmlarson","description":"Data about packages and maintainers on PyPI","archived":false,"fork":false,"pushed_at":"2024-10-08T19:45:55.000Z","size":3228,"stargazers_count":122,"open_issues_count":6,"forks_count":8,"subscribers_count":7,"default_branch":"main","last_synced_at":"2024-10-13T08:06:25.728Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/sethmlarson.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-11-11T23:43:11.000Z","updated_at":"2024-10-06T14:13:43.000Z","dependencies_parsed_at":"2024-10-25T18:30:46.975Z","dependency_job_id":"f07f167a-733c-44b7-be81-924edfb03f98","html_url":"https://github.com/sethmlarson/pypi-data","commit_stats":{"total_commits":112,"total_committers":11,"mean_commits":"10.181818181818182","dds":0.3660714285714286,"last_synced_commit":"5df30e7c24732bf46de5cf134e69e226b8f64b00"},"previous_names":[],"tags_count":24,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sethmlarson%2Fpypi-data","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sethmlarson%2Fpypi-data/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sethmlarson%2Fpypi-data/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sethmlarson%2Fpypi-data/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/sethmlarson","download_url":"https://codeload.github.com/sethmlarson/pypi-data/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247332604,"owners_count":20921853,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-08-15T16:01:49.131Z","updated_at":"2025-04-05T12:05:35.181Z","avatar_url":"https://github.com/sethmlarson.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# PyPI Data\n\nMostly up-to-date data about almost every package on PyPI\n\nGet access to the database via [GitHub releases](https://github.com/sethmlarson/pypi-data/releases).\n\n```console\n$ gunzip pypi.db.gz\n$ sqlite3 'pypi.db' 'SELECT * FROM packages LIMIT 10 OFFSET 1000;'\n\nacid-vault|1.3.2|\u003e=3.6|1|0|2021-01-21 04:37:10\nacidcli|1.0.1|\u003e=3.6|0|0|2021-01-21 04:37:10\nacidfile|1.2.1||0|0|2021-01-21 04:37:10\nacidfs|1||0|0|2021-01-21 04:37:10\nacidoseq|1.3.7||0|0|2021-01-21 04:37:10\nacinonyx|0.1.0|\u003e=3.6.0|0|0|2021-01-21 04:37:10\naciops|2.0.0|\u003e=3.6|0|0|2021-01-21 04:37:10\nacitoolkit|0.4||0|0|2021-01-21 04:37:10\nackeras|0.1.1||0|0|2021-01-21 04:37:10\nackg|0.0.5||0|0|2021-01-21 04:37:10\n```\n\n## Data being tracked\n\n- Name, Version, Upload Time\n- Direct requirements (`requires_dist`)\n- Extra requirements (`requires_extras`)\n- Python requirements (`requires_python`)\n- Trove classifiers (`classifiers`)\n- Yanked versions (`yanked`)\n- Wheel data (`build_tag`, `python_tags`, `abi_tags`, `platform_tags`)\n- Maintainers on PyPI\n- URLs used by packages\n- OpenSSF scorecard data\n- Google Assured OSS\n\n### Database Schemas\n\n```sql\n-- Packages --\nCREATE TABLE packages (\n    name STRING,\n    version STRING,\n    requires_python STRING,\n    yanked BOOLEAN DEFAULT FALSE,\n    has_binary_wheel BOOLEAN,\n    has_vulnerabilities BOOLEAN,\n    first_uploaded_at TIMESTAMP,\n    last_uploaded_at TIMESTAMP,\n    recorded_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,\n    downloads INTEGER,\n    scorecard_overall FLOAT,\n    in_google_assured_oss BOOLEAN,\n    PRIMARY KEY (name)\n);\n\n-- Dependencies --\nCREATE TABLE deps (\n    package_name STRING,\n    extra STRING DEFAULT NULL,\n    dep_name STRING,\n    dep_specifier STRING,\n    PRIMARY KEY (package_name, dep_name, dep_specifier)\n);\n\n-- Wheel data --\nCREATE TABLE wheels (\n    package_name STRING,\n    filename STRING,\n    build STRING,\n    python STRING,\n    abi STRING,\n    platform STRING,\n    uploaded_at TIMESTAMP,\n    PRIMARY KEY (package_name, filename)\n);\n\n-- Maintainer data --\nCREATE TABLE maintainers (\n    name STRING,\n    package_name STRING\n);\n\n-- Package URLs --\nCREATE TABLE package_urls (\n    package_name STRING,\n    name STRING,\n    url STRING,\n    public_suffix STRING\n)\n\n-- OpenSSF Scorecard --\nCREATE TABLE scorecard_checks (\n    package_name STRING,\n    name STRING,\n    score INTEGER\n)\n\n-- Trove Classifiers --\nCREATE TABLE classifiers (\n    package_name TEXT,\n    name TEXT,\n    PRIMARY KEY (package_name, name),\n    FOREIGN KEY (package_name) REFERENCES packages(name)\n)\n```\n\n### Download data\n\nDownloads are grabbed from https://github.com/hugovk/top-pypi-packages but only available for the top 5,000 packages.\n\n## Running locally\n\n```\n$ docker build -t pypi-data .\n$ docker run --rm pypi-data\n```\n\n## License\n\nApache-2.0\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsethmlarson%2Fpypi-data","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsethmlarson%2Fpypi-data","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsethmlarson%2Fpypi-data/lists"}