{"id":16130574,"url":"https://github.com/a-r-j/cpdb","last_synced_at":"2026-01-18T09:02:02.568Z","repository":{"id":174758131,"uuid":"623160640","full_name":"a-r-j/CPDB","owner":"a-r-j","description":"Cython implementation of PDB -\u003e DataFrame parsing","archived":false,"fork":false,"pushed_at":"2025-12-23T17:19:28.000Z","size":19588,"stargazers_count":34,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"main","last_synced_at":"2026-01-07T01:45:34.078Z","etag":null,"topics":["bioinformatics","deep-learning","machine-learning","pdb","pdb-files","pdb-parser","protein","protein-structure","structural-biology"],"latest_commit_sha":null,"homepage":"","language":"Cython","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/a-r-j.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-04-03T20:22:20.000Z","updated_at":"2025-12-23T17:18:03.000Z","dependencies_parsed_at":null,"dependency_job_id":"f02623b1-bad5-43e6-9ddf-9e7a7ba059ae","html_url":"https://github.com/a-r-j/CPDB","commit_stats":null,"previous_names":["a-r-j/cpdb"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/a-r-j/CPDB","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/a-r-j%2FCPDB","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/a-r-j%2FCPDB/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/a-r-j%2FCPDB/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/a-r-j%2FCPDB/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/a-r-j","download_url":"https://codeload.github.com/a-r-j/CPDB/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/a-r-j%2FCPDB/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28534154,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-18T00:39:45.795Z","status":"online","status_checked_at":"2026-01-18T02:00:07.578Z","response_time":98,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bioinformatics","deep-learning","machine-learning","pdb","pdb-files","pdb-parser","protein","protein-structure","structural-biology"],"created_at":"2024-10-09T22:16:47.490Z","updated_at":"2026-01-18T09:02:02.535Z","avatar_url":"https://github.com/a-r-j.png","language":"Cython","funding_links":[],"categories":[],"sub_categories":[],"readme":"[![PyPI version](https://badge.fury.io/py/cpdb-protein.svg)](https://badge.fury.io/py/cpdb-protein)\n[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)\n\u003ca href=\"https://github.com/psf/black\"\u003e\u003cimg alt=\"Code style: black\" src=\"https://img.shields.io/badge/code%20style-black-000000.svg\"\u003e\u003c/a\u003e\n\n\n# CPDB\nCython implementation of PDB -\u003e DataFrame parsing\n\n## Installation\n\n```bash\npip install cpdb-protein\n```\n\n\n## Usage\n\n\n### To Dictionary\n```python\n# To dictionary\nfrom cpdb import parse\n\n# From Disk\ndata = parse(\"path_to_pdb.pdb\", df=False)\ndata = parse(\"path_to_pdb.pdb.gz\", df=False)\n\n# From str\nwith open(\"tests/test_data/1htq.pdb\") as f:\n    pdb_file = f.readlines()\ndata = parse(pdb_str=pdb_file, df=False)\n\n# From PDB\ndata = parse(pdb_code=\"3eiy\", df=False)\n\n# From AF2\ndata = parse(uniprot_id=\"Q8W3K0\", df=False)\n```\n\n```\n{'record_name': array(['ATOM', 'ATOM', 'ATOM', ..., 'HETATM', 'HETATM', 'HETATM'],\n      dtype=object), 'atom_number': array([   1,    2,    3, ..., 1773, 1774, 1775], dtype=int32), 'atom_name': array(['N', 'CA', 'C', ..., 'O', 'O', 'O'], dtype=object), 'alt_loc': array(['', '', '', ..., '', '', ''], dtype=object), 'residue_name': array(['GLY', 'GLY', 'GLY', ..., 'HOH', 'HOH', 'HOH'], dtype=object), 'chain_id': array(['A', 'A', 'A', ..., 'A', 'A', 'A'], dtype=object), 'residue_number': array([  30,   30,   30, ..., 2276, 2277, 2278], dtype=int32), 'insertion': array(['', '', '', ..., '', '', ''], dtype=object), 'x_coord': array([31.203, 32.02 , 33.358, ..., 44.665, 41.786, 38.498], dtype=float32), 'y_coord': array([26.31 , 27.046, 26.387, ..., 13.172, 10.059, 12.491], dtype=float32), 'z_coord': array([ 6.06 ,  5.069,  4.79 , ..., 18.445, 22.316, 15.004], dtype=float32), 'occupancy': array([0.5, 0.5, 0.5, ..., 1. , 1. , 1. ], dtype=float32), 'b_factor': array([26.27, 29.29, 30.21, ..., 24.67, 34.64, 41.14], dtype=float32), 'element_symbol': array(['N', 'C', 'C', ..., 'O', 'O', 'O'], dtype=object), 'charge': array(['', '', '', ..., '', '', ''], dtype=object), 'model_idx': array([1, 1, 1, ..., 1, 1, 1], dtype=int32)}\n```\n\n### To Pandas DataFrame\n\n```python\nfrom cpdb import parse\n\n# From Disk\ndata = parse(\"path_to_pdb.pdb\", df=True)\ndata = parse(\"path_to_pdb.pdb.gz\", df=True)\n\n# From str\nwith open(\"tests/test_data/1htq.pdb\") as f:\n    pdb_file = f.readlines()\ndata = parse(pdb_str=pdb_file, df=True)\n\n# From PDB\ndata = parse(pdb_code=\"3eiy\", df=True)\n\n# From AF2\ndata = parse(uniprot_id=\"Q8W3K0\", df=True)\n```\n\n```\n     record_name  atom_number atom_name alt_loc residue_name chain_id  residue_number insertion    x_coord    y_coord    z_coord  occupancy   b_factor element_symbol charge  model_idx\n0           ATOM            1         N                  GLY        A              30            31.202999  26.309999   6.060000       0.50  26.270000              N                 1\n1           ATOM            2        CA                  GLY        A              30            32.020000  27.046000   5.069000       0.50  29.290001              C                 1\n2           ATOM            3         C                  GLY        A              30            33.358002  26.386999   4.790000       0.50  30.209999              C                 1\n3           ATOM            4         O                  GLY        A              30            33.810001  25.535999   5.552000       0.50  29.299999              O                 1\n4           ATOM            5         N                  GLY        A              31            33.987000  26.789000   3.684000       0.50  31.889999              N                 1\n...          ...          ...       ...     ...          ...      ...             ...       ...        ...        ...        ...        ...        ...            ...    ...        ...\n1769      HETATM         1771         O                  HOH        A            2274            42.688999  61.925999  29.589001       1.00  39.950001              O                 1\n1770      HETATM         1772         O                  HOH        A            2275            32.055000  62.648998  30.961000       0.66  15.680000              O                 1\n1771      HETATM         1773         O                  HOH        A            2276            44.665001  13.172000  18.445000       1.00  24.670000              O                 1\n1772      HETATM         1774         O                  HOH        A            2277            41.785999  10.059000  22.316000       1.00  34.639999              O                 1\n1773      HETATM         1775         O                  HOH        A            2278            38.498001  12.491000  15.004000       1.00  41.139999              O                 1\n```\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fa-r-j%2Fcpdb","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fa-r-j%2Fcpdb","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fa-r-j%2Fcpdb/lists"}