{"id":30201469,"url":"https://github.com/esipfed/nbmeta","last_synced_at":"2025-08-13T10:50:14.768Z","repository":{"id":78651875,"uuid":"142690267","full_name":"ESIPFed/NbMeta","owner":"ESIPFed","description":"Notebooks need metadata to be searchable. This is a repository for the FUNding Friday project, NbMeta. The goal of NbMeta is to extract metadata from Jupyter Notebooks, ingest the results into a database, and make it searchable. ","archived":false,"fork":false,"pushed_at":"2018-12-31T23:56:23.000Z","size":1111,"stargazers_count":1,"open_issues_count":8,"forks_count":1,"subscribers_count":13,"default_branch":"master","last_synced_at":"2025-08-12T02:58:58.960Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"https://git.io/fAfd2","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ESIPFed.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2018-07-28T15:55:06.000Z","updated_at":"2018-11-13T20:38:21.000Z","dependencies_parsed_at":null,"dependency_job_id":"a0f02bd8-3542-4bd8-a5e3-4a6eb7448b5b","html_url":"https://github.com/ESIPFed/NbMeta","commit_stats":{"total_commits":7,"total_committers":4,"mean_commits":1.75,"dds":0.7142857142857143,"last_synced_commit":"18541df552b3efe116a6e052059e804149d115d9"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ESIPFed/NbMeta","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ESIPFed%2FNbMeta","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ESIPFed%2FNbMeta/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ESIPFed%2FNbMeta/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ESIPFed%2FNbMeta/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ESIPFed","download_url":"https://codeload.github.com/ESIPFed/NbMeta/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ESIPFed%2FNbMeta/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":270228429,"owners_count":24548817,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-13T02:00:09.904Z","response_time":66,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-08-13T10:50:10.637Z","updated_at":"2025-08-13T10:50:14.662Z","avatar_url":"https://github.com/ESIPFed.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# NbMeta\nAs of August 2018, there are over [2.5 Million jupyter notebooks on GitHub](http://nbviewer.jupyter.org/github/parente/nbestimate/blob/master/estimate.ipynb).\nThe code and text in these notebooks form an important body of knowledge of\nresearch, practice, and pedagogy. Unfortunately, it is very difficult to find\nnotebooks that might meet a particular need due to a complete lack of metadata\nassociated with these resources and with their exponential growth it is becoming\nharder.\n\nThere has been significant work done on representing software and\nresearch code as searchable, citable resources. (cite force11 and codemeta).  \n\nWe can leverage this work to inform the concept choices for notebook metadata\nrecords. Since there is already a large corpus of publicly available notebooks,\nand there is inherent difficulty in getting content creators to adopt and follow\nconventions consistently, we believe that a good first step is to develop\nautomated metadata extraction tools for Jupyter notebooks in public GitHub\nrepos. This implied metadata can be leveraged for search and discovery of\nrelevant and useful notebooks.\n\nAt the ESIP Summer 2018 meeting, collaborators in this repo proposed a project\nto pilot work on populating a  database with metadata automatically extracted\nfrom Jupyter notebooks on GitHub. This project was awarded a\n[FUNding Friday](http://wiki.esipfed.org/index.php/FUNding_Friday_Projects)\n grant.\n\n## Deliverables\nBy the Winter 2019 ESIP meeting we propose to demonstrate the following outcomes\nfrom this project:\n1. A workflow for querying the GitHub API to find public repositories containing\nJupyter Notebooks\n2. Initial code to transform Git Repo account info into CodeMeta Author\nproperties\n3. Publish a JSON-LD template and example records for proof of concept  \nmetadata, demonstrating essential notebook metadata for citation and access and\na more complete record focused on use and understanding\n4. A metadata repository based on existing software or a simple solution built\non top of Mongo DB containing an example collection of metadata records\n5. Presentation/poster during the Winter ESIP Meeting.\n\n\n## Follow Along\nPlease visit our project\n[site and blog](https://esipfed.github.io/NbMeta/posts/).\n\n## How to Contribute\nWe believe this project is the start of a useful resource for the notebook-using\ncommunity and intend for it to continue past the deliverables at the ESIP Winter\nmeeting. We'd love to work with you if you agree. We are managing tasks via\na [GitHub Project](https://github.com/ESIPFed/NbMeta/projects/1). Please comment\non the issues and feel free to propose your own.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fesipfed%2Fnbmeta","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fesipfed%2Fnbmeta","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fesipfed%2Fnbmeta/lists"}