{"id":16331150,"url":"https://github.com/robertoostenveld/bird","last_synced_at":"2026-03-18T19:02:16.743Z","repository":{"id":211161553,"uuid":"728371904","full_name":"robertoostenveld/bird","owner":"robertoostenveld","description":"BagIt Research Data","archived":false,"fork":false,"pushed_at":"2023-12-09T20:51:01.000Z","size":16197,"stargazers_count":0,"open_issues_count":1,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-05-15T16:13:12.318Z","etag":null,"topics":["bagit","data","fair","open-datasets","repository"],"latest_commit_sha":null,"homepage":"https://robertoostenveld.github.io/bird/","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/robertoostenveld.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2023-12-06T19:55:10.000Z","updated_at":"2023-12-06T20:20:01.000Z","dependencies_parsed_at":"2023-12-09T21:29:14.243Z","dependency_job_id":null,"html_url":"https://github.com/robertoostenveld/bird","commit_stats":{"total_commits":33,"total_committers":1,"mean_commits":33.0,"dds":0.0,"last_synced_commit":"2bdaf2ace527ec37991a21465a84fa20dbff3917"},"previous_names":["robertoostenveld/bird"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/robertoostenveld/bird","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robertoostenveld%2Fbird","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robertoostenveld%2Fbird/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robertoostenveld%2Fbird/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robertoostenveld%2Fbird/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/robertoostenveld","download_url":"https://codeload.github.com/robertoostenveld/bird/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robertoostenveld%2Fbird/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28755045,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-25T13:59:49.818Z","status":"ssl_error","status_checked_at":"2026-01-25T13:59:33.728Z","response_time":113,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bagit","data","fair","open-datasets","repository"],"created_at":"2024-10-10T23:25:49.470Z","updated_at":"2026-01-25T16:01:38.068Z","avatar_url":"https://github.com/robertoostenveld.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"# BagIt Research Data (BIRD) repository\n\nThis is a demonstration project inspired by the\n[BagIt](https://en.wikipedia.org/wiki/BagIt) standard for storage\nand network transfer of arbitrary digital content, including\nresearch data.\n\nWith research datasets represented as \"Bags\" with metadata in\nthe `bag-info.txt` file, and the pointers to the actual data\nfiles on a download server listed in the `fetch.txt` file,\nwriting a repository server becomes relatively simple. This project\nis an exploration into such a research data repository server.\n\n\nAt this moment it only includes a few examples with the BagIt\nmetadata format; most of the examples here use various metadata\nschemas from other repositories. In the future I can imagine\nthat tools could be implemented to convert the repository specific\nmetadata (and manifest file list) to the BagIt specification.\n\n## Existing datasets and repositories\n\nThe project demonstrates a research data repository by reusing\nsome public datasets from the following research data repositories:\n\n- \u003chttps://data.ru.nl\u003e\n- \u003chttps://zenodo.org\u003e\n- \u003chttps://openneuro.org\u003e\n- \u003chttps://dataverse.nl\u003e\n- \u003chttps://4tu.ru.nl\u003e\n\n## Metadata\n\nThe web server implemented in this project hosts the metadata of\nthe datasets (or \"research data collections\"). It consists of an\noverview page that lists all datasets, and an individual landing\npage for each dataset.\n\nIf this were a proper repository publishing the original datasets,\nthen the DOI of the datasets shoudl directo to the corresponding\nlanding page here.\n\n## Data\n\nBesides the metadata, the landing page contains (or should contain)\nlinks to the actual data files to download. The files themselves\nare not hosted on the same web server but could be on a FTP server,\na WEBDAV server, an S3 server, etc.\n\n## Dataset creation\n\nAt this moment there is no mechanism implemented nor procedure\ndocumented regarding the construction of datasets. Also the minting\nof DOIs is not part of the current efforts. It is assumed that\na file with metadata is provided in JSON, YAML, TSV, CSV, or XML\nformat, and that the actual files have been made available on a\ndownload server.\n\n## Deploying this website\n\nThis project is implemented using [Jekyll](http://jekyllrb.com/),\na static website generator. To run it on your own computer, you\nshould install Ruby and Gem, and run\n\n    gem install bundler jekyll\n\n    git clone https://github.com/robertoostenveld/bird.git\n    cd bird\n    bundle install\n\nSubsequently, you can convert the markdown into html documents with\n\n    bundle exec jekyll serve --livereload  --incremental\n\nSince I am also running this site on Github pages, you may need to\nedit the `baseurl` field in the `_config.yml` file. For a local\ndeployment it should be empty (`\"\"`), for deployment on Github\nit should correspond to the repository name (`\"bird\"`) .\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobertoostenveld%2Fbird","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frobertoostenveld%2Fbird","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobertoostenveld%2Fbird/lists"}