{"id":33179051,"url":"https://github.com/FINRAOS/herd","last_synced_at":"2025-11-20T21:03:16.930Z","repository":{"id":2453464,"uuid":"42949039","full_name":"FINRAOS/herd","owner":"FINRAOS","description":"Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabytes of data and make it accessible for data processing and analytical purposes by any cloud compute platform. ","archived":false,"fork":false,"pushed_at":"2022-10-01T16:03:56.000Z","size":231525,"stargazers_count":138,"open_issues_count":126,"forks_count":41,"subscribers_count":40,"default_branch":"master","last_synced_at":"2025-08-15T22:28:36.060Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"http://finraos.github.io/herd/","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/FINRAOS.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-09-22T17:21:37.000Z","updated_at":"2025-08-13T11:13:59.000Z","dependencies_parsed_at":"2022-08-06T12:15:31.276Z","dependency_job_id":null,"html_url":"https://github.com/FINRAOS/herd","commit_stats":null,"previous_names":[],"tags_count":15,"template":false,"template_full_name":null,"purl":"pkg:github/FINRAOS/herd","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FINRAOS%2Fherd","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FINRAOS%2Fherd/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FINRAOS%2Fherd/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FINRAOS%2Fherd/manifests","owner_url":"https://repos.ec
osyste.ms/api/v1/hosts/GitHub/owners/FINRAOS","download_url":"https://codeload.github.com/FINRAOS/herd/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/FINRAOS%2Fherd/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":285511775,"owners_count":27184237,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-11-20T02:00:05.334Z","response_time":54,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-11-16T03:00:36.818Z","updated_at":"2025-11-20T21:03:16.925Z","avatar_url":"https://github.com/FINRAOS.png","language":"Java","funding_links":[],"categories":["大数据"],"sub_categories":[],"readme":"## Overview [![Build Status](https://travis-ci.org/FINRAOS/herd.svg?branch=master)](https://travis-ci.org/FINRAOS/herd) [![Maven Central](https://img.shields.io/maven-central/v/org.finra.herd/herd.svg?label=Maven%20Central)](https://search.maven.org/search?q=g:%22org.finra.herd%22%20AND%20a:%22herd%22)\n\nHerd is big data governance for the cloud. The herd unified data catalog helps separate compute from storage in the cloud. Herd job orchestration manages your ETL and analytics processes while tracking all data in the catalog. 
Here is a quick summary of features:\n\n- Unified Data Catalog\nA centralized, auditable catalog for operational usage and data governance.\n- Track Lineage\nCapture data ancestry for regulatory, forensic, and analytical purposes.\n- Manage Clusters\nCreate and launch clusters; load data into clusters from catalog entries.\n- Orchestrate Jobs\nOrchestrate clusters and catalog services to automate processing jobs.\n\nFind out more about herd features on our [GitHub project page](http://finraos.github.io/herd/#get_involved).\n\n## Quick Start\n\nThe best way to start learning about herd is through these links. The demo installation process is quick and easy - you can have herd up and running in AWS in 10-15 minutes and start registering data immediately afterwards.\n\n- [What is herd?](https://github.com/FINRAOS/herd/wiki/what-is-herd)\n- [Demo Installation](https://github.com/FINRAOS/herd/wiki/demo-install)\n- [Quick Start to Registering Data](https://github.com/FINRAOS/herd/wiki/quick-start-to-registering-data)\n\n## Get Involved\n\nWe are actively seeking organizations and individuals that are interested in adopting herd and contributing to the development effort. Find out more in the [contributions section](http://finraos.github.io/herd/#get_involved) of our GitHub project page. If you have any questions or discussion topics, post them on [GitHub Issues](https://github.com/FINRAOS/herd/issues) or email us at herd@finra.org.\n\n## License\n\nHerd is licensed under [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0).\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FFINRAOS%2Fherd","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FFINRAOS%2Fherd","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FFINRAOS%2Fherd/lists"}