{"id":13395053,"url":"https://github.com/quiltdata/quilt","last_synced_at":"2025-05-13T22:02:24.008Z","repository":{"id":37336291,"uuid":"81518954","full_name":"quiltdata/quilt","owner":"quiltdata","description":"Quilt is a data mesh for connecting people with actionable data","archived":false,"fork":false,"pushed_at":"2025-04-25T08:49:25.000Z","size":172497,"stargazers_count":1335,"open_issues_count":34,"forks_count":91,"subscribers_count":17,"default_branch":"master","last_synced_at":"2025-04-25T09:44:42.313Z","etag":null,"topics":["data","data-engineering","data-version-control","data-versioning","parquet","python","serialization"],"latest_commit_sha":null,"homepage":"https://quiltdata.com","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/quiltdata.png","metadata":{"files":{"readme":"docs/README.md","changelog":null,"contributing":"docs/CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"docs/CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2017-02-10T02:46:03.000Z","updated_at":"2025-04-24T11:22:23.000Z","dependencies_parsed_at":"2023-11-13T19:56:16.191Z","dependency_job_id":"af6331d7-1fb3-45c2-83b6-e76bed0fc306","html_url":"https://github.com/quiltdata/quilt","commit_stats":{"total_commits":3968,"total_committers":52,"mean_commits":76.3076923076923,"dds":0.8639112903225806,"last_synced_commit":"e52a6d5c988ed132c8aeef455fb0ebb9b597965d"},"previous_names":[],"tags_count":113,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quiltdata%2Fquilt","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quiltdata%2Fquilt/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quiltdata%2Fquilt/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quiltdata%2Fquilt/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/quiltdata","download_url":"https://codeload.github.com/quiltdata/quilt/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":250796259,"owners_count":21488702,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-engineering","data-version-control","data-versioning","parquet","python","serialization"],"created_at":"2024-07-30T17:01:40.512Z","updated_at":"2025-04-28T15:27:56.974Z","avatar_url":"https://github.com/quiltdata.png","language":"TypeScript","readme":"# Quilt: A Data Lakehouse for Actionable Data\n\nQuilt connects teams to actionable data by simplifying data discovery, sharing,\nand analysis. It’s designed to serve data-driven organizations with powerful\ntools for managing data as code, enabling rapid experimentation, and ensuring\ndata integrity at scale.\n\n---\n\n## How to Get Started\n\nQuilt consists of three main elements:\n\n- [Quilt Platform](#quilt-platform-overview) which is a cloud platform for\n  interacting with, visualizing, searching and querying Quilt Packages, which is\n  hosted in an organization's AWS Account.\n- [Quilt Python SDK](#quilt-python-sdk) which provides the ability to create,\n  push, install and delete Quilt Packages.\n- [Quilt Ecosystem](#quilt-ecosystem-and-integrations) which provide extension\n  of the core Quilt Capabilities to enable typical elements of life sciences\n  workflows, such as incorporating orchestration data, and connecting packages\n  to Electronic Lab Notebooks.\n\nTo dive deeper into the capabilities of Quilt, start with our [Quick Start\nGuide](Quickstart.md) or explore the [Installation\nInstructions](Installation.md) for setting up your environment.\n\nIf you have any questions or need help, join our [Slack\ncommunity](https://slack.quiltdata.com/) or submit a support request to\n\u003csupport@quiltdata.io\u003e.\n\n---\n\n## Navigating the Documentation\n\nThe Quilt documentation is structured to guide users through different layers of\nthe platform, from basic concepts to advanced integrations. Whether you're a\nbusiness user, developer, or platform administrator, the docs will help you\nquickly find the information you need.\n\n### Quilt Platform Overview\n\nThe **Quilt Platform** powers the core features of the Quilt data catalog,\nproviding tools for browsing, searching, and visualizing data stored in AWS S3.\nThe platform is ideal for teams needing to collaborate on data, with\ncapabilities like embeddable previews and metadata collection.\n\n**Core Sections:**\n\n- [Architecture](Architecture.md) - Learn how Quilt is architected.\n- [Mental Model](MentalModel.md) - Understand the guiding principles behind Quilt.\n- [Metadata Management](Catalog/Metadata.md) - Manage metadata at scale.\n\nFor users of the Quilt Platform (often referred to as the Catalog):\n\n- [Bucket Browsing](Catalog/FileBrowser.md) - Navigate through S3 buckets.\n- [Document Previews](Catalog/Preview.md) - Visualize documents and datasets\n  directly in the web interface.\n- [Search](Catalog/Search.md) - Leverage Quilt’s powerful search\n  capabilities.\n- [Visualization \u0026 Dashboards](Catalog/VisualizationDashboards.md) - Create\n  visual dashboards for data insights.\n\nFor administrators managing Quilt deployments:\n\n- [Admin Settings UI](Catalog/Admin.md) - Control platform settings and user access.\n- [Catalog Configuration](Catalog/Preferences.md) - Set platform preferences.\n- [Cross-Account Access](CrossAccount.md) - Manage multi-account access to S3 data.\n\n### Quilt Python SDK\n\nThe **Quilt Python SDK** allows users to programmatically manage data packages,\nversion datasets, and automate data workflows. Whether you're uploading a\npackage, fetching data, or scripting custom workflows, the SDK provides the\nflexibility needed for deeper integrations.\n\n- [Installation](Installation.md) - Get started with the Quilt SDK.\n- [Quick Start](Quickstart.md) - Follow a step-by-step guide to building and\n  managing data packages.\n- [Editing and Uploading Packages](walkthrough/editing-a-package.md) - Learn how\n  to version, edit, and share data.\n- [API Reference](api-reference/api.md) - Detailed API documentation for developers.\n\n### Quilt Ecosystem and Integrations\n\nThe **Quilt Ecosystem** extends the platform with integrations and plugins to\nfit your workflow. Whether you're managing scientific data or automating\npackaging tasks, Quilt can be tailored to your needs with these tools:\n\n- [Benchling Packager](examples/benchling.md) - Package electronic lab notebooks\n  from Benchling.\n- [Nextflow Plugin](examples/nextflow.md) - Integrate with Nextflow pipelines\n  for bioinformatics.\n\n---\n\n## Who Should Use Quilt?\n\nQuilt is for teams across industries like machine learning, biotech, and\nanalytics who need to manage large datasets, collaborate seamlessly, and track\nthe lifecycle of their data. Whether you're a data scientist, engineer, or\nadministrator, Quilt helps streamline your data management workflows.\n\n## What Can You Do with Quilt?\n\n- **Share**: Easily share versioned data using simple URLs and email invites.\n- **Understand**: Enrich data with inline documentation and visualizations for\n  better insights.\n- **Discover**: Use metadata and search tools to explore data relationships\n  across projects.\n- **Model**: Version and manage large data sets that don't fit traditional git repositories.\n- **Decide**: Empower your team with auditable data for better decision-making.\n\n---\n","funding_links":[],"categories":["Jupyter Notebook","Data Management","TypeScript","Model and Data Versioning","Data","python","data","Model, Data and Experiment Management"],"sub_categories":["Data Management"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fquiltdata%2Fquilt","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fquiltdata%2Fquilt","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fquiltdata%2Fquilt/lists"}