{"id":36957741,"url":"https://github.com/openaleph/ftm-lakehouse","last_synced_at":"2026-02-02T12:57:01.527Z","repository":{"id":272803091,"uuid":"867479210","full_name":"openaleph/ftm-lakehouse","owner":"openaleph","description":"Data standard and archive storage for structured FollowTheMoney data, leaked data, private and public document collections.","archived":false,"fork":false,"pushed_at":"2025-12-18T17:49:17.000Z","size":28835,"stargazers_count":4,"open_issues_count":1,"forks_count":1,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-12-18T22:33:59.122Z","etag":null,"topics":["aleph","archive","datalake","deltalake","followthemoney","lakehouse","openaleph","opensanctions"],"latest_commit_sha":null,"homepage":"https://openaleph.org/docs/lib/ftm-lakehouse","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/openaleph.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":"NOTICE","maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2024-10-04T06:32:03.000Z","updated_at":"2025-12-18T14:35:30.000Z","dependencies_parsed_at":"2025-01-16T18:47:51.196Z","dependency_job_id":"32809661-0b71-499e-81aa-921a6679fd5b","html_url":"https://github.com/openaleph/ftm-lakehouse","commit_stats":null,"previous_names":["investigativedata/leakrfc","dataresearchcenter/leakrfc","openaleph/leakrfc","openaleph/ftm-lakehouse"],"tags_count":7,"template":false,"template_full_name":null,"purl":"pkg:github/openaleph/ftm-lakehouse","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openaleph%2Fftm-lakehouse","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openaleph%2Fftm-lakehouse/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openaleph%2Fftm-lakehouse/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openaleph%2Fftm-lakehouse/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/openaleph","download_url":"https://codeload.github.com/openaleph/ftm-lakehouse/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openaleph%2Fftm-lakehouse/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28390830,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-13T14:36:09.778Z","status":"ssl_error","status_checked_at":"2026-01-13T14:35:19.697Z","response_time":56,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["aleph","archive","datalake","deltalake","followthemoney","lakehouse","openaleph","opensanctions"],"created_at":"2026-01-13T15:52:49.595Z","updated_at":"2026-02-02T12:57:01.519Z","avatar_url":"https://github.com/openaleph.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"[![Docs](https://img.shields.io/badge/docs-live-brightgreen)](https://openaleph.org/docs/lib/ftm-lakehouse)\n[![ftm-lakehouse on pypi](https://img.shields.io/pypi/v/ftm-lakehouse)](https://pypi.org/project/ftm-lakehouse/)\n[![PyPI Downloads](https://static.pepy.tech/badge/ftm-lakehouse/month)](https://pepy.tech/projects/ftm-lakehouse)\n[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/ftm-lakehouse)](https://pypi.org/project/ftm-lakehouse/)\n[![Python test and package](https://github.com/openaleph/ftm-lakehouse/actions/workflows/python.yml/badge.svg)](https://github.com/openaleph/ftm-lakehouse/actions/workflows/python.yml)\n[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit)](https://github.com/pre-commit/pre-commit)\n[![Coverage Status](https://coveralls.io/repos/github/openaleph/ftm-lakehouse/badge.svg?branch=main)](https://coveralls.io/github/openaleph/ftm-lakehouse?branch=main)\n[![AGPLv3+ License](https://img.shields.io/pypi/l/ftm-lakehouse)](./LICENSE)\n[![Pydantic v2](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/pydantic/pydantic/main/docs/badge/v2.json)](https://pydantic.dev)\n\n# ftm-lakehouse\n\n`ftm-lakehouse` provides a _data standard_ and _archive storage_ for leaked data, private and public document collections. The concepts and implementations are originally inspired by [mmmeta](https://github.com/simonwoerpel/mmmeta) and [Aleph's servicelayer archive](https://github.com/alephdata/servicelayer).\n\n`ftm-lakehouse` acts as a multi-tenant storage and retrieval mechanism for structured entity data, documents and their metadata. It provides a high-level interface for generating and sharing document collections and importing them into various search and analysis platforms, such as [_OpenALeph_](https://openaleph.org), [_ICIJ Datashare_](https://datashare.icij.org/) or [_Liquid Investigations_](https://github.com/liquidinvestigations/)\n\n## Installation\n\nRequires python 3.11 or later.\n\n```bash\npip install ftm-lakehouse\n```\n\n## Documentation\n\n[openaleph.org/docs/lib/ftm-lakehouse](https://openaleph.org/docs/lib/ftm-lakehouse)\n\n## Development\n\nThis package is using [poetry](https://python-poetry.org/) for packaging and dependencies management, so first [install it](https://python-poetry.org/docs/#installation).\n\nClone [this repository](https://github.com/openaleph/ftm-lakehouse) to a local destination.\n\nWithin the repo directory, run\n\n    poetry install --with dev\n\nThis installs a few development dependencies, including [pre-commit](https://pre-commit.com/) which needs to be registered:\n\n    poetry run pre-commit install\n\nBefore creating a commit, this checks for correct code formatting (isort, black) and some other useful stuff (see: `.pre-commit-config.yaml`)\n\n### Testing\n\n`ftm-lakehouse` uses [pytest](https://docs.pytest.org/en/stable/) as the testing framework.\n\n    make test\n\n## License and Copyright\n\n`ftm-lakehouse`, (c) 2024 [investigativedata.io](https://investigativedata.io)\n\n`ftm-lakehouse`, (c) 2025 [Data and Research Center – DARC](https://dataresearchcenter.org)\n\n`ftm-lakehouse` is licensed under the AGPLv3 or later license.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fopenaleph%2Fftm-lakehouse","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fopenaleph%2Fftm-lakehouse","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fopenaleph%2Fftm-lakehouse/lists"}