https://github.com/linkedin/openhouse
Open Control Plane for Tables in Data Lakehouse
https://github.com/linkedin/openhouse
big-data catalog datalake datalakehouse declarative iceberg management tables
Last synced: 3 months ago
JSON representation
Open Control Plane for Tables in Data Lakehouse
- Host: GitHub
- URL: https://github.com/linkedin/openhouse
- Owner: linkedin
- License: bsd-2-clause
- Created: 2024-02-13T00:52:30.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-01-21T20:28:01.000Z (3 months ago)
- Last Synced: 2025-01-21T20:32:59.588Z (3 months ago)
- Topics: big-data, catalog, datalake, datalakehouse, declarative, iceberg, management, tables
- Language: Java
- Homepage: https://www.openhousedb.org/
- Size: 6.15 MB
- Stars: 321
- Watchers: 15
- Forks: 52
- Open Issues: 24
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
Awesome Lists containing this project
- awesome-datalake - OpenHouse - Open Control Plane for Tables in Data Lakehouse. (Lakehouse)
- awesome-datalake - OpenHouse - Open Control Plane for Tables in Data Lakehouse. (Lakehouse)
README
![]()
Control Plane for Tables in Open Data Lakehouses
OpenHouse is an open source control plane designed for efficient management of tables within open data lakehouse
deployments. The control plane comprises a **declarative catalog** and a suite of **data services**. Users can
seamlessly define Tables, their schemas, and associated metadata declaratively within the catalog.
OpenHouse reconciles the observed state of Tables with the desired state by orchestrating various
data services.
![]()
## Getting Started
### Prerequisites
For building and running locally in [Docker Compose](SETUP.md), you would need the following:
- [Java](https://www.oracle.com/java/technologies/downloads/)
- OpenHouse is currently built with Java 17.
- Set the `JAVA_HOME` environment variable to the location of your JDK17.
- [Docker](https://www.docker.com/)
- [Docker Compose](https://docs.docker.com/compose/)
- [Python3](https://www.python.org/downloads/)For deploying OpenHouse to [Kubernetes](DEPLOY.md), you would need the following:
- [Helm](https://helm.sh/docs/intro/install/)
- [Kubernetes](https://kubernetes.io/docs/setup/)### Building OpenHouse
To build OpenHouse, you can use the following command:
```bash
./gradlew build
```### Running OpenHouse with Docker Compose
To run OpenHouse, we recommend the [SETUP](SETUP.md) guide. You would bring up all the OpenHouse services, MySQL,
Prometheus, Apache Spark and HDFS.### Deploying OpenHouse to Kubernetes
To deploy OpenHouse to Kubernetes, you can use the [DEPLOY](DEPLOY.md) guide. You would build the container images for
all the OpenHouse services, and deploy them to a Kubernetes cluster using Helm.### Compability Matrix
OpenHouse is built with the following versions of the open-source projects:
| Project | Version |
| --- | --- |
| [Apache Iceberg](https://iceberg.apache.org/releases/#120-release) | 1.2.0 |
| [Apache Spark](https://spark.apache.org/releases/) | 3.1.2 |
| [Apache Livy](https://livy.apache.org/) | 0.7.0-incubating |
| [Apache Hadoop Client](https://hadoop.apache.org/releases.html) | 2.10.0 |
| [Springboot Framework](https://spring.io/projects/spring-boot) | 2.6.6 |
| [OpenAPI](https://swagger.io/specification/) | 3.0.3 |## Contributing
We welcome contributions to OpenHouse. To get involved:
- Join [OpenHouse Slack](https://join.slack.com/t/openhouse-bap9266/shared_invite/zt-2bsi0t8pi-wUOeDvQr8j8d5yl3X8WQJQ)
- Open [Github Issue](https://github.com/linkedin/openhouse/issues) for the feature or bug you want to collaborate onPlease refer to the [CONTRIBUTING](CONTRIBUTING.md) guide for more details.
To get started on the high-level architecture, please refer to the [ARCHITECTURE](ARCHITECTURE.md) guide.