Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Netflix/metacat
https://github.com/Netflix/metacat
Last synced: 9 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/Netflix/metacat
- Owner: Netflix
- License: apache-2.0
- Created: 2016-03-19T20:06:34.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2024-10-28T01:32:38.000Z (12 days ago)
- Last Synced: 2024-10-29T17:55:29.767Z (10 days ago)
- Language: Java
- Size: 6.71 MB
- Stars: 1,608
- Watchers: 401
- Forks: 280
- Open Issues: 51
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - Netflix/metacat - (Java)
- awesome-mlops - Metacat - Unified metadata exploration API service for Hive, RDS, Teradata, Redshift, S3 and Cassandra. (Data Catalog)
- awesome-datalake - Metacat - Metacat is a unified metadata exploration API service. You can explore Hive, RDS, Teradata, Redshift, S3 and Cassandra. (Data Catalog)
- awesome-datalake - Metacat - Metacat is a unified metadata exploration API service. You can explore Hive, RDS, Teradata, Redshift, S3 and Cassandra. (Data Catalog)
- awesome-starred - Netflix/metacat - (others)
- awesome-production-machine-learning - Metacat - Metacat is a unified metadata exploration API service. Metacat focusses on solving these problems: 1) federated views of metadata systems; 2) arbitrary metadata storage about data sets; 3) metadata discovery. (Metadata Management)
- awesome-dataops - Metacat - Provides you information about what data you have, where it resides and how to process it. (Metadata Service / Vector Database)
README
# Metacat
[![Download](https://api.bintray.com/packages/netflixoss/maven/metacat/images/download.svg)](https://bintray.com/netflixoss/maven/metacat/_latestVersion)
[![License](https://img.shields.io/github/license/Netflix/metacat.svg)](http://www.apache.org/licenses/LICENSE-2.0)
[![Issues](https://img.shields.io/github/issues/Netflix/metacat.svg)](https://github.com/Netflix/metacat/issues)
[![NetflixOSS Lifecycle](https://img.shields.io/osslifecycle/Netflix/metacat.svg)]()## Introduction
Metacat is a unified metadata exploration API service. You can explore Hive, RDS, Teradata, Redshift, S3 and Cassandra.
Metacat provides you information about what data you have, where it resides and how to process it. Metadata in the end
is really data about the data. So the primary purpose of Metacat is to give a place to describe the data so that we
could do more useful things with it.Metacat focusses on solving these three problems:
* Federate views of metadata systems.
* Allow arbitrary metadata storage about data sets.
* Metadata discovery## Documentation
TODO
## Releases
[Releases](https://github.com/Netflix/metacat/releases/)
## Builds
Metacat builds are run on Travis CI [here](https://travis-ci.com/Netflix/metacat).
[![Build Status](https://travis-ci.com/Netflix/metacat.svg?branch=master)](https://travis-ci.com/Netflix/metacat)## Getting Started
```
git clone [email protected]:Netflix/metacat.git
cd metacat
./gradlew clean build
```Once the build is completed, the metacat WAR file is generated under `metacat-war/build/libs` directory. Metacat needs
two basic configurations:* `metacat.plugin.config.location`: Path to the directory containing the catalog configuration. Please look at
catalog [samples](https://github.com/Netflix/metacat/tree/master/metacat-functional-tests/metacat-test-cluster/etc-metacat/catalog) used for functional testing.
* `metacat.usermetadata.config.location`: Path to the configuration file containing the connection properties to store
user metadata. Please look at this [sample](https://github.com/Netflix/metacat/blob/master/metacat-functional-tests/metacat-test-cluster/etc-metacat/usermetadata.properties).### Running Locally
Take the build WAR in `metacat-war/build/libs` and deploy it to an existing Tomcat as `ROOT.war`.
The REST API can be accessed @ [http://localhost:8080/mds/v1/catalog](http://localhost:8080/mds/v1/catalog)
Swagger API documentation can be accessed @ [http://localhost:8080/swagger-ui/index.html](http://localhost:8080/swagger-ui/index.html)
### Docker Compose Example
**Pre-requisite: Docker compose is installed**
To start a self contained Metacat environment with some sample catalogs run the command below.
This will start a `docker compose` cluster containing a Metacat container, a Hive Metastore Container, a Cassandra
container and a PostgreSQL container.```
./gradlew metacatPorts
```* `metacatPorts` - Prints out what exposed ports are mapped to the internal container ports.
Look for the mapped port (`MAPPED_PORT`) to port 8080.REST API can be accessed @ `http://localhost:/mds/v1/catalog`
Swagger API documentation can be accessed @ `http://localhost:/swagger-ui/index.html`
To stop the docker compose cluster:
```
./gradlew stopMetacatCluster
```