{"id":29132320,"url":"https://github.com/expediagroup/apiary-metastore-docker","last_synced_at":"2026-03-09T14:07:14.847Z","repository":{"id":34132469,"uuid":"129869911","full_name":"ExpediaGroup/apiary-metastore-docker","owner":"ExpediaGroup","description":"Docker image for Apiary Data Lake metastore","archived":false,"fork":false,"pushed_at":"2025-11-26T11:34:29.000Z","size":247,"stargazers_count":5,"open_issues_count":5,"forks_count":9,"subscribers_count":34,"default_branch":"master","last_synced_at":"2025-11-29T09:55:23.328Z","etag":null,"topics":["apiary","docker","hive"],"latest_commit_sha":null,"homepage":"https://github.com/ExpediaGroup/apiary","language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ExpediaGroup.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":"CODE-OF-CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":".github/CODEOWNERS","security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":"NOTICE.txt","maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2018-04-17T08:12:13.000Z","updated_at":"2025-11-26T11:32:43.000Z","dependencies_parsed_at":"2023-10-03T18:54:18.627Z","dependency_job_id":"7119466e-e537-4546-b47f-3fa671464dcb","html_url":"https://github.com/ExpediaGroup/apiary-metastore-docker","commit_stats":null,"previous_names":[],"tags_count":61,"template":false,"template_full_name":null,"purl":"pkg:github/ExpediaGroup/apiary-metastore-docker","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ExpediaGroup%2Fapiary-metastore-docker","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ExpediaGroup%2Fap
iary-metastore-docker/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ExpediaGroup%2Fapiary-metastore-docker/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ExpediaGroup%2Fapiary-metastore-docker/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ExpediaGroup","download_url":"https://codeload.github.com/ExpediaGroup/apiary-metastore-docker/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ExpediaGroup%2Fapiary-metastore-docker/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30297947,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-09T13:46:43.843Z","status":"ssl_error","status_checked_at":"2026-03-09T13:46:42.821Z","response_time":61,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apiary","docker","hive"],"created_at":"2025-06-30T06:39:14.644Z","updated_at":"2026-03-09T14:07:14.839Z","avatar_url":"https://github.com/ExpediaGroup.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n# Overview\n\nFor more information please refer to the main [Apiary](https://github.com/ExpediaGroup/apiary) project page.\n\n## Environment Variables\n| Environment Variable                           | Required | 
Description                                                                                                                                                                                                                                           |\n|------------------------------------------------|----------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n| APIARY_S3_INVENTORY_PREFIX                     | No       | Prefix used by S3 Inventory when creating data in the inventory bucket. Default is `EntireBucketDaily`.                                                                                                                                               |\n| APIARY_S3_INVENTORY_TABLE_FORMAT               | No       | Format of S3 inventory data. Valid options are `ORC`, `Parquet`, or `CSV`. Default is `ORC`.                                                                                                                                                          |\n| APIARY_SYSTEM_SCHEMA                           | No       | Name for internal system database. Default is `apiary_system`.                                                                                                                                                                                        |\n| AWS_REGION                                     | Yes      | AWS region to configure various AWS clients.                                                                                                                                                                                                          |\n| AWS_WEB_IDENTITY_TOKEN_FILE                    | No       | Path of the AWS Web Identity Token File for IRSA/OIDC AWS authentication.                                                             
                                                                                                                |\n| DATANUCLEUS_CONNECTION_POOLING_TYPE            | No       | Type of connection pooling. Valid options are `BoneCP`, `DBCP`, `DBCP2`, `C3P0`, `HikariCP`.                                                                                                                                                          |\n| DATANUCLEUS_CONNECTION_POOL_MAX_POOLSIZE       | No       | Maximum pool size for the connection pool.                                                                                                                                                                                                            |\n| DATANUCLEUS_CONNECTION_POOL_MIN_POOLSIZE       | No       | Minimum pool size for the connection pool.                                                                                                                                                                                                            |\n| DATANUCLEUS_CONNECTION_POOL_INITIAL_POOLSIZE   | No       | Initial pool size for the connection pool (C3P0 only).                                                                                                                                                                                                |\n| DATANUCLEUS_CONNECTION_POOL_MAX_IDLE           | No       | Maximum idle connections for the connection pool.                                                                                                                                                                                                     |\n| DATANUCLEUS_CONNECTION_POOL_MIN_IDLE           | No       | Minimum idle connections for the connection pool.                                                                                                                                                                                                     
|\n| DATANUCLEUS_CONNECTION_POOL_MAX_ACTIVE         | No       | Maximum active connections for the connection pool (DBCP/DBCP2 only).                                                                                                                                                                                 |\n| DATANUCLEUS_CONNECTION_POOL_MAX_WAIT           | No       | Maximum wait time for the connection pool (DBCP/DBCP2 only).                                                                                                                                                                                          |\n| DATANUCLEUS_CONNECTION_POOL_VALIDATION_TIMEOUT | No       | Validation timeout for the connection pool (DBCP/DBCP2/HikariCP only).                                                                                                                                                                                |\n| DATANUCLEUS_CONNECTION_POOL_LEAK_DETECTION_THRESHOLD | No  | Leak detection threshold for the connection pool (HikariCP only).                                                                                                                                                                                     |\n| DATANUCLEUS_CONNECTION_POOL_LEAK_MAX_LIFETIME  | No       | Maximum lifetime for the connection pool (HikariCP only).                                                                                                                                                                                             |\n| DATANUCLEUS_CONNECTION_POOL_AUTO_COMMIT        | No       | Auto commit for the connection pool (HikariCP only).                                                                                                                                                                                                  |\n| DATANUCLEUS_CONNECTION_POOL_IDLE_TIMEOUT       | No       | Idle timeout for the connection pool (HikariCP only).               
                                                                                                                                                                                  |\n| DATANUCLEUS_CONNECTION_POOL_CONNECTION_WAIT_TIMEOUT | No  | Connection wait timeout for the connection pool (HikariCP only).                                                                                                                                                                                      |\n| DATANUCLEUS_CONNECTION_POOL_READ_ONLY          | No       | Read only mode for the connection pool (HikariCP only).                                                                                                                                                                                               |\n| DATANUCLEUS_CONNECTION_POOL_NAME               | No       | Connection pool name (HikariCP only).                                                                                                                                                                                                                 |\n| DATANUCLEUS_CONNECTION_POOL_CATALOG            | No       | Connection pool catalog (HikariCP only).                                                                                                                                                                                                              |\n| DATANUCLEUS_CONNECTION_POOL_REGISTER_MBEANS    | No       | Register MBeans for the connection pool (HikariCP only).                                                                                                                                                                                              |\n| DISALLOW_INCOMPATIBLE_COL_TYPE_CHANGES         | No       | `true`/`false` value for hive.metastore.disallow.incompatible.col.type.changes, default `true`.                                                                                                           
                                            |\n| ENABLE_GLUESYNC                                | No       | Option to turn on GlueSync Hive Metastore listener.                                                                                                                                                                                                   |\n| ENABLE_HIVE_LOCK_HOUSE_KEEPER                  | No       | Option to turn on Hive Metastore Hive Lock House Keeper.                                                                                                                                                                                              |\n| ENABLE_METRICS                                 | No       | Option to enable sending Hive Metastore and JMX metrics to Prometheus.                                                                                                                                                                                |\n| ENABLE_S3_INVENTORY                            | No       | Option to create Hive tables on top of S3 inventory data if enabled in `apiary-data-lake`. Enabled if value is not null/empty.                                                                                                                        |\n| ENABLE_S3_LOGS                                 | No       | Option to create Hive tables on top of S3 access logs data if enabled in `apiary-data-lake`. Enabled if value is not null/empty.                                                                                                                      |\n| EXTERNAL_DATABASE                              | No       | Option to enable external database mode, when specified it disables managing Hive Metastore MySQL database schema.                                                                                                                                    
|\n| GLUE_PREFIX                                    | No       | Prefix added to Glue databases to handle database name collisions when synchronizing multiple Hive Metastores to the Glue catalog.                                                                                                                    |\n| HADOOP_HEAPSIZE                                | No       | Hive Metastore Java process heapsize. Default is `1024`.                                                                                                                                                                                              |\n| HMS_AUTOGATHER_STATS                           | No       | Whether or not to create basic statistics on table/partition creation. Valid values are `true` or `false`. Default is `true`.                                                                                                                         |\n| LIMIT_PARTITION_REQUEST_NUMBER                 | No       | To protect the cluster, this controls how many partitions can be scanned for each partitioned table. The default value `-1` means no limit. The limit on partitions does not affect metadata-only queries.                                            |\n| HIVE_METASTORE_ACCESS_MODE                     | No       | Hive Metastore access mode, applicable values are: readwrite, readonly.                                                                                                                                                                               |\n| HIVE_DB_NAMES                                  | No       | Comma separated list of Hive database names, when specified Hive databases will be created and mapped to corresponding S3 buckets.                                                                                                                    |\n| HIVE_METASTORE_LOG_LEVEL                       | No       | Hive Metastore service Log4j log level. Default is `INFO`.           
                                                                                                                                                                                 |\n| HMS_MIN_THREADS                                | No       | Minimum size of the Hive metastore thread pool. Default is `200`.                                                                                                                                                                                     |\n| HMS_MAX_THREADS                                | No       | Maximum size of the Hive metastore thread pool. Default is `1000`.                                                                                                                                                                                    |\n| INSTANCE_NAME                                  | Yes      | Apiary instance name, will be used as prefix on most AWS resources to allow multiple Apiary instance deployments.                                                                                                                                     |\n| KAFKA_BOOTSTRAP_SERVERS                        | No       | Kafka Bootstrap Servers to enable Kafka Metastore listener and send Metastore events to Kafka.                                                                                                                                                        |\n| KAFKA_CLIENT_ID                                | No       | Kafka label you define that names the Kafka producer.                                                                                                                                                                                                 |\n| KAFKA_COMPRESSION_TYPE                         | No       | Kafka compression type. If none is specified, compression is disabled. Valid values are `gzip`, `lz4`, and `snappy`.                                                                            
                                           |\n| KAFKA_MAX_REQUEST_SIZE                         | No       | The maximum size of a request in bytes. This setting will limit the number of record batches the producer will send in a single request to avoid sending huge requests. This is also effectively a cap on the maximum uncompressed record batch size. Default is `1048576`. |\n| LDAP_BASE                                      | No       | LDAP base DN used to search for user groups.                                                                                                                                                                                                          |\n| LDAP_CA_CERT                                   | No       | Base64 encoded Certificate Authority Bundle to validate LDAP SSL connection.                                                                                                                                                                          |\n| LDAP_SECRET_ARN                                | No       | LDAP bind DN SecretsManager secret ARN.                                                                                                                                                                                                               |\n| LDAP_URL                                       | No       | Active Directory URL to enable group mapping in metastore.                                                                                                                                                                                            |\n| MYSQL_CONNECTION_DRIVER_NAME                   | No       | Hive Metastore MySQL database JDBC connection Driver Name. Default is `com.mysql.jdbc.Driver`.                                                                                                                                                        
|\n| MYSQL_CONNECTION_POOL_SIZE                     | No       | MySQL Connection pool size for Hive Metastore. Default is `10`. See [here](https://github.com/apache/hive/blob/master/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java#L1181) for more info.                                                 |\n| MYSQL_DB_HOST                                  | Yes      | Hive Metastore MySQL database hostname.                                                                                                                                                                                                               |\n| MYSQL_DB_NAME                                  | Yes      | Hive Metastore MySQL database name.                                                                                                                                                                                                                   |\n| MYSQL_SECRET_ARN                               | Yes      | Hive Metastore MySQL SecretsManager secret ARN.                                                                                                                                                                                                       |\n| MYSQL_SECRET_USERNAME_KEY                      | No       | Hive Metastore MySQL SecretsManager secret username key. Default is `username`.                                                                                                                                                                       |\n| MYSQL_TYPE                                     | No       | Hive Metastore MySQL database Type (mariadb, mysql). Default is `mysql`.                                                                                                                                                                              |\n| MYSQL_DRIVER_JAR                               | No       | Hive Metastore MySQL connector JAR location. 
Default is `/usr/share/java/mysql-connector-java.jar`.                                                                                                                                                    |\n| RANGER_AUDIT_DB_URL                            | No       | Ranger audit database JDBC URL.                                                                                                                                                                                                                       |\n| RANGER_AUDIT_SECRET_ARN                        | No       | Ranger audit database secret ARN.                                                                                                                                                                                                                     |\n| RANGER_AUDIT_SOLR_URL                          | No       | Ranger Solr audit URL.                                                                                                                                                                                                                                |\n| RANGER_POLICY_MANAGER_URL                      | No       | Ranger admin URL from where policies will be downloaded.                                                                                                                                                                                              |\n| RANGER_SERVICE_NAME                            | No       | Ranger service name used to configure RangerAuth plugin.                                                                                                                                                                                              
|\n| SNS_ARN                                        | No       | The SNS topic ARN to which metadata updates will be sent.                                                                                                                     |\n\n# Contact\n\n## Mailing List\nIf you would like to ask any questions about or discuss Apiary, please join our mailing list at\n\n  [https://groups.google.com/forum/#!forum/apiary-user](https://groups.google.com/forum/#!forum/apiary-user)\n\n# Legal\nThis project is available under the [Apache 2.0 License](http://www.apache.org/licenses/LICENSE-2.0.html).\n\nCopyright 2018-2019 Expedia, Inc.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fexpediagroup%2Fapiary-metastore-docker","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fexpediagroup%2Fapiary-metastore-docker","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fexpediagroup%2Fapiary-metastore-docker/lists"}