{"id":19877551,"url":"https://github.com/yuokada/embulk-output-orc","last_synced_at":"2025-10-03T17:41:03.605Z","repository":{"id":58146819,"uuid":"95798749","full_name":"yuokada/embulk-output-orc","owner":"yuokada","description":null,"archived":false,"fork":false,"pushed_at":"2025-08-26T01:27:21.000Z","size":398,"stargazers_count":4,"open_issues_count":14,"forks_count":4,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-08-30T14:14:58.383Z","etag":null,"topics":["embulk","embulk-output-plugin","embulk-plugin","java","orc"],"latest_commit_sha":null,"homepage":null,"language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/yuokada.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":".github/CODEOWNERS","security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2017-06-29T16:44:41.000Z","updated_at":"2025-08-26T01:25:51.000Z","dependencies_parsed_at":"2024-03-02T04:22:17.126Z","dependency_job_id":"30d0dd48-bcd4-4557-9189-b55d53fc8d93","html_url":"https://github.com/yuokada/embulk-output-orc","commit_stats":{"total_commits":108,"total_committers":5,"mean_commits":21.6,"dds":"0.20370370370370372","last_synced_commit":"bbad56e8a09db67f9842163d7ecaf9d40b7cfd36"},"previous_names":[],"tags_count":7,"template":false,"template_full_name":null,"purl":"pkg:github/yuokada/embulk-output-orc","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yuokada%2Fembulk-output-orc","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yuokada%2Fembulk-output-orc/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yuokada%2Fembulk-output-orc/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yuokada%2Fembulk-output-orc/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/yuokada","download_url":"https://codeload.github.com/yuokada/embulk-output-orc/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yuokada%2Fembulk-output-orc/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278048848,"owners_count":25921309,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-02T02:00:08.890Z","response_time":67,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["embulk","embulk-output-plugin","embulk-plugin","java","orc"],"created_at":"2024-11-12T16:37:45.071Z","updated_at":"2025-10-03T17:41:03.573Z","avatar_url":"https://github.com/yuokada.png","language":"Scala","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Orc output plugin for Embulk\n\n[![Build Status](https://github.com/yuokada/embulk-output-orc/workflows/Java%20CI/badge.svg)](https://github.com/yuokada/embulk-output-orc/actions)\n[![Gem Version](https://badge.fury.io/rb/embulk-output-orc.svg)](https://badge.fury.io/rb/embulk-output-orc)\n\n## Overview\n\n* **Plugin type**: output\n* **Load all or nothing**: no\n* **Resume supported**: no\n* **Cleanup supported**: yes\n\n## Configuration\n\n- **path_prefix**: A prefix of output path. (string, required)\n  - support: `file`, `s3`, `s3n` and `s3a`.\n- **file_ext**: An extension of output file. (string, default: `.orc`)\n- **sequence_format**: (string, default: `.%03d`)\n- **buffer_size**: Set the ORC buffer size (integer, default: `262144(256KB)` )\n- **strip_size**: Set the ORC strip size (integer,  default: `67108864(64MB)` )\n- **block_size**: Set the ORC block size (integer, default: `268435456(256MB)`)\n- **compression_kind**: description (string, default: `'ZLIB'`)\n    - `NONE`, `ZLIB`, `SNAPPY`, `LZO`, `LZ4`\n- **overwrite**: Overwrite if output files already exist. (boolean, default: `false`)\n    - Support: `LocalFileSystem`, `S3(s3, s3a, s3n)`\n- **default_from_timezone** Time zone of timestamp columns. This can be overwritten for each column using column_options (DateTimeZone, default: `UTC`)\n\n- **auth_method**: name of mechanism to authenticate requests (basic, env, instance, profile, properties, anonymous, or session. default: basic)  \n  see: https://github.com/embulk/embulk-input-s3#configuration\n\n    - `env`, `basic`, `profile`, `default`, `session`, `anonymous`, `properties`\n    \n\n## Example\n\n```yaml\nout:\n  type: orc\n  path_prefix: \"/tmp/output\"\n  compression_kind: ZLIB\n  overwrite:   true\n```\n\n## ChangeLog\n\n### ver 0.3.4\n\n- Bump `orc` library to `1.5.4`\n- bugfix\n  - https://github.com/yuokada/embulk-output-orc/pull/17\n\n### ver 0.3.3\n\n- bugfix\n- Bump `orc` library to `1.4.4`\n\n### ver 0.3.2\n\n- Update `orc` libraries to `1.4.3`\n\n### ver 0.3.0\n\n- Change default value : (block_size, buffer_size, strip_size)\n\n    - default value is Hive's default value.  \n      (see: https://orc.apache.org/docs/hive-config.html)\n\n### ver 0.2.0\n\n- support: output to s3\n\n    - `s3n`, `s3a` protocol\n\n### ver 0.1.0\n\n- initial release\n\n## Build\n\n```\n$ ./gradlew gem  # -t to watch change of files and rebuild continuously\n```\n\n## SonarQube\n\n[embulk-output-orc](https://sonarcloud.io/dashboard?id=embulk-output-orc \"embulk-output-orc - Yukihiro Okada\")\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fyuokada%2Fembulk-output-orc","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fyuokada%2Fembulk-output-orc","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fyuokada%2Fembulk-output-orc/lists"}