{"id":26032188,"url":"https://github.com/apache/orc-format","last_synced_at":"2025-04-09T17:12:30.385Z","repository":{"id":210949639,"uuid":"727611783","full_name":"apache/orc-format","owner":"apache","description":"Apache ORC - the smallest, fastest columnar storage for Hadoop workloads","archived":false,"fork":false,"pushed_at":"2025-04-09T03:05:33.000Z","size":513,"stargazers_count":11,"open_issues_count":4,"forks_count":7,"subscribers_count":13,"default_branch":"main","last_synced_at":"2025-04-09T17:11:49.454Z","etag":null,"topics":["apache","big-data","cpp","java","orc"],"latest_commit_sha":null,"homepage":"https://orc.apache.org/","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/apache.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-12-05T08:07:14.000Z","updated_at":"2025-04-09T03:05:39.000Z","dependencies_parsed_at":"2023-12-05T18:31:47.017Z","dependency_job_id":"f109188c-b6d5-4c43-b221-ba9820f195fa","html_url":"https://github.com/apache/orc-format","commit_stats":null,"previous_names":["apache/orc-format"],"tags_count":4,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Forc-format","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Forc-format/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Forc-format/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Forc-format/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/apache","download_url":"https://codeload.github.com/apache/orc-format/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248074960,"owners_count":21043490,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache","big-data","cpp","java","orc"],"created_at":"2025-03-06T21:28:25.131Z","updated_at":"2025-04-09T17:12:30.358Z","avatar_url":"https://github.com/apache.png","language":null,"readme":"\u003c!--\n  - Licensed to the Apache Software Foundation (ASF) under one\n  - or more contributor license agreements.  See the NOTICE file\n  - distributed with this work for additional information\n  - regarding copyright ownership.  The ASF licenses this file\n  - to you under the Apache License, Version 2.0 (the\n  - \"License\"); you may not use this file except in compliance\n  - with the License.  You may obtain a copy of the License at\n  -\n  -   http://www.apache.org/licenses/LICENSE-2.0\n  -\n  - Unless required by applicable law or agreed to in writing,\n  - software distributed under the License is distributed on an\n  - \"AS IS\" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY\n  - KIND, either express or implied.  See the License for the\n  - specific language governing permissions and limitations\n  - under the License.\n  --\u003e\n\n# [Apache ORC](https://orc.apache.org/)\n\nORC is a self-describing type-aware columnar file format designed for\nHadoop workloads. It is optimized for large streaming reads, but with\nintegrated support for finding required rows quickly. Storing data in\na columnar format lets the reader read, decompress, and process only\nthe values that are required for the current query. Because ORC files\nare type-aware, the writer chooses the most appropriate encoding for\nthe type and builds an internal index as the file is written.\nPredicate pushdown uses those indexes to determine which stripes in a\nfile need to be read for a particular query and the row indexes can\nnarrow the search to a particular set of 10,000 rows. ORC supports the\ncomplete set of types in Hive, including the complex types: structs,\nlists, maps, and unions.\n\n## ORC Format\n\nThis project includes ORC specifications and the protobuf definition.\n`Apache ORC Format 1.0.0` is designed to be used for `Apache ORC 2.0+`.\n\nReleases:\n* Maven Central: \u003ca href=\"https://search.maven.org/#search%7Cga%7C1%7Cg%3A%22org.apache.orc%22\"\u003e![Maven Central](https://maven-badges.herokuapp.com/maven-central/org.apache.orc/orc/badge.svg)\u003c/a\u003e\n* Downloads: \u003ca href=\"https://orc.apache.org/downloads\"\u003eApache ORC downloads\u003c/a\u003e\n* Release tags: \u003ca href=\"https://github.com/apache/orc-format/releases\"\u003eApache ORC Format releases\u003c/a\u003e\n* Plan: \u003ca href=\"https://github.com/apache/orc-format/milestones\"\u003eApache ORC Format future release plan\u003c/a\u003e\n\nThe current build status:\n* Main branch \u003ca href=\"https://github.com/apache/orc-format/actions/workflows/build_and_test.yml?query=branch%3Amain\"\u003e\n  ![main build status](https://github.com/apache/orc-format/actions/workflows/build_and_test.yml/badge.svg?branch=main)\u003c/a\u003e\n\nBug tracking: \u003ca href=\"https://github.com/apache/orc-format/issues\"\u003eApache ORC Format Issues\u003c/a\u003e\n\n## Building\n\n```\n./mvnw install\n```\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapache%2Forc-format","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fapache%2Forc-format","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapache%2Forc-format/lists"}