{"id":50350500,"url":"https://github.com/apache/datafusion-java","last_synced_at":"2026-06-15T12:01:40.633Z","repository":{"id":357495908,"uuid":"1237105117","full_name":"apache/datafusion-java","owner":"apache","description":"Java bindings for Apache DataFusion","archived":false,"fork":false,"pushed_at":"2026-06-10T19:48:39.000Z","size":3138,"stargazers_count":25,"open_issues_count":13,"forks_count":12,"subscribers_count":1,"default_branch":"main","last_synced_at":"2026-06-10T21:16:34.733Z","etag":null,"topics":["apache","arrow","datafusion","java","jni","jvm","query-engine","sql"],"latest_commit_sha":null,"homepage":"https://datafusion.apache.org/","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/apache.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":"NOTICE.txt","maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-05-12T22:03:12.000Z","updated_at":"2026-06-10T19:48:15.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/apache/datafusion-java","commit_stats":null,"previous_names":["apache/datafusion-java"],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/apache/datafusion-java","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fdatafusion-java","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fdatafusion-java/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fdatafusion-java
/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fdatafusion-java/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/apache","download_url":"https://codeload.github.com/apache/datafusion-java/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/apache%2Fdatafusion-java/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34361403,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-15T02:00:07.085Z","response_time":63,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache","arrow","datafusion","java","jni","jvm","query-engine","sql"],"created_at":"2026-05-29T21:00:23.920Z","updated_at":"2026-06-15T12:01:40.626Z","avatar_url":"https://github.com/apache.png","language":"Java","funding_links":[],"categories":["Big Data"],"sub_categories":["Microservice Frameworks"],"readme":"# Apache DataFusion Java\n\nJava bindings for [Apache DataFusion]. Queries run in native Rust and results\nreturn to the JVM as [Apache Arrow] batches via the Arrow C Data Interface.\n\n[Apache DataFusion]: https://datafusion.apache.org/\n[Apache Arrow]: https://arrow.apache.org/\n\n\u003e Early development: the API will change between releases. 
Bug reports\n\u003e and contributions welcome.\n\n## Install\n\nReleased to [Maven Central](https://central.sonatype.com/artifact/org.apache.datafusion/datafusion-java).\nThe JAR bundles the native library for Linux and macOS on x86_64 and\naarch64. Windows users need to build from source.\n\nMaven:\n\n```xml\n\u003cdependency\u003e\n    \u003cgroupId\u003eorg.apache.datafusion\u003c/groupId\u003e\n    \u003cartifactId\u003edatafusion-java\u003c/artifactId\u003e\n    \u003cversion\u003e0.1.0\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\nGradle:\n\n```kotlin\nimplementation(\"org.apache.datafusion:datafusion-java:0.1.0\")\n```\n\nArrow needs `--add-opens=java.base/java.nio=ALL-UNNAMED` on the JVM\ncommand line. See the [installation guide](docs/source/user-guide/installation.md)\nfor details and for building from source.\n\n## Quickstart\n\n```java\nimport org.apache.arrow.memory.RootAllocator;\nimport org.apache.arrow.vector.ipc.ArrowReader;\nimport org.apache.datafusion.DataFrame;\nimport org.apache.datafusion.SessionContext;\n\ntry (var allocator = new RootAllocator();\n     var ctx = new SessionContext()) {\n\n    ctx.registerParquet(\"orders\", \"/path/to/orders.parquet\");\n\n    try (DataFrame df = ctx.sql(\n            \"SELECT o_orderpriority, COUNT(*) AS n \" +\n            \"FROM orders GROUP BY o_orderpriority\");\n         ArrowReader reader = df.collect(allocator)) {\n        while (reader.loadNextBatch()) {\n            var batch = reader.getVectorSchemaRoot();\n            // ...\n        }\n    }\n}\n```\n\n`SessionContext` and `DataFrame` are `AutoCloseable` and not thread-safe.\n\n## Documentation\n\nThe full documentation lives under [`docs/source/`](docs/source/index.md)\nand is built with Sphinx (see [`docs/README.md`](docs/README.md) for the\nbuild steps):\n\n- [User guide](docs/source/user-guide/index.md) — installation, the\n  DataFrame and SQL APIs, Parquet ingestion.\n- [Contributor guide](docs/source/contributor-guide/index.md) — 
build,\n  test, code style, and how to bump the DataFusion version.\n\n## Requirements\n\nJDK 17+. Building from source: see\n[`docs/source/contributor-guide/development.md`](docs/source/contributor-guide/development.md).\n\n## Contributing\n\nOpen an issue to discuss non-trivial changes before sending a PR. See the\n[contributor guide](docs/source/contributor-guide/index.md).\n\n## License\n\nApache License 2.0. See [LICENSE.txt](LICENSE.txt) and [NOTICE.txt](NOTICE.txt).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapache%2Fdatafusion-java","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fapache%2Fdatafusion-java","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fapache%2Fdatafusion-java/lists"}