{"id":27624462,"url":"https://github.com/gaborcsardi/nanoparquet-cli","last_synced_at":"2026-02-11T22:13:24.894Z","repository":{"id":267770376,"uuid":"902300002","full_name":"gaborcsardi/nanoparquet-cli","owner":"gaborcsardi","description":"Command line Docker app to query and manipulate Parquet files","archived":false,"fork":false,"pushed_at":"2024-12-12T22:13:15.000Z","size":14,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-13T20:07:35.735Z","etag":null,"topics":["cli","docker","parquet-files"],"latest_commit_sha":null,"homepage":"","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gaborcsardi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-12-12T09:51:03.000Z","updated_at":"2024-12-16T16:31:06.000Z","dependencies_parsed_at":null,"dependency_job_id":"b2dc65f2-b5ee-4a9b-ab8b-8cf59592741c","html_url":"https://github.com/gaborcsardi/nanoparquet-cli","commit_stats":null,"previous_names":["gaborcsardi/nanoparquet-cli"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/gaborcsardi/nanoparquet-cli","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gaborcsardi%2Fnanoparquet-cli","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gaborcsardi%2Fnanoparquet-cli/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gaborcsardi%2Fnanoparquet-cli/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gaborcsardi%2Fnanoparquet-cli/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gaborcsardi","download_url":"https://codeload.github.com/gaborcsardi/nanoparquet-cli/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gaborcsardi%2Fnanoparquet-cli/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265926920,"owners_count":23850886,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cli","docker","parquet-files"],"created_at":"2025-04-23T11:47:19.632Z","updated_at":"2026-02-11T22:13:19.850Z","avatar_url":"https://github.com/gaborcsardi.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Parquet CLI via Docker and [nanoparquet](https://github.com/r-lib/nanoparquet)\n\n\u003e [!WARNING]\n\u003e This project is experimental.\n\n## Features\n\n* Supports all Parquet files that [https://github.com/r-lib/nanoparquet](\n  nanoparquet) does.\n* Small size: the Docker image is 34MB compressed.\n\n## Requirements\n\n* Docker, on Windows, macOS or Linux.\n\n## Installation\n\nDownload the [`nanoparquet.cmd`](./nanoparquet.cmd) file and save it into\na place that is on your PATH.\n\n### Linux and macOS\n\nYou could put the script in `/usr/local/bin` and name it `nanoparquet`.\nMake sure that it is executable.\n\n```\ncurl -L -o /usr/local/bin/nanoparquet \\\n  https://raw.githubusercontent.com/gaborcsardi/nanoparquet-cli/refs/heads/main/nanoparquet.cmd\nchmod +x /usr/local/bin/nanoparquet\n```\n\n### Windows\n\nThere is no standard place for user-installed scripts on Windows, so\nit is up to you where to save `nanoparquet.cmd`. The example below saves\nit into `tools` within the user's home directory. Make sure you add this\ndirectory to the `PATH`.\n\n```\nInvoke-WebRequest https://raw.githubusercontent.com/gaborcsardi/nanoparquet-cli/refs/heads/main/nanoparquet.cmd -OutFile ~/tools/nanoparquet.cmd\n```\n\n## Usage\n\nCall the script without arguments to see the help screen:\n```\n$ nanoparquet\nError:\nUsage:\n  nanoparquet ( --help | --version )\n  nanoparquet info \u003cparquet-file\u003e\n  nanoparquet schema \u003cparquet-file\u003e\n  nanoparquet export \u003cparquet-file\u003e [ \u003ccsv-file\u003e ]\n\nCommands:\n  info    Show information about a Parquet file.\n  schema  Show the schema of tha Parquet file.\n  export  Export a Parquet file to a CSV file.\n\nArguments:\n  \u003cparquet-file\u003e  Parquet file.\n  \u003ccsv-file\u003e      CSV file. Uses standard output if missing.\n\nOptions:\n  --help     Print this help message.\n  --version  Print nanoparquet version.\nExecution halted\n```\n\n### Input and output files\n\nThe nanoparquet script bind mounts the current directory into the\nnanoparquet container. So you can call nanoparquet on any file that is\nanywhere inside the current working directory.\n\nOutput files work the same way: they must reside somewhere within the\nworking directory. For example:\n\n```\nnanoparquet export data/2024-01.parquet data/2024-01.csv\n```\n\nYou can use backward or forward slashes as path separators on Windows.\nPaths will always use forward slashed in nanoparquet output.\n\n## **:blue_book:** Licenses\n\n### GNU R\n\nGPL-2 or GPL-3 © The R Foundation\n\n### The tools in this repo\n\nMIT © [Posit Software, PBC](https://posit.co)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgaborcsardi%2Fnanoparquet-cli","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgaborcsardi%2Fnanoparquet-cli","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgaborcsardi%2Fnanoparquet-cli/lists"}