{"id":32122625,"url":"https://github.com/data-lake-visualizer/vscode-parquet-visualizer","last_synced_at":"2026-01-05T13:14:08.713Z","repository":{"id":187272254,"uuid":"676179940","full_name":"Data-Lake-Visualizer/vscode-parquet-visualizer","owner":"Data-Lake-Visualizer","description":"VScode extension for SQL querying and visualizing parquet- and CSV files ","archived":false,"fork":false,"pushed_at":"2025-10-05T18:57:35.000Z","size":25577,"stargazers_count":29,"open_issues_count":30,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-10-20T20:56:37.832Z","etag":null,"topics":["csv","csv-export","csv-files","explorer","parquet","parquet-files","parquet-viewer","query","viewer","visualizer","vscode-extension"],"latest_commit_sha":null,"homepage":"https://marketplace.visualstudio.com/items?itemName=lucien-martijn.parquet-visualizer","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Data-Lake-Visualizer.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2023-08-08T15:56:30.000Z","updated_at":"2025-10-10T10:07:13.000Z","dependencies_parsed_at":"2024-06-01T13:29:26.745Z","dependency_job_id":"ec645c81-e6ee-488c-974c-99457577a7a7","html_url":"https://github.com/Data-Lake-Visualizer/vscode-parquet-visualizer","commit_stats":null,"previous_names":["luusluus/parquet-visualizer-vscode","data-lake-visualizer/vscode-parquet-visualizer","luusluus/vscode-parquet-visualizer"],"tags
_count":51,"template":false,"template_full_name":null,"purl":"pkg:github/Data-Lake-Visualizer/vscode-parquet-visualizer","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Data-Lake-Visualizer%2Fvscode-parquet-visualizer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Data-Lake-Visualizer%2Fvscode-parquet-visualizer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Data-Lake-Visualizer%2Fvscode-parquet-visualizer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Data-Lake-Visualizer%2Fvscode-parquet-visualizer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Data-Lake-Visualizer","download_url":"https://codeload.github.com/Data-Lake-Visualizer/vscode-parquet-visualizer/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Data-Lake-Visualizer%2Fvscode-parquet-visualizer/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":280159254,"owners_count":26282555,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-20T02:00:06.978Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["csv","csv-export","csv-files","explorer","parquet","parquet-files","parquet-viewer","query","viewer","visualizer","vscode-extension"],"created_at":"2025-10-20T20:56:50.192
Z","updated_at":"2025-10-20T20:56:51.255Z","avatar_url":"https://github.com/Data-Lake-Visualizer.png","language":"TypeScript","readme":"# Parquet Visualizer\n\n### Inspect and query very large parquet files fast\n![sql](media/sql.gif)\n\n## What's Parquet Visualizer?\nParquet Visualizer is a tool that lets you query very large parquet files with SQL and inspect them quickly and easily.\n\n## What can you do with Parquet Visualizer?\n### Inspect Data\nYou can inspect the binary data of parquet files in a human-readable tabular format with pagination. You can also change the page size.\n\n![data](media/inspect_data.gif)\n\n### Inspect Struct Value\nYou can inspect complex struct values by clicking on the cell, which shows a popup containing the value of the struct.\n\n![complex](media/inspect_complex.gif)\n\n### Run SQL Queries on a Parquet File\nYou can query a parquet file with DuckDB SQL. You can also search within the result, paginate the result or change the page size.\n\n![sql](media/sql.gif)\n\n### Advanced Autocomplete in Query Editor\nAs you type in the editor, an autocomplete box with column suggestions appears. 
This makes it much easier to write queries, because you can select the suggested columns.\n\n### Search within rows of Query Result\nType free text in the search box to find specific values in the query result page.\n\n### Export Query Result as CSV, Excel, JSON, ndJSON or Parquet to Disk\nBy clicking on the export button in the Query tab, you can save your query result to a specified location on disk in CSV, Excel, JSON, ndJSON or Parquet format.\n\n### Copy Query Result to Clipboard\nBy clicking on the copy button in the Query tab, you can copy the query result page data to the clipboard.\n\n### Inspect Schema\nYou can inspect the schema of the parquet file by clicking on the Schema tab, in which you can paginate if the file has many columns.\n\nYou can also inspect the struct type by clicking on the cell, which will show a popup containing the struct data type.\n\n![schema](media/inspect_schema.gif)\n\n### Inspect Metadata\nYou can inspect the metadata in tabular format by clicking on the Metadata tab.\n\n![metadata](media/inspect_metadata.gif)\n\n### Color Theme\nThe theme of the extension (dark or light) is based on your VS Code Color theme setting. If the color theme is light, the extension will load its light theme. When you change your theme settings, all active documents will change theme automatically.\n\n## Configuration\nThe following configuration options are available:\n\n|name|default|description|\n|----|-------|-----------|\n|`parquet-visualizer.backend`|`duckdb`| Backend for reading the parquet file. Options: `duckdb`, `parquet-wasm`|\n|`parquet-visualizer.defaultPageSizes`|`[\"20\", \"50\", \"100\", \"500\", \"all\"]`|Set the default page sizes for the Data and Query tabs.|\n|`parquet-visualizer.defaultQuery`|`SELECT *\\r\\nFROM data\\r\\nLIMIT 1000;`|Default SQL query for parquet file. The table name `data` must remain unchanged.|\n|`parquet-visualizer.RunQueryKeyBinding`|`Ctrl-Enter`|Default Key Binding for running queries. 
If Ctrl is written, it is translated to Command on Mac and vice versa. E.g., Ctrl-E is synonymous with Command-E.|\n|`parquet-visualizer.dateTimeFormat`|`ISO8601`|Set datetime format for columns of timestamp type. Defaults to ISO8601. You can set a custom format like `YYYY-MM-DD HH:mm:ss.SSS Z`. Find rules for formatting [here](https://www.npmjs.com/package/date-and-time#formatdateobj-arg-utc).|\n|`parquet-visualizer.outputDateTimeFormatInUTC`|`true`|Whether to output datetimes for timestamp columns in UTC (`true`) or in local time (`false`).|\n\n\n## Parquet backends\nThis extension supports two backends for visualizing and querying parquet files.\n\n### DuckDB\n[DuckDB](https://duckdb.org/docs/index) is the primary backend used for uncompressed and compressed parquet files (except those using the BROTLI compression codec).\n\n### Parquet-wasm\n[parquet-wasm](https://kylebarron.dev/parquet-wasm) is a backend that uses a Rust implementation of arrow and parquet. It supports all compression codecs except LZ4.\n\n## Frontend\nThe tables of the frontend are powered by [tabulator](https://tabulator.info/).\n\nThe query editor of the frontend is powered by [ace](https://github.com/ajaxorg/ace).\n\n\n## Release Notes\nSee the [CHANGELOG.md](CHANGELOG.md).\n\n## Contributors\nThe following people have contributed time and effort to improve Parquet Visualizer:\n- [Darryl Thompson](https://github.com/darrylthom): Testing, Design\n\n## Telemetry\nTo improve the quality of Parquet Visualizer, the extension collects analytics such as:\n- Extension load times\n- File parsing success or failure\n- Usage frequency of features such as the Data tab or Query tab\n\nOur telemetry implementation [respects](https://code.visualstudio.com/api/extension-guides/telemetry#dos-and-donts) the VS Code `isTelemetryEnabled` and `onDidChangeTelemetryEnabled` API, which allows you to disable telemetry dynamically, after which no telemetry is sent. 
\n\nYou can disable it via the settings by following the instructions [here](https://code.visualstudio.com/docs/supporting/FAQ#_how-to-disable-telemetry-reporting).\n\nYou can view all the possible telemetry events that are sent by following instructions [here](https://code.visualstudio.com/docs/getstarted/telemetry#_viewing-all-telemetry-events).\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdata-lake-visualizer%2Fvscode-parquet-visualizer","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdata-lake-visualizer%2Fvscode-parquet-visualizer","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdata-lake-visualizer%2Fvscode-parquet-visualizer/lists"}