Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/kamiazya/web-csv-toolbox
๐A CSV Toolbox utilizing Web Standard APIs.
https://github.com/kamiazya/web-csv-toolbox
csv web-streams
Last synced: 2 months ago
JSON representation
๐A CSV Toolbox utilizing Web Standard APIs.
- Host: GitHub
- URL: https://github.com/kamiazya/web-csv-toolbox
- Owner: kamiazya
- License: mit
- Created: 2023-11-23T00:53:52.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-09-14T16:20:48.000Z (4 months ago)
- Last Synced: 2024-09-15T19:42:45.365Z (4 months ago)
- Topics: csv, web-streams
- Language: TypeScript
- Homepage: https://kamiazya.github.io/web-csv-toolbox/
- Size: 2.04 MB
- Stars: 7
- Watchers: 2
- Forks: 4
- Open Issues: 23
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
Awesome Lists containing this project
README
[![npm version](https://badge.fury.io/js/web-csv-toolbox.svg)](https://badge.fury.io/js/web-csv-toolbox)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg)](http://makeapullrequest.com)
![node version](https://img.shields.io/node/v/web-csv-toolbox)
[![FOSSA Status](https://app.fossa.com/api/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox.svg?type=shield)](https://app.fossa.com/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox?ref=badge_shield)![npm package minimized gzipped size](https://img.shields.io/bundlejs/size/web-csv-toolbox)
![GitHub code size in bytes](https://img.shields.io/github/languages/code-size/kamiazya/web-csv-toolbox)
![npm](https://img.shields.io/npm/dm/web-csv-toolbox)
[![codecov](https://codecov.io/gh/kamiazya/web-csv-toolbox/graph/badge.svg?token=8RbDcXHTFl)](https://codecov.io/gh/kamiazya/web-csv-toolbox)# `๐ web-csv-toolbox ๐งฐ`
A CSV Toolbox utilizing Web Standard APIs.
๐
[![GitHub](https://img.shields.io/badge/-GitHub-181717?logo=GitHub&style=flat)](https://github.com/kamiazya/web-csv-toolbox)
[![npm](https://img.shields.io/badge/-npm-CB3837?logo=npm&style=flat)](https://www.npmjs.com/package/web-csv-toolbox)
[![API Reference](https://img.shields.io/badge/-API%20Refarence-3178C6?logo=TypeScript&style=flat&logoColor=fff)](https://kamiazya.github.io/web-csv-toolbox/)
[![Sponsor](https://img.shields.io/badge/-GitHub%20Sponsor-fff?logo=GitHub%20Sponsors&style=flat)](https://github.com/sponsors/kamiazya)
[![CodSpeed Badge](https://img.shields.io/endpoint?url=https://codspeed.io/badge.json)](https://codspeed.io/kamiazya/web-csv-toolbox)[![format: Biome](https://img.shields.io/badge/format%20with-Biome-F7B911?logo=biome&style=flat)](https://biomejs.dev/)
[![test: Vitest](https://img.shields.io/badge/tested%20with-Vitest-6E9F18?logo=vitest&style=flat)](https://vitest.dev/)
[![build: Vite](https://img.shields.io/badge/build%20with-Vite-646CFF?logo=vite&style=flat)](https://rollupjs.org/)---
## Key Concepts โจ
- ๐ **Web Standards first.**
- Utilizing the Web Standards APIs, such as the [Web Streams API](https://developer.mozilla.org/en/docs/Web/API/Streams_API).
- โค๏ธ **TypeScript friendly & User friendly.**
- Fully typed and documented.
- 0๏ธโฃ **Zero dependencies.**
- Using only Web Standards APIs.
- ๐ช **Property-based testing.**
- Using [fast-check](https://fast-check.dev/) and [vitest](https://vitest.dev).
- โ **Cross-platform.**
- Works on browsers, Node.js, and Deno.## Key Features ๐
- ๐ **Efficient CSV Parsing with Streams**
- ๐ป Leveraging the [WHATWG Streams API](https://streams.spec.whatwg.org/) and other Web APIs for seamless and efficient data processing.
- ๐ **AbortSignal and Timeout Support**: Ensure your CSV processing is cancellable, including support for automatic timeouts.
- โ Integrate with [`AbortController`](https://developer.mozilla.org/docs/Web/API/AbortController) to manually cancel operations as needed.
- โณ Use [`AbortSignal.timeout`](https://developer.mozilla.org/docs/Web/API/AbortSignal/timeout_static) to automatically cancel operations that exceed a specified time limit.
- ๐จ **Flexible Source Support**
- ๐งฉ Parse CSVs directly from `string`s, `ReadableStream`s, or `Response` objects.
- โ๏ธ **Advanced Parsing Options**: Customize your experience with various delimiters and quotation marks.
- ๐ Defaults to `,` and `"` respectively.
- ๐พ **Specialized Binary CSV Parsing**: Leverage Stream-based processing for versatility and strength.
- ๐ Flexible BOM handling.
- ๐๏ธ Supports various compression formats.
- ๐ค Charset specification for diverse encoding.
- ๐ **Using WebAssembly for High Performance**: WebAssembly is used for high performance parsing. (_Experimental_)
- ๐ฆ WebAssembly is used for high performance parsing.
- ๐ฆ **Lightweight and Zero Dependencies**: No external dependencies, only Web Standards APIs.
- ๐ **Fully Typed and Documented**: Fully typed and documented with [TypeDoc](https://typedoc.org/).## Installation ๐ฅ
### With Package manager ๐ฆ
This package can then be installed using a package manager.
```sh
# Install with npm
$ npm install web-csv-toolbox
# Or Yarn
$ yarn add web-csv-toolbox
# Or pnpm
$ pnpm add web-csv-toolbox
```### From CDN (unpkg.com) ๐
#### UMD Style ๐
```html
const csv = `name,age
Alice,42
Bob,69`;(async function () {
for await (const record of CSV.parse(csv)) {
console.log(record);
}
})();```
#### ESModule Style ๐ฆ
```html
import { parse } from 'https://unpkg.com/web-csv-toolbox?module';
const csv = `name,age
Alice,42
Bob,69`;for await (const record of parse(csv)) {
console.log(record);
}```
#### Deno ๐ฆ
You can install and use the package by specifying the following:
```js
import { parse } from "npm:web-csv-toolbox";
```## Usage ๐
### Parsing CSV files from strings
```js
import { parse } from 'web-csv-toolbox';const csv = `name,age
Alice,42
Bob,69`;for await (const record of parse(csv)) {
console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```### Parsing CSV files from `ReadableStream`s
```js
import { parse } from 'web-csv-toolbox';const csv = `name,age
Alice,42
Bob,69`;const stream = new ReadableStream({
start(controller) {
controller.enqueue(csv);
controller.close();
},
});for await (const record of parse(stream)) {
console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```### Parsing CSV files from `Response` objects
```js
import { parse } from 'web-csv-toolbox';const response = await fetch('https://example.com/data.csv');
for await (const record of parse(response)) {
console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```### Parsing CSV files with different delimiters and quotation characters
```js
import { parse } from 'web-csv-toolbox';const csv = `name\tage
Alice\t42
Bob\t69`;for await (const record of parse(csv, { delimiter: '\t' })) {
console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```### Parsing CSV files with headers
```js
import { parse } from 'web-csv-toolbox';const csv = `Alice,42
Bob,69`;for await (const record of parse(csv, { headers: ['name', 'age'] })) {
console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```### `AbortSignal` / `AbortController` Support
Support for [`AbortSignal`](https://developer.mozilla.org/docs/Web/API/AbortSignal) / [`AbortController`](https://developer.mozilla.org/docs/Web/API/AbortController), enabling you to cancel ongoing asynchronous CSV processing tasks.
This feature is useful for scenarios where processing needs to be halted, such as when a user navigates away from the page or other conditions that require stopping the task early.
#### Example Use Case: Abort with user action
```js
import { parse } from 'web-csv-toolbox';const controller = new AbortController();
const csv = "name,age\nAlice,30\nBob,25";try {
// Parse the CSV data then pass the AbortSignal to the parse function
for await (const record of parse(csv, { signal: controller.signal })) {
console.log(record);
}
} catch (error) {
if (error instanceof DOMException && error.name === 'AbortError') {
// The CSV processing was aborted by the user
console.log('CSV processing was aborted by the user.');
} else {
// An error occurred during CSV processing
console.error('An error occurred:', error);
}
}// Some abort logic, like a cancel button
document.getElementById('cancel-button')
.addEventListener('click', () => {
controller.abort();
});
```#### Example Use Case: Abort with timeout
```js
import { parse } from 'web-csv-toolbox';// Set up a timeout of 5 seconds (5000 milliseconds)
const signal = AbortSignal.timeout(5000);const csv = "name,age\nAlice,30\nBob,25";
try {
// Pass the AbortSignal to the parse function
const result = await parse.toArray(csv, { signal });
console.log(result);
} catch (error) {
if (error instanceof DOMException && error.name === 'TimeoutError') {
// Handle the case where the processing was aborted due to timeout
console.log('CSV processing was aborted due to timeout.');
} else {
// Handle other errors
console.error('An error occurred during CSV processing:', error);
}
}
```## Supported Runtimes ๐ป
### Works on Node.js
| Versions | Status |
| -------- | ------ |
| 20.x | โ |
| 18.x | โ |### Works on Browser
| OS | Chrome | FireFox | Default |
| ------- | ------ | ------- | ------------- |
| Windows | โ | โ | โ (Edge) |
| macos | โ | โ | โฌ (Safari *) |
| Linux | โ | โ | - |> **\* To Be Tested**: [I couldn't launch Safari in headless mode](https://github.com/vitest-dev/vitest/blob/main/packages/browser/src/node/providers/webdriver.ts#L39-L41) on GitHub Actions, so I couldn't verify it, but it probably works.
### Others
- Verify that JavaScript is executable on the Deno. [![Deno CI](https://github.com/kamiazya/web-csv-toolbox/actions/workflows/deno.yaml/badge.svg)](https://github.com/kamiazya/web-csv-toolbox/actions/workflows/deno.yaml)
## APIs ๐งโ๐ป
### High-level APIs ๐
These APIs are designed for **Simplicity and Ease of Use**,
providing an intuitive and straightforward experience for users.- **`function parse(input[, options]): AsyncIterableIterator`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parse-1.html)
- Parses various CSV input formats into an asynchronous iterable of records.
- **`function parse.toArray(input[, options]): Promise`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parse.toArray.html)
- Parses CSV input into an array of records, ideal for smaller data sets.The `input` paramater can be a `string`, a [ReadableStream](https://developer.mozilla.org/docs/Web/API/ReadableStream)
of `string`s or [Uint8Array](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array)s,
or a [Uint8Array](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array) object,
or a [ArrayBuffer](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/ArrayBuffer) object,
or a [Response](https://developer.mozilla.org/docs/Web/API/Response) object.### Middle-level APIs ๐งฑ
These APIs are optimized for **Enhanced Performance and Control**,
catering to users who need more detailed and fine-tuned functionality.- **`function parseString(string[, options])`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseString-1.html)
- Efficient parsing of CSV strings.
- **`function parseBinary(buffer[, options])`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseBinary-1.html)
- Parse CSV Binary of ArrayBuffer or Uint8Array.
- **`function parseResponse(response[, options])`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseResponse-1.html)
- Customized parsing directly from `Response` objects.
- **`function parseStream(stream[, options])`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseStream-1.html)
- Stream-based parsing for larger or continuous data.
- **`function parseStringStream(stream[, options])`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseStringStream-1.html)
- Combines string-based parsing with stream processing.
- **`function parseUint8ArrayStream(stream[, options])`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseUint8ArrayStream-1.html)
- Parses binary streams with precise control over data types.### Low-level APIs โ๏ธ
These APIs are built for **Advanced Customization and Pipeline Design**,
ideal for developers looking for in-depth control and flexibility.- **`class LexerTransformer`**: [๐](https://kamiazya.github.io/web-csv-toolbox/classes/LexerTransformer.html)
- A TransformStream class for lexical analysis of CSV data.
- **`class RecordAssemblerTransformer`**: [๐](https://kamiazya.github.io/web-csv-toolbox/classes/RecordAssemblerTransformer.html)
- Handles the assembly of parsed data into records.### Experimental APIs ๐งช
These APIs are experimental and may change in the future.
#### Parsing using WebAssembly for high performance.
You can use WebAssembly to parse CSV data for high performance.
- Parsing with WebAssembly is faster than parsing with JavaScript,
but it takes time to load the WebAssembly module.
- Supports only UTF-8 encoding csv data.
- Quotation characters are only `"`. (Double quotation mark)
- If you pass a different character, it will throw an error.```ts
import { loadWASM, parseStringWASM } from "web-csv-toolbox";// load WebAssembly module
await loadWASM();const csv = "a,b,c\n1,2,3";
// parse CSV string
const result = parseStringToArraySyncWASM(csv);
console.log(result);
// Prints:
// [{ a: "1", b: "2", c: "3" }]
```- **`function loadWASM(): Promise`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/loadWASM.html)
- Loads the WebAssembly module.
- **`function parseStringToArraySyncWASM(string[, options]): CSVRecord[]`**: [๐](https://kamiazya.github.io/web-csv-toolbox/functions/parseStringToArraySyncWASM.html)
- Parses CSV strings into an array of records.## Options Configuration ๐ ๏ธ
### Common Options โ๏ธ
| Option | Description | Default | Notes |
| ----------- | ------------------------------------- | ----------- | ------------------------------------------------- |
| `delimiter` | Character to separate fields | `,` | |
| `quotation` | Character used for quoting fields | `"` | |
| `headers` | Custom headers for the parsed records | First row | If not provided, the first row is used as headers |
| `signal` | AbortSignal to cancel processing | `undefined` | Allows aborting of long-running operations |### Advanced Options (Binary-Specific) ๐งฌ
| Option | Description | Default | Notes |
| --------------- | ------------------------------------------------- | ------- | --------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `charset` | Character encoding for binary CSV inputs | `utf-8` | See [Encoding API Compatibility](https://developer.mozilla.org/en-US/docs/Web/API/Encoding_API/Encodings) for the encoding formats that can be specified. |
| `decompression` | Decompression algorithm for compressed CSV inputs | | See [DecompressionStream Compatibility](https://developer.mozilla.org/en-US/docs/Web/API/DecompressionStream#browser_compatibilit). |
| `ignoreBOM` | Whether to ignore Byte Order Mark (BOM) | `false` | See [TextDecoderOptions.ignoreBOM](https://developer.mozilla.org/en-US/docs/Web/API/TextDecoderStream/ignoreBOM) for more information about the BOM. |
| `fatal` | Throw an error on invalid characters | `false` | See [TextDecoderOptions.fatal](https://developer.mozilla.org/en-US/docs/Web/API/TextDecoderStream/fatal) for more information. |## How to Contribute ๐ช
## Star โญ
The easiest way to contribute is to use the library and star [repository](https://github.com/kamiazya/web-csv-toolbox/).
### Questions ๐ญ
Feel free to ask questions on [GitHub Discussions](https://github.com/kamiazya/web-csv-toolbox/discussions).
### Report bugs / request additional features ๐ก
Please register at [GitHub Issues](https://github.com/kamiazya/web-csv-toolbox/issues/new/choose).
### Financial Support ๐ธ
Please support [kamiazya](https://github.com/sponsors/kamiazya).
> Even just a dollar is enough motivation to develop ๐
## License โ๏ธ
This software is released under the MIT License, see [LICENSE](https://github.com/kamiazya/web-csv-toolbox?tab=MIT-1-ov-file).
[![FOSSA Status](https://app.fossa.com/api/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox.svg?type=large)](https://app.fossa.com/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox?ref=badge_large)