
[![npm version](https://badge.fury.io/js/web-csv-toolbox.svg)](https://badge.fury.io/js/web-csv-toolbox)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg)](http://makeapullrequest.com)
![node version](https://img.shields.io/node/v/web-csv-toolbox)
[![FOSSA Status](https://app.fossa.com/api/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox.svg?type=shield)](https://app.fossa.com/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox?ref=badge_shield)

![GitHub code size in bytes](https://img.shields.io/github/languages/code-size/kamiazya/web-csv-toolbox)
![npm](https://img.shields.io/npm/dm/web-csv-toolbox)
[![codecov](https://codecov.io/gh/kamiazya/web-csv-toolbox/graph/badge.svg?token=8RbDcXHTFl)](https://codecov.io/gh/kamiazya/web-csv-toolbox)

# `🌐 web-csv-toolbox 🧰`

A CSV Toolbox utilizing Web Standard APIs.

🔗

[![GitHub](https://img.shields.io/badge/-GitHub-181717?logo=GitHub&style=flat)](https://github.com/kamiazya/web-csv-toolbox)
[![npm](https://img.shields.io/badge/-npm-CB3837?logo=npm&style=flat)](https://www.npmjs.com/package/web-csv-toolbox)
[![API Reference](https://img.shields.io/badge/-API%20Reference-3178C6?logo=TypeScript&style=flat&logoColor=fff)](https://kamiazya.github.io/web-csv-toolbox/)
[![Ask DeepWiki](https://deepwiki.com/badge.svg)](https://deepwiki.com/kamiazya/web-csv-toolbox)
[![Sponsor](https://img.shields.io/badge/-GitHub%20Sponsor-fff?logo=GitHub%20Sponsors&style=flat)](https://github.com/sponsors/kamiazya)
[![CodSpeed Badge](https://img.shields.io/endpoint?url=https://codspeed.io/badge.json)](https://codspeed.io/kamiazya/web-csv-toolbox)

[![format: Biome](https://img.shields.io/badge/format%20with-Biome-F7B911?logo=biome&style=flat)](https://biomejs.dev/)
[![test: Vitest](https://img.shields.io/badge/tested%20with-Vitest-6E9F18?logo=vitest&style=flat)](https://vitest.dev/)
[![build: Vite](https://img.shields.io/badge/build%20with-Vite-646CFF?logo=vite&style=flat)](https://vitejs.dev/)

GMO OSS support

---

## Key Concepts ✨

- 🌐 **Web Standards first.**
  - Utilizing Web Standards APIs, such as the [Web Streams API](https://developer.mozilla.org/docs/Web/API/Streams_API).
- ❤️ **TypeScript friendly & User friendly.**
  - Fully typed and documented.
- 0️⃣ **Zero dependencies.**
  - Using only Web Standards APIs.
- 💪 **Property-based testing.**
  - Using [fast-check](https://fast-check.dev/) and [vitest](https://vitest.dev).
- ✅ **Cross-platform.**
  - Works on browsers, Node.js, and Deno.

## Key Features 📗

- 🌊 **Efficient CSV Parsing with Streams**
  - 💻 Leveraging the [WHATWG Streams API](https://streams.spec.whatwg.org/) and other Web APIs for seamless and efficient data processing.
- 🛑 **AbortSignal and Timeout Support**: Ensure your CSV processing is cancellable, including support for automatic timeouts.
  - ✋ Integrate with [`AbortController`](https://developer.mozilla.org/docs/Web/API/AbortController) to manually cancel operations as needed.
  - ⏳ Use [`AbortSignal.timeout`](https://developer.mozilla.org/docs/Web/API/AbortSignal/timeout_static) to automatically cancel operations that exceed a specified time limit.
- 🛡️ **Memory Safety Protection**: Built-in limits prevent memory exhaustion attacks.
  - 🔒 Configurable maximum buffer size (default: 10M characters) to prevent DoS attacks via unbounded input; throws `RangeError` when the buffer exceeds the limit.
  - 📊 Configurable maximum field count (default: 100,000 fields/record) to prevent excessive-column attacks; throws `RangeError` when the field count exceeds the limit.
  - 💾 Configurable maximum binary size (default: 100 MB) for BufferSource inputs; throws `RangeError` when the binary size exceeds the limit.
- 🎨 **Flexible Source Support**
  - 🧩 Parse CSVs directly from `string`s, `ReadableStream`s, `Response` objects, `Blob`/`File` objects, or `Request` objects.
- ⚙️ **Advanced Parsing Options**: Customize your experience with various delimiters and quotation marks.
  - 🔄 Defaults to `,` and `"` respectively.
- 💾 **Specialized Binary CSV Parsing**: Leverage stream-based processing for versatility and strength.
  - 🔄 Flexible BOM handling.
  - 🗜️ Supports various compression formats.
  - 🔀 Charset specification for diverse encodings.
- 🚀 **High-Performance Parsing with WebAssembly** (_Experimental_)
  - ⚠️ WASM automatic initialization (base64-embedded) is experimental and may change in future versions.
- 📦 **Lightweight and Zero Dependencies**: No external dependencies, only Web Standards APIs.
- 📚 **Fully Typed and Documented**: Fully typed and documented with [TypeDoc](https://typedoc.org/).

## Installation 📥

### With a Package Manager 📦

Install the package with your preferred package manager:

```sh
# Install with npm
$ npm install web-csv-toolbox
# Or Yarn
$ yarn add web-csv-toolbox
# Or pnpm
$ pnpm add web-csv-toolbox
```

### From CDN (unpkg.com) 🌐

```html
<script type="module">
import { parse } from 'https://unpkg.com/web-csv-toolbox';

const csv = `name,age
Alice,42
Bob,69`;

for await (const record of parse(csv)) {
  console.log(record);
}
</script>
```

#### Deno 🦕

Deno can import the package directly from npm:

```js
import { parse } from "npm:web-csv-toolbox";
```
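
For example, iterating records in a Deno script (the CSV content here is illustrative; `parse` is the same high-level API shown throughout this README):

```ts
import { parse } from "npm:web-csv-toolbox";

for await (const record of parse("name,age\nAlice,42")) {
  console.log(record); // { name: "Alice", age: "42" }
}
```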

## Entry Points 🚪

This library provides two entry points to suit different needs:

For a deeper comparison and migration guidance, see:

- [docs/explanation/main-vs-slim.md](./docs/explanation/main-vs-slim.md)

### `web-csv-toolbox` (Default - Full Features)

**Best for**: Most users who want automatic WASM initialization and all features

```typescript
import { loadWASM, parseStringToArraySyncWASM } from 'web-csv-toolbox';

// Optional but recommended: preload to reduce first-parse latency
await loadWASM();
const records = parseStringToArraySyncWASM(csv);
```

**Characteristics:**
- ✅ Full features including synchronous WASM APIs
- ✅ Automatic WASM initialization on first use (not at import time)
- 💡 Call `loadWASM()` at startup to reduce first-parse latency (optional)
- ⚠️ **Experimental**: WASM auto-init embeds WASM as base64 and may change in the future
- ⚠️ Larger bundle size (WASM embedded in main bundle)

### `web-csv-toolbox/slim` (Slim Entry - Smaller Bundle)

**Best for**: Bundle size-sensitive applications and production optimization

```typescript
import { loadWASM, parseStringToArraySyncWASM } from 'web-csv-toolbox/slim';

// Manual initialization required
await loadWASM();
const records = parseStringToArraySyncWASM(csv);
```

**Characteristics:**
- ✅ Smaller main bundle (WASM not embedded)
- ✅ External WASM loading for better caching
- ✅ Explicit control over initialization timing
- ❌ Requires manual `loadWASM()` call before using WASM features

**Comparison:**

| Aspect | Main | Slim |
|--------|------|------|
| **Initialization** | Automatic | Manual (`loadWASM()` required) |
| **Bundle Size** | Larger (WASM embedded) | Smaller (WASM external) |
| **Caching** | Single bundle | WASM cached separately |
| **Use Case** | Convenience, prototyping | Production, bundle optimization |

> **Note**: Both entry points export the same full API (feature parity). The only difference is WASM initialization strategy and bundle size.

## Usage 📘

> **Note for Bundler Users**: When using Worker-based execution strategies (e.g., `EnginePresets.responsive()`, `EnginePresets.responsiveFast()`) with bundlers like Vite or Webpack, you must explicitly specify the `workerURL` option. See the [Bundler Integration Guide](./docs/how-to-guides/using-with-bundlers.md) for configuration details.

### Parsing CSV files from strings

```js
import { parse } from 'web-csv-toolbox';

const csv = `name,age
Alice,42
Bob,69`;

for await (const record of parse(csv)) {
  console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```

### Parsing CSV files from `ReadableStream`s

```js
import { parse } from 'web-csv-toolbox';

const csv = `name,age
Alice,42
Bob,69`;

const stream = new ReadableStream({
  start(controller) {
    controller.enqueue(csv);
    controller.close();
  },
});

for await (const record of parse(stream)) {
  console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```

### Parsing CSV files from `Response` objects

```js
import { parse } from 'web-csv-toolbox';

const response = await fetch('https://example.com/data.csv');

for await (const record of parse(response)) {
  console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```

### Parsing CSV files from `Blob` or `File` objects

```js
import { parse } from 'web-csv-toolbox';

// From file input
const fileInput = document.querySelector('input[type="file"]');
fileInput.addEventListener('change', async (e) => {
  const file = e.target.files[0];

  for await (const record of parse(file)) {
    console.log(record);
  }
  // Prints:
  // { name: 'Alice', age: '42' }
  // { name: 'Bob', age: '69' }
});
```

### Parsing CSV files from `Request` objects (Server-side)

```js
import { parse } from 'web-csv-toolbox';

// Cloudflare Workers / Service Workers
export default {
  async fetch(request) {
    if (request.method === 'POST') {
      for await (const record of parse(request)) {
        console.log(record);
      }
      // Prints:
      // { name: 'Alice', age: '42' }
      // { name: 'Bob', age: '69' }

      return new Response('OK', { status: 200 });
    }
  }
};
```

### Parsing CSV files with different delimiters and quotation characters

```js
import { parse } from 'web-csv-toolbox';

const csv = `name\tage
Alice\t42
Bob\t69`;

for await (const record of parse(csv, { delimiter: '\t' })) {
  console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```

### Parsing CSV files with headers

```js
import { parse } from 'web-csv-toolbox';

const csv = `Alice,42
Bob,69`;

for await (const record of parse(csv, { header: ['name', 'age'] })) {
  console.log(record);
}
// Prints:
// { name: 'Alice', age: '42' }
// { name: 'Bob', age: '69' }
```

### Working with Headerless CSV Files

Some CSV files don't include a header row. You can provide custom headers manually:

```typescript
import { parse } from 'web-csv-toolbox';

// Example: Sensor data without headers
const sensorData = `25.5,60,1024
26.1,58,1020
24.8,62,1025`;

// Provide headers explicitly
for await (const record of parse(sensorData, {
  header: ['temperature', 'humidity', 'pressure']
})) {
  console.log(`Temp: ${record.temperature}°C, Humidity: ${record.humidity}%, Pressure: ${record.pressure} hPa`);
}
// Output:
// Temp: 25.5°C, Humidity: 60%, Pressure: 1024 hPa
// Temp: 26.1°C, Humidity: 58%, Pressure: 1020 hPa
// Temp: 24.8°C, Humidity: 62%, Pressure: 1025 hPa
```

### `AbortSignal` / `AbortController` Support

Support for [`AbortSignal`](https://developer.mozilla.org/docs/Web/API/AbortSignal) / [`AbortController`](https://developer.mozilla.org/docs/Web/API/AbortController), enabling you to cancel ongoing asynchronous CSV processing tasks.

This feature is useful for scenarios where processing needs to be halted, such as when a user navigates away from the page or other conditions that require stopping the task early.

#### Example Use Case: Abort with user action

```js
import { parse } from 'web-csv-toolbox';

const controller = new AbortController();
const csv = "name,age\nAlice,30\nBob,25";

try {
  // Pass the AbortSignal to the parse function
  for await (const record of parse(csv, { signal: controller.signal })) {
    console.log(record);
  }
} catch (error) {
  if (error instanceof DOMException && error.name === 'AbortError') {
    // The CSV processing was aborted by the user
    console.log('CSV processing was aborted by the user.');
  } else {
    // An error occurred during CSV processing
    console.error('An error occurred:', error);
  }
}

// Some abort logic, like a cancel button
document.getElementById('cancel-button')
  .addEventListener('click', () => {
    controller.abort();
  });
```

#### Example Use Case: Abort with timeout

```js
import { parse } from 'web-csv-toolbox';

// Set up a timeout of 5 seconds (5000 milliseconds)
const signal = AbortSignal.timeout(5000);

const csv = "name,age\nAlice,30\nBob,25";

try {
  // Pass the AbortSignal to the parse function
  const result = await parse.toArray(csv, { signal });
  console.log(result);
} catch (error) {
  if (error instanceof DOMException && error.name === 'TimeoutError') {
    // Handle the case where the processing was aborted due to timeout
    console.log('CSV processing was aborted due to timeout.');
  } else {
    // Handle other errors
    console.error('An error occurred during CSV processing:', error);
  }
}
```

## Supported Runtimes 💻

### Works on Node.js

| Versions | Status |
| -------- | ------ |
| 20.x | ✅ |
| 22.x | ✅ |
| 24.x | ✅ |

> Note: For Node environments, the WASM loader uses `import.meta.resolve`. Node.js 20.6+ is recommended. On older Node versions, pass an explicit URL/Buffer to `loadWASM()`.
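
On such older runtimes, a minimal sketch of explicit initialization; the `.wasm` path below is hypothetical, so adjust it to your installed package layout:

```ts
import { readFile } from "node:fs/promises";
import { loadWASM, parseStringToArraySyncWASM } from "web-csv-toolbox";

// Hypothetical artifact path: point this at the .wasm file in your install.
const wasmBuffer = await readFile(
  "node_modules/web-csv-toolbox/dist/web_csv_toolbox_wasm_bg.wasm",
);

await loadWASM(wasmBuffer); // explicit Buffer instead of import.meta.resolve
console.log(parseStringToArraySyncWASM("a,b\n1,2"));
```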

### Works on Browser

| OS | Chrome | Firefox | Default Browser |
| ------- | ------ | ------- | --------------- |
| Windows | ✅ | ✅ | ✅ (Edge) |
| macOS | ✅ | ✅ | ⬜ (Safari *) |
| Linux | ✅ | ✅ | - |

> **\* Safari**: Basic functionality is expected to work, but it is not yet automatically tested in our CI environment.

### Others

- Deno: JavaScript execution is verified in CI. [![Deno CI](https://github.com/kamiazya/web-csv-toolbox/actions/workflows/deno.yaml/badge.svg)](https://github.com/kamiazya/web-csv-toolbox/actions/workflows/deno.yaml)

### Platform-Specific Usage Guide 📚

For detailed examples and best practices for your specific runtime environment, see:

**[Platform-Specific Usage Guide](./docs/how-to-guides/platform-usage/)**

This guide covers:
- 🌐 **Browser**: File input, drag-and-drop, Clipboard API, FormData, Fetch API
- 🟢 **Node.js**: Buffer, fs.ReadStream, HTTP requests, stdin/stdout
- 🦕 **Deno**: Deno.readFile, Deno.open, fetch API
- ⚡ **Edge**: Cloudflare Workers, Deno Deploy, Vercel Edge Functions
- 🐰 **Bun**: File API, HTTP server

## APIs 🧑‍💻

### High-level APIs 🚀

These APIs are designed for **Simplicity and Ease of Use**,
providing an intuitive and straightforward experience for users.

- **`function parse(input[, options]): AsyncIterableIterator`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parse-1.html)
  - Parses various CSV input formats into an asynchronous iterable of records.
- **`function parse.toArray(input[, options]): Promise`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parse.toArray.html)
  - Parses CSV input into an array of records, ideal for smaller data sets.

The `input` parameter can be:
- a `string`
- a [ReadableStream](https://developer.mozilla.org/docs/Web/API/ReadableStream) of `string`s or [Uint8Array](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array)s
- a [BufferSource](https://webidl.spec.whatwg.org/#BufferSource) ([Uint8Array](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/Uint8Array), [ArrayBuffer](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/ArrayBuffer), or other [TypedArray](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/TypedArray))
- a [Response](https://developer.mozilla.org/docs/Web/API/Response) object
- a [Blob](https://developer.mozilla.org/docs/Web/API/Blob) or [File](https://developer.mozilla.org/docs/Web/API/File) object
- a [Request](https://developer.mozilla.org/docs/Web/API/Request) object (server-side)
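
Binary input goes through the same entry point as text. A small sketch parsing a `Uint8Array` (BufferSource):

```ts
import { parse } from 'web-csv-toolbox';

// Encode a CSV string to bytes to simulate binary input.
const bytes = new TextEncoder().encode('name,age\nAlice,42');

for await (const record of parse(bytes)) {
  console.log(record); // { name: 'Alice', age: '42' }
}
```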

### Middle-level APIs 🧱

These APIs are optimized for **Enhanced Performance and Control**,
catering to users who need more detailed and fine-tuned functionality.

- **`function parseString(string[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseString-1.html)
  - Efficient parsing of CSV strings.
- **`function parseBinary(buffer[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseBinary-1.html)
  - Parses CSV binary data from a BufferSource (Uint8Array, ArrayBuffer, or other TypedArray).
- **`function parseResponse(response[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseResponse-1.html)
  - Customized parsing directly from `Response` objects.
- **`function parseRequest(request[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseRequest-1.html)
  - Server-side parsing from `Request` objects (Cloudflare Workers, Service Workers, etc.).
- **`function parseBlob(blob[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseBlob-1.html)
  - Parses CSV data from `Blob` or `File` objects.
- **`function parseFile(file[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseFile-1.html)
  - Parses `File` objects with automatic filename tracking in error messages.
- **`function parseStream(stream[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseStream-1.html)
  - Stream-based parsing for larger or continuous data.
- **`function parseStringStream(stream[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseStringStream-1.html)
  - Combines string-based parsing with stream processing.
- **`function parseBinaryStream(stream[, options])`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseBinaryStream-1.html)
  - Parses binary streams with precise control over data types.
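
For example, `parseString` skips input-type detection when the source is already known to be a string. A minimal sketch, assuming it yields records the same way `parse` does:

```ts
import { parseString } from 'web-csv-toolbox';

const csv = 'name,age\nAlice,42';

for await (const record of parseString(csv)) {
  console.log(record); // { name: 'Alice', age: '42' }
}
```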

### Low-level APIs ⚙️

These APIs are built for **Advanced Customization and Pipeline Design**,
ideal for developers looking for in-depth control and flexibility.

The low-level APIs follow a 3-tier architecture:

#### Parser Models (Tier 1: Simplified Composition)

Combines Lexer and Assembler for streamlined usage without sacrificing flexibility.

- **`function createStringCSVParser(options?)`**
  - Factory function for creating format-specific CSV parsers.
  - Returns `FlexibleStringObjectCSVParser` (default) or `FlexibleStringArrayCSVParser` based on the `outputFormat` option.
  - Parses CSV strings by composing `FlexibleStringCSVLexer` and a CSV record assembler.
  - Stateful: the parser maintains internal lexer and assembler instances for streaming.
  - Use with `StringCSVParserStream` for streaming workflows.
  - **Low-level API**: Accepts `CSVProcessingOptions` only (no `engine` option).
  - **Streaming mode**: When using `parse(chunk, { stream: true })`, you must call `parse()` without arguments at the end to flush any remaining data.

```typescript
import { createStringCSVParser } from 'web-csv-toolbox';

// Object format (default)
const objectParser = createStringCSVParser({
  header: ['name', 'age']
});

// Array format
const arrayParser = createStringCSVParser({
  header: ['name', 'age'],
  outputFormat: 'array'
});

// Process chunks
const records1 = objectParser.parse('Alice,30\nBob,', { stream: true });
const records2 = objectParser.parse('25\nCharlie,', { stream: true });

// Flush remaining data (required!)
const records3 = objectParser.parse();
```

- **Direct class usage**:
  - `FlexibleStringObjectCSVParser` - Always outputs object records
  - `FlexibleStringArrayCSVParser` - Always outputs array records

- **`function createBinaryCSVParser(options?)`**
  - Factory function for creating format-specific binary CSV parsers.
  - Returns `FlexibleBinaryObjectCSVParser` (default) or `FlexibleBinaryArrayCSVParser` based on the `outputFormat` option.
  - Parses binary CSV data (BufferSource: Uint8Array, ArrayBuffer, or other TypedArray) by composing `TextDecoder` with the string CSV parser.
  - Uses `TextDecoder` with the `stream: true` option for proper multi-byte character handling across chunk boundaries.
  - Supports various character encodings (utf-8, shift_jis, etc.) via the `charset` option.
  - BOM handling via the `ignoreBOM` option; fatal error mode via the `fatal` option.
  - Use with `BinaryCSVParserStream` for streaming workflows.
  - **Low-level API**: Accepts `BinaryCSVProcessingOptions` only (no `engine` option).
  - **Streaming mode**: When using `parse(chunk, { stream: true })`, you must call `parse()` without arguments at the end to flush the TextDecoder and parser buffers.

```typescript
import { createBinaryCSVParser } from 'web-csv-toolbox';

// Object format (default)
const objectParser = createBinaryCSVParser({
  header: ['name', 'age'],
  charset: 'utf-8'
});

// Array format
const arrayParser = createBinaryCSVParser({
  header: ['name', 'age'],
  outputFormat: 'array',
  charset: 'utf-8'
});

const encoder = new TextEncoder();

// Process chunks
const records1 = objectParser.parse(encoder.encode('Alice,30\nBob,'), { stream: true });
const records2 = objectParser.parse(encoder.encode('25\n'), { stream: true });

// Flush remaining data (required!)
const records3 = objectParser.parse();
```

- **Direct class usage**:
  - `FlexibleBinaryObjectCSVParser` - Always outputs object records
  - `FlexibleBinaryArrayCSVParser` - Always outputs array records

#### Lexer (Tier 2: Stage 1 - Tokenization)

Low-level tokenization with full control over CSV syntax.

- **`function createStringCSVLexer(options?)` / `class FlexibleStringCSVLexer`**
  - Factory helper plus the underlying class for the standalone lexer used across the toolkit.
  - Configure delimiters, quotation, buffer limits, and cancellation per stream.
  - Returns tokens (field values, row delimiters, etc.) for manual processing.

#### Assembler (Tier 2: Stage 2 - Record Assembly)

Converts tokens into structured records with flexible formatting.

- **`function createCSVRecordAssembler(options)`**
  - Factory that returns either an object- or array-format assembler based on `outputFormat`.
  - Applies newer options like `includeHeader` and `columnCountStrategy` consistently across environments.
- **`class FlexibleCSVObjectRecordAssembler` / `class FlexibleCSVArrayRecordAssembler`**
  - Specialized assemblers when you need full control over object vs. tuple output or want to extend behavior.
  - `FlexibleCSVRecordAssembler` remains for backward compatibility but now delegates to these focused implementations.
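
A rough sketch of how the two stages compose. The token-passing method names below are hypothetical, so consult the [API Reference](https://kamiazya.github.io/web-csv-toolbox/) for the real signatures:

```ts
import {
  createStringCSVLexer,
  createCSVRecordAssembler,
} from 'web-csv-toolbox';

const lexer = createStringCSVLexer({ delimiter: ',' });
const assembler = createCSVRecordAssembler({ outputFormat: 'object' });

// Hypothetical flow: tokenize a chunk, then assemble tokens into records.
// The method names below are illustrative, not the documented API.
// const tokens = lexer.lex('name,age\nAlice,42');
// const records = assembler.assemble(tokens);
```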

#### Streaming Transformers (Tier 3: TransformStream Integration)

Web Streams API integration for all processing tiers.

- **`class StringCSVParserStream`**
  - `TransformStream` for streaming string parsing.
  - Wraps parser instances (accepts a parser in its constructor, doesn't construct one internally).
  - Configurable backpressure handling via the `backpressureCheckInterval` option.
  - Supports custom queuing strategies for fine-tuned performance.
- **`class BinaryCSVParserStream`**
  - `TransformStream` for streaming binary parsing.
  - Handles UTF-8 multi-byte characters across chunk boundaries.
  - Integration-ready for the fetch API and file streaming.
  - Backpressure management with configurable check intervals.
- **`createStringCSVLexerTransformer()`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/createStringCSVLexerTransformer.html)
  - Factory function to create a `StringCSVLexerTransformer` with customizable queuing strategies.
- **`createCSVRecordAssemblerTransformer()`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/createCSVRecordAssemblerTransformer.html)
  - Factory function to create a `CSVRecordAssemblerTransformer` with cooperative backpressure support.

These factory functions are the recommended way to create transformer instances. They encapsulate internal lexer/assembler initialization and provide sensible defaults, insulating your code from internal implementation changes. Direct class instantiation (`new StringCSVLexerTransformer(customLexer)`) is only needed when injecting a custom lexer implementation; see [Custom Lexer/Assembler](./docs/how-to-guides/custom-csv-parser.md#low-level-custom-lexerassembler) for advanced use cases.

#### Customizing Queuing Strategies

Both `createStringCSVLexerTransformer()` and `createCSVRecordAssemblerTransformer()` support custom queuing strategies following the Web Streams API pattern. Strategies are passed as function arguments with **data-type-aware size counting** and **configurable backpressure handling**.

**Function signature:**
```typescript
createStringCSVLexerTransformer(options?, streamOptions?, writableStrategy?, readableStrategy?)
createCSVRecordAssemblerTransformer(options?, streamOptions?, writableStrategy?, readableStrategy?)
```

**Default queuing strategies (starting points, not benchmarked):**
```typescript
// StringCSVLexerTransformer defaults
createStringCSVLexerTransformer(
  { delimiter: ',' }, // CSV options
  { backpressureCheckInterval: 100 }, // Check every 100 tokens
  {
    highWaterMark: 65536, // 64KB of characters
    size: (chunk) => chunk.length, // Count by string length
  },
  new CountQueuingStrategy({ highWaterMark: 1024 }) // 1024 tokens
)

// CSVRecordAssemblerTransformer defaults
createCSVRecordAssemblerTransformer(
  { header: ['name', 'age'] }, // Assembler options
  { backpressureCheckInterval: 10 }, // Check every 10 records
  new CountQueuingStrategy({ highWaterMark: 1024 }), // 1024 tokens
  new CountQueuingStrategy({ highWaterMark: 256 }) // 256 records
)
```

**Key Features:**

🎯 **Smart Size Counting:**
- Character-based counting for string inputs (accurate memory tracking)
- Token-based counting between transformers (smooth pipeline flow)
- Record-based counting for output (intuitive and predictable)

⚡ **Cooperative Backpressure:**
- Monitors `controller.desiredSize` during processing
- Yields to event loop when backpressure detected
- Prevents blocking the main thread
- Critical for browser UI responsiveness

🔧 **Tunable Backpressure Check Interval:**
- `backpressureCheckInterval` (in options): How often to check for backpressure (count-based)
- Lower values (5-25): More responsive, slight overhead
- Higher values (100-500): Less overhead, slower response
- Customize based on downstream consumer speed

> ⚠️ **Important**: These defaults are theoretical starting points based on data flow characteristics, **not empirical benchmarks**. Optimal values vary by runtime (browser/Node.js/Deno), file size, memory constraints, and CPU performance. **Profile your specific use case** to find the best values.

**When to customize:**
- 🚀 **High-throughput servers**: Higher `highWaterMark` (128KB+, 2048+ tokens), higher `backpressureCheckInterval` (200-500)
- 📱 **Memory-constrained environments**: Lower `highWaterMark` (16KB, 256 tokens), lower `backpressureCheckInterval` (10-25)
- 🐌 **Slow consumers** (DB writes, API calls): Lower `highWaterMark`, lower `backpressureCheckInterval` for responsive backpressure
- 🏃 **Fast processing**: Higher values to reduce overhead

**Example - High-throughput server:**
```typescript
import {
  createStringCSVLexerTransformer,
  createCSVRecordAssemblerTransformer
} from 'web-csv-toolbox';

const response = await fetch('large-dataset.csv');
await response.body
  .pipeThrough(new TextDecoderStream())
  .pipeThrough(createStringCSVLexerTransformer(
    { delimiter: ',' },
    { backpressureCheckInterval: 200 }, // Less frequent checks
    {
      highWaterMark: 131072, // 128KB
      size: (chunk) => chunk.length,
    },
    new CountQueuingStrategy({ highWaterMark: 2048 }) // 2048 tokens
  ))
  .pipeThrough(createCSVRecordAssemblerTransformer(
    {}, // Use default assembler options
    { backpressureCheckInterval: 20 }, // Less frequent checks
    new CountQueuingStrategy({ highWaterMark: 2048 }), // 2048 tokens
    new CountQueuingStrategy({ highWaterMark: 512 }) // 512 records
  ))
  .pipeTo(yourRecordProcessor);
```

**Example - Slow consumer (API writes):**
```typescript
import {
  createStringCSVLexerTransformer,
  createCSVRecordAssemblerTransformer
} from 'web-csv-toolbox';

await csvStream
  .pipeThrough(createStringCSVLexerTransformer()) // Use defaults
  .pipeThrough(createCSVRecordAssemblerTransformer(
    {}, // Use default assembler options
    { backpressureCheckInterval: 2 }, // Very responsive
    new CountQueuingStrategy({ highWaterMark: 512 }),
    new CountQueuingStrategy({ highWaterMark: 64 })
  ))
  .pipeTo(new WritableStream({
    async write(record) {
      await fetch('/api/save', { method: 'POST', body: JSON.stringify(record) });
    }
  }));
```

**Benchmarking:**
Use the provided benchmark tool to find optimal values for your use case:
```bash
pnpm --filter web-csv-toolbox-benchmark queuing-strategy
```

See `benchmark/queuing-strategy.bench.ts` for implementation details.

### Experimental APIs 🧪

These APIs are experimental and may change in the future.

#### Parsing using WebAssembly for high performance

You can use WebAssembly to parse CSV data for high performance.

⚠️ **Experimental Notice**:
- WASM automatic initialization is experimental and may change in future versions
- Currently embeds WASM as base64 in the main bundle
- Future versions may change the loading strategy for better bundle size optimization

**WASM Limitations:**
- Parsing with WebAssembly is faster than parsing with JavaScript, but loading the WebAssembly module takes time.
- Supports only UTF-8 encoded CSV data.
- The only supported quotation character is `"` (double quote); passing a different character throws an error.
- Record output is always object-shaped; `outputFormat: 'array'` requires the JavaScript engine (`engine: { wasm: false }`).

```ts
import { loadWASM, parseStringToArraySyncWASM } from "web-csv-toolbox";

// load WebAssembly module
await loadWASM();

const csv = "a,b,c\n1,2,3";

// parse CSV string
const result = parseStringToArraySyncWASM(csv);
console.log(result);
// Prints:
// [{ a: "1", b: "2", c: "3" }]
```

- **`function loadWASM(): Promise`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/loadWASM.html)
  - Loads the WebAssembly module.
- **`function parseStringToArraySyncWASM(string[, options]): CSVRecord[]`**: [📑](https://kamiazya.github.io/web-csv-toolbox/functions/parseStringToArraySyncWASM.html)
  - Parses CSV strings into an array of records.

## Options Configuration 🛠️

### Common Options ⚙️

| Option | Description | Default | Notes |
| ---------------- | ------------------------------------- | ------------ | ---------------------------------------------------------------------------------- |
| `delimiter` | Character to separate fields | `,` | |
| `quotation` | Character used for quoting fields | `"` | |
| `maxBufferSize` | Maximum internal buffer size (characters) | `10 * 1024 * 1024` | Set to `Number.POSITIVE_INFINITY` to disable (not recommended for untrusted input). Measured in UTF-16 code units. |
| `maxFieldCount` | Maximum fields allowed per record | `100000` | Set to `Number.POSITIVE_INFINITY` to disable (not recommended for untrusted input) |
| `header` | Custom headers for the parsed records | First row | If not provided, the first row is used as headers |
| `outputFormat` | Record shape (`'object'` or `'array'`) | `'object'` | `'array'` returns type-safe tuples; not available when running through WASM today |
| `includeHeader` | Emit header row when using array output | `false` | Only valid with `outputFormat: 'array'`; the header becomes the first emitted record |
| `columnCountStrategy` | Handle column-count mismatches when a header is provided | `'keep'` for array format / `'pad'` for object format | Choose between `keep`, `pad`, `strict`, or `truncate` to control how rows align with the header |
| `signal` | AbortSignal to cancel processing | `undefined` | Allows aborting of long-running operations |
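
For example, `delimiter` and `quotation` can be combined to read semicolon-separated, single-quoted data (a sketch using only options from the table above):

```ts
import { parse } from 'web-csv-toolbox';

const csv = "name;age\n'Alice; A.';42";

// Quoted fields may contain the delimiter without being split.
for await (const record of parse(csv, { delimiter: ';', quotation: "'" })) {
  console.log(record); // { name: 'Alice; A.', age: '42' }
}
```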

#### Record Output Formats

High-level and mid-level parsers now let you choose whether records come back as objects (default) or as tuple-like arrays:

```ts
const header = ["name", "age"];

// Object output (default)
for await (const record of parse(csv, { header })) {
  record.name; // string
}

// Array output with named tuples
const rows = await parse.toArray(csv, {
  header,
  outputFormat: "array",
  includeHeader: true,
  columnCountStrategy: "pad",
  engine: { wasm: false }, // Array output currently runs on the JS engine only
});
// rows[0] === ['name', 'age'] (header row)
// rows[1] has type readonly [name: string, age: string]
```

- `outputFormat: 'object'` (default) returns familiar `{ column: value }` objects.
- `outputFormat: 'array'` returns readonly tuples whose indices inherit names from the header for stronger TypeScript inference.
- `includeHeader: true` prepends the header row when you also set `outputFormat: 'array'`.
- `columnCountStrategy` controls how rows with too many or too few columns are treated when a header is present:
  - `keep`: emit rows exactly as they appear (default for array output with inferred headers)
  - `pad`: fill short rows with `undefined` and truncate long rows (default for object output)
  - `strict`: throw if the row length differs from the header
  - `truncate`: discard columns beyond the header length without padding short rows

> ⚠️ Array output is not yet available inside the WebAssembly execution path. If you request `outputFormat: 'array'`, force the JavaScript engine with `engine: { wasm: false }` (or run in an environment where WASM is disabled).
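
A short sketch of `columnCountStrategy: 'strict'`, which should reject any row whose length differs from the provided header (the exact error type is not specified here):

```ts
import { parse } from 'web-csv-toolbox';

const csv = 'Alice'; // one column, but the header declares two

try {
  await parse.toArray(csv, {
    header: ['name', 'age'],
    columnCountStrategy: 'strict',
  });
} catch (error) {
  console.error('Row length differs from header:', error);
}
```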

### Advanced Options (Binary-Specific) 🧬

| Option | Description | Default | Notes |
| --------------------------------- | ------------------------------------------------- | ------- | --------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `charset` | Character encoding for binary CSV inputs | `utf-8` | See [Encoding API Compatibility](https://developer.mozilla.org/en-US/docs/Web/API/Encoding_API/Encodings) for the encoding formats that can be specified. |
| `maxBinarySize` | Maximum binary size for BufferSource inputs (bytes) | `100 * 1024 * 1024` (100MB) | Set to `Number.POSITIVE_INFINITY` to disable (not recommended for untrusted input) |
| `decompression` | Decompression algorithm for compressed CSV inputs | | See [DecompressionStream Compatibility](https://developer.mozilla.org/en-US/docs/Web/API/DecompressionStream#browser_compatibility). Default support: gzip, deflate. deflate-raw is runtime-dependent and experimental (requires `allowExperimentalCompressions: true` for Response/Request inputs). |
| `ignoreBOM` | Whether to ignore Byte Order Mark (BOM) | `false` | See [TextDecoderOptions.ignoreBOM](https://developer.mozilla.org/en-US/docs/Web/API/TextDecoderStream/ignoreBOM) for more information about the BOM. |
| `fatal` | Throw an error on invalid characters | `false` | See [TextDecoderOptions.fatal](https://developer.mozilla.org/en-US/docs/Web/API/TextDecoderStream/fatal) for more information. |
| `allowExperimentalCompressions` | Allow experimental/future compression formats | `false` | When enabled, passes unknown compression formats to runtime. Use cautiously. See example below. |
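
For example, a sketch of `parseBinary` with a non-UTF-8 charset (Shift_JIS decoding depends on the runtime's Encoding API support):

```ts
import { parseBinary } from 'web-csv-toolbox';

// Bytes of a Shift_JIS-encoded CSV, e.g. read from a file or the network.
declare const sjisBytes: Uint8Array;

for await (const record of parseBinary(sjisBytes, {
  charset: 'shift_jis',
  fatal: true, // throw instead of emitting replacement characters
})) {
  console.log(record);
}
```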

## Performance & Best Practices ⚡

### Memory Characteristics

web-csv-toolbox uses different memory patterns depending on the API you choose:

#### 🌊 Streaming APIs (Memory Efficient)

##### Recommended for large files (> 10MB)

```js
import { parse } from 'web-csv-toolbox';

// ✅ Memory efficient: processes one record at a time
const response = await fetch('https://example.com/large-data.csv');
for await (const record of parse(response)) {
  console.log(record);
  // Memory footprint: ~few KB per iteration
}
```

- **Memory usage**: O(1) - constant per record
- **Suitable for**: Files of any size, browser environments
- **Max file size**: Limited only by available storage/network

#### 📦 Array-Based APIs (Memory Intensive)

##### Recommended for small files (< 1MB)

```js
import { parse } from 'web-csv-toolbox';

// ⚠️ Loads entire result into memory
const csv = await fetch('data.csv').then(r => r.text());
const records = await parse.toArray(csv);
// Memory footprint: entire file + parsed array
```

- **Memory usage**: O(n) - proportional to file size
- **Suitable for**: Small datasets, quick prototyping
- **Recommended max**: ~10MB (browser), ~100MB (Node.js)

### Platform-Specific Considerations

| Platform | Streaming | Array-Based | Notes |
|----------|-----------|-------------|-------|
| **Browser** | Any size | < 10MB | Browser heap limits apply (~100MB-4GB depending on browser) |
| **Node.js** | Any size | < 100MB | Use `--max-old-space-size` flag for larger heaps |
| **Deno** | Any size | < 100MB | Similar to Node.js |

### Performance Tips

#### 1. Use streaming for large files

```js
import { parse } from 'web-csv-toolbox';

const response = await fetch('https://example.com/large-data.csv');

// ✅ Good: Streaming approach (constant memory usage)
for await (const record of parse(response)) {
  // Process each record immediately
  console.log(record);
  // Memory footprint: O(1) - only one record in memory at a time
}

// ❌ Avoid: Loading entire file into memory first
const response2 = await fetch('https://example.com/large-data.csv');
const text = await response2.text(); // Loads entire file into memory
const records = await parse.toArray(text); // Loads all records into memory
for (const record of records) {
  console.log(record);
  // Memory footprint: O(n) - entire file + all records in memory
}
```

#### 2. Enable AbortSignal for timeout protection

```js
import { parse } from 'web-csv-toolbox';

// Set up a timeout of 30 seconds (30000 milliseconds)
const signal = AbortSignal.timeout(30000);

const response = await fetch('https://example.com/large-data.csv');

try {
  for await (const record of parse(response, { signal })) {
    // Process each record
    console.log(record);
  }
} catch (error) {
  if (error instanceof DOMException && error.name === 'TimeoutError') {
    // Handle timeout
    console.log('CSV processing was aborted due to timeout.');
  } else {
    // Handle other errors
    console.error('An error occurred during CSV processing:', error);
  }
}
```

#### 3. Use WebAssembly parser for CPU-intensive workloads (Experimental)

```js
import { parseStringToArraySyncWASM } from 'web-csv-toolbox';

// Compiled WASM code for improved performance (UTF-8 only)
// See CodSpeed benchmarks for actual performance metrics
const records = parseStringToArraySyncWASM(csvString);
```

### Known Limitations

- **Delimiter/Quotation**: Must be a single character (multi-character delimiters not supported)
- **WASM Parser**: UTF-8 encoding only, double-quote (`"`) only
- **Streaming**: Best performance with chunk sizes > 1KB

### Security Considerations

For production use with untrusted input, consider:
- Setting timeouts using `AbortSignal.timeout()` to prevent resource exhaustion
- Using `maxBinarySize` option to limit BufferSource inputs (default: 100MB bytes)
- Using `maxBufferSize` option to limit internal buffer size (default: 10M characters)
- Using `maxFieldCount` option to limit fields per record (default: 100,000)
- Implementing additional file size limits at the application level
- Validating parsed data before use
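
A sketch combining these limits (option names are from this README; the input and values below are placeholders to tune for your workload):

```ts
import { parse } from 'web-csv-toolbox';

declare const untrustedCsv: string; // e.g. the body of an uploaded file

const signal = AbortSignal.timeout(30_000); // hard stop after 30 seconds

try {
  for await (const record of parse(untrustedCsv, {
    signal,
    maxBufferSize: 1024 * 1024, // tighten the 10M-character default
    maxFieldCount: 1_000,       // tighten the 100,000-field default
  })) {
    console.log(record);
  }
} catch (error) {
  if (error instanceof RangeError) {
    console.error('Input exceeded a configured safety limit:', error.message);
  } else {
    throw error;
  }
}
```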

#### Implementing Size Limits for Untrusted Sources

When processing CSV files from untrusted sources (especially compressed files), you can implement size limits using a custom TransformStream:

```js
import { parse } from 'web-csv-toolbox';

// Create a size-limiting TransformStream
class SizeLimitStream extends TransformStream {
  constructor(maxBytes) {
    let bytesRead = 0;
    super({
      transform(chunk, controller) {
        bytesRead += chunk.length;
        if (bytesRead > maxBytes) {
          controller.error(new Error(`Size limit exceeded: ${maxBytes} bytes`));
        } else {
          controller.enqueue(chunk);
        }
      }
    });
  }
}

// Example: Limit decompressed data to 10MB
const response = await fetch('https://untrusted-source.com/data.csv.gz');
const limitedStream = response.body
  .pipeThrough(new DecompressionStream('gzip'))
  .pipeThrough(new SizeLimitStream(10 * 1024 * 1024)); // 10MB limit

try {
  for await (const record of parse(limitedStream)) {
    console.log(record);
  }
} catch (error) {
  if (error.message.includes('Size limit exceeded')) {
    console.error('File too large - possible compression bomb attack');
  }
}
```

**Note**: The library automatically validates Content-Encoding headers when parsing Response objects, rejecting unsupported compression formats.

#### Using Experimental Compression Formats

By default, the library only supports well-tested compression formats: `gzip` and `deflate`. Some runtimes may support additional formats like `deflate-raw` or Brotli, but these are runtime-dependent and not guaranteed. If you need to use these formats, you can enable experimental mode:

```js
import { parse } from 'web-csv-toolbox';

// ✅ Default behavior: only known formats
const response = await fetch('data.csv.gz');
await parse.toArray(response); // Works

// ⚠️ Experimental: allow future formats
const response2 = await fetch('data.csv.br'); // Brotli compression
try {
  await parse.toArray(response2, { allowExperimentalCompressions: true });
  // Works if the runtime supports Brotli
} catch (error) {
  // The runtime will throw if the format is unsupported
  console.error('Runtime does not support this compression format');
}
```

**When to use this:**
- Your runtime supports a newer compression format (e.g., Brotli in modern browsers)
- You want to use the format before this library explicitly supports it
- You trust the compression format source

**Cautions:**
- Error messages will come from the runtime, not this library
- No library-level validation for unknown formats
- You must verify your runtime supports the format

## How to Contribute 💪

### Star ⭐

The easiest way to contribute is to use the library and star the [repository](https://github.com/kamiazya/web-csv-toolbox/).

### Questions 💭

Feel free to ask questions on [GitHub Discussions](https://github.com/kamiazya/web-csv-toolbox/discussions).

### Report bugs / request additional features 💡

Please create an issue at [GitHub Issues](https://github.com/kamiazya/web-csv-toolbox/issues/new/choose).

### Financial Support 💸

Please support [kamiazya](https://github.com/sponsors/kamiazya).

> Even just a dollar is enough motivation to develop 😊

## License ⚖️

This software is released under the MIT License, see [LICENSE](https://github.com/kamiazya/web-csv-toolbox/blob/main/LICENSE).

[![FOSSA Status](https://app.fossa.com/api/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox.svg?type=large)](https://app.fossa.com/projects/git%2Bgithub.com%2Fkamiazya%2Fweb-csv-toolbox?ref=badge_large)