Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mljs/libsvm

LIBSVM for the browser and nodejs :fire:
https://github.com/mljs/libsvm

classification cross-validation emscripten libsvm regression support-vector-machine svm

Last synced: about 1 month ago
JSON representation

LIBSVM for the browser and nodejs :fire:

Awesome Lists containing this project

README

        

[![NPM version][npm-image]][npm-url]
[![npm download][download-image]][download-url]

# [DEMO](https://mljs.github.io/libsvm/)

Port of to port libsvm v3.22 using [emscripten](https://github.com/kripken/emscripten) , for usage in the browser or nodejs. 2 build targets: asm and WebAssembly.

What is libsvm?
libsvm is a [c++ library](https://github.com/cjlin1/libsvm) developped by Chih-Chung Chang and Chih-Jen Lin that allows to do support vector machine (aka SVM) classification and regression.

Resources about libsvm:
- [libsvm website](https://www.csie.ntu.edu.tw/~cjlin/libsvm/)
- [libsvm github repository](https://github.com/cjlin1/libsvm)
- [Wikipedia article](https://en.wikipedia.org/wiki/Support_vector_machine)
- Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011.

# Usage
## Install
```bash
npm install libsvm-js
```

## Load in nodejs
The main entry point loads the WebAssembly build and is asynchronous.
```js
require('libsvm-js').then(SVM => {
const svm = new SVM(); // ...
});
```

There is an alternative entry point if you want to use asm build. This entrypoint is synchronous.
```js
const SVM = require('libsvm-js/asm');
const svm = new SVM(); // ...
```

## Load in the browser
The npm package contains a bundle for the browser that works with AMD and browser globals. There is one bundle for the asm build and another for the web assembly build. They are located in the `dist/browser` directory of the package. You can load them into your web page with a `script` tag. For the web assembly module, make sure that the libsvm.wasm file is served from the same relative path as the js file.

## Basic usage
This example illustrates how to use the library to train and use an SVM classifier.
```js

async function xor() {
const SVM = await
require('libsvm-js');
const svm = new SVM({
kernel: SVM.KERNEL_TYPES.RBF, // The type of kernel I want to use
type: SVM.SVM_TYPES.C_SVC, // The type of SVM I want to run
gamma: 1, // RBF kernel gamma parameter
cost: 1 // C_SVC cost parameter
});

// This is the xor problem
//
// 1 0
// 0 1
const features = [[0, 0], [1, 1], [1, 0], [0, 1]];
const labels = [0, 0, 1, 1];
svm.train(features, labels); // train the model
const predictedLabel = svm.predictOne([0.7, 0.8]);
console.log(predictedLabel) // 0
}

xor().then(() => console.log('done!'));
```

# Benchmarks
You can compare the performance of the library in various environments. Run `npm run benchmark` to run the benchmarks with native c/c++ code and with the compiled code with your local version of node.js. For browser performance, go to the [web benchmark page](https://mljs.github.io/libsvm/#benchmarks).

Speed is mainly affected by the javascript engine that compiles it. Since WebAssembly has been stabilized and is an optimization phase, more recent engines are almost always faster.

Speed is also affected by the version of emscripten that generated the build or the options used in the build. I will try to keep up with any improvement that might significantly impact the performance.

## Cross-validation benchmark
I report the results here for the cross-validation benchmark on the iris dataset to get a feeling for how performance compares on different platforms. There are other benchmarks that can be run from the terminal in node.js or in the browser. The performance results are given relative to how they run natively (with compiled c++ code). The benchmarks only consider runtime performance, not load and parse performance.

| Platform | Rel asm perf | Rel wasm perf |
| --- | --- | --- |
| Native | 100% | 100% |
| Node.js 8.1.2 | 34.2% | 52.6% |
| Node.js v7.10.0 | 14.4% | N/A |
| Chrome 59.0.3071.115 | 36.2% | 51.3% |
| Firefox 54.0 | 35.5% | 70.4% |

# What are asm and WebAssembly ?
From [asmjs.org](http;//asmjs.org)
> asm is an optimizable subset of javascript.

From [webassembly.org](http://webassembly.org)
> WebAssembly or wasm is a new portable, size- and load-time-efficient format suitable for compilation to the web

# Should I use asm or WebAssembly ?
Both. You should try to use WebAssembly first and fall back to asm in order to support all browsers.

WebAssembly is currently supported in the latest stable versions of Chrome, Firefox and on preview versions of Safari and Edge.

# API Documentation

## SVM
**Kind**: global class

* [SVM](#SVM)
* [new SVM(options)](#new_SVM_new)
* _instance_
* [.train(samples, labels)](#SVM+train)
* [.crossValidation(samples, labels, kFold)](#SVM+crossValidation) ⇒ Array.<number>
* [.free()](#SVM+free)
* [.predictOne(sample)](#SVM+predictOne) ⇒ number
* [.predict(samples)](#SVM+predict) ⇒ Array.<number>
* [.predictProbability(samples)](#SVM+predictProbability) ⇒ Array.<object>
* [.predictOneProbability(sample)](#SVM+predictOneProbability) ⇒ object
* [.predictOneInterval(sample, confidence)](#SVM+predictOneInterval) ⇒ object
* [.predictInterval(samples, confidence)](#SVM+predictInterval) ⇒ Array.<object>
* [.getLabels()](#SVM+getLabels) ⇒ Array.<number>
* [.getSVIndices()](#SVM+getSVIndices) ⇒ Array.<number>
* [.serializeModel()](#SVM+serializeModel) ⇒ string
* _static_
* [.SVM_TYPES](#SVM.SVM_TYPES) : Object
* [.KERNEL_TYPES](#SVM.KERNEL_TYPES) : Object
* [.load(serializedModel)](#SVM.load) ⇒ [SVM](#SVM)

### new SVM(options)

| Param | Type | Default | Description |
| --- | --- | --- | --- |
| options | object | | |
| [options.type] | number | SVM_TYPES.C_SVC | Type of SVM to perform, |
| [options.kernel] | number | KERNEL_TYPES.RBF | Kernel function, |
| [options.degree] | number | 3 | Degree of polynomial, for polynomial kernel |
| [options.gamma] | number | | Gamma parameter of the RBF, Polynomial and Sigmoid kernels. Default value is 1/num_features |
| [options.coef0] | number | 0 | coef0 parameter for Polynomial and Sigmoid kernels |
| [options.cost] | number | 1 | Cost parameter, for C SVC, Epsilon SVR and NU SVR |
| [options.nu] | number | 0.5 | For NU SVC and NU SVR |
| [options.epsilon] | number | 0.1 | For epsilon SVR |
| [options.cacheSize] | number | 100 | Cache size in MB |
| [options.tolerance] | number | 0.001 | Tolerance |
| [options.shrinking] | boolean | true | Use shrinking euristics (faster), |
| [options.probabilityEstimates] | boolean | false | weather to train SVC/SVR model for probability estimates, |
| [options.weight] | object | | Set weight for each possible class |
| [options.quiet] | boolean | true | Print info during training if false |

### svM.train(samples, labels)
Trains the SVM model.

**Kind**: instance method of [SVM](#SVM)
**Throws**:

- if SVM instance was instantiated from SVM.load.

| Param | Type | Description |
| --- | --- | --- |
| samples | Array.<Array.<number>> | The training samples. First level of array are the samples, second level are the individual features |
| labels | Array.<number> | The training labels. It should have the same size as the samples. If you are training a classification model, the labels should be distinct integers for each class. If you are training a regression model, each label should be the value of the predicted variable. |

### svM.crossValidation(samples, labels, kFold) ⇒ Array.<number>
Performs k-fold cross-validation (KF-CV). KF-CV separates the data-set into kFold random equally sized partitions,
and uses each as a validation set, with all other partitions used in the training set. Observations left over
from if kFold does not divide the number of observations are left out of the cross-validation process. If
kFold is one, this is equivalent to a leave-on-out cross-validation

**Kind**: instance method of [SVM](#SVM)
**Returns**: Array.<number> - The array of predicted labels produced by the cross validation. Has a size equal to
the number of samples provided as input.
**Throws**:

- if SVM instance was instantiated from SVM.load.

| Param | Type | Description |
| --- | --- | --- |
| samples | Array.<Array.<number>> | The training samples. |
| labels | Array.<number> | The training labels. |
| kFold | number | Number of datasets into which to split the training set. |

### svM.free()
Free the memory allocated for the model. Since this memory is stored in the memory model of emscripten, it is
allocated within an ArrayBuffer and WILL NOT BE GARBARGE COLLECTED, you have to explicitly free it. So
not calling this will result in memory leaks. As of today in the browser, there is no way to hook the
garbage collection of the SVM object to free it automatically.
Free the memory that was created by the compiled libsvm library to.
store the model. This model is reused every time the predict method is called.

**Kind**: instance method of [SVM](#SVM)

### svM.predictOne(sample) ⇒ number
Predict the label of one sample.

**Kind**: instance method of [SVM](#SVM)
**Returns**: number - - The predicted label.

| Param | Type | Description |
| --- | --- | --- |
| sample | Array.<number> | The sample to predict. |

### svM.predict(samples) ⇒ Array.<number>
Predict the label of many samples.

**Kind**: instance method of [SVM](#SVM)
**Returns**: Array.<number> - - The predicted labels.

| Param | Type | Description |
| --- | --- | --- |
| samples | Array.<Array.<number>> | The samples to predict. |

### svM.predictProbability(samples) ⇒ Array.<object>
Predict the label with probability estimate of many samples.

**Kind**: instance method of [SVM](#SVM)
**Returns**: Array.<object> - - An array of objects containing the prediction label and the probability estimates for each label

| Param | Type | Description |
| --- | --- | --- |
| samples | Array.<Array.<number>> | The samples to predict. |

### svM.predictOneProbability(sample) ⇒ object
Predict the label with probability estimate.

**Kind**: instance method of [SVM](#SVM)
**Returns**: object - - An object containing the prediction label and the probability estimates for each label

| Param | Type |
| --- | --- |
| sample | Array.<number> |

### svM.predictOneInterval(sample, confidence) ⇒ object
Predict a regression value with a confidence interval

**Kind**: instance method of [SVM](#SVM)
**Returns**: object - - An object containing the prediction value and the lower and upper bounds of the confidence interval

| Param | Type | Description |
| --- | --- | --- |
| sample | Array.<number> | |
| confidence | number | A value between 0 and 1. For example, a value 0.95 will give you the 95% confidence interval of the predicted value. |

### svM.predictInterval(samples, confidence) ⇒ Array.<object>
Predict regression values with confidence intervals

**Kind**: instance method of [SVM](#SVM)
**Returns**: Array.<object> - - An array of objects each containing the prediction label and the probability estimates for each label

| Param | Type | Description |
| --- | --- | --- |
| samples | Array.<Array.<number>> | An array of samples. |
| confidence | number | A value between 0 and 1. For example, a value 0.95 will give you the 95% confidence interval of the predicted value. |

### svM.getLabels() ⇒ Array.<number>
Get the array of labels from the model. Useful when creating an SVM instance with SVM.load

**Kind**: instance method of [SVM](#SVM)
**Returns**: Array.<number> - - The list of labels.

### svM.getSVIndices() ⇒ Array.<number>
Get the indices of the support vectors from the training set passed to the train method.

**Kind**: instance method of [SVM](#SVM)
**Returns**: Array.<number> - - The list of indices from the training samples.

### svM.serializeModel() ⇒ string
Uses libsvm's serialization method of the model.

**Kind**: instance method of [SVM](#SVM)
**Returns**: string - The serialization string.

### SVM.SVM_TYPES : Object
SVM classification and regression types

**Kind**: static property of [SVM](#SVM)
**Properties**

| Name | Description |
| --- | --- |
| C_SVC | The C support vector classifier type |
| NU_SVC | The nu support vector classifier type |
| ONE_CLASS | The one-class support vector classifier type |
| EPSILON_SVR | The epsilon support vector regression type |
| NU_SVR | The nu support vector regression type |

### SVM.KERNEL_TYPES : Object
SVM kernel types

**Kind**: static property of [SVM](#SVM)
**Properties**

| Name | Description |
| --- | --- |
| LINEAR | Linear kernel |
| POLYNOMIAL | Polynomial kernel |
| RBF | Radial basis function (gaussian) kernel |
| SIGMOID | Sigmoid kernel |

### SVM.load(serializedModel) ⇒ [SVM](#SVM)
Create a SVM instance from the serialized model.

**Kind**: static method of [SVM](#SVM)
**Returns**: [SVM](#SVM) - - SVM instance that contains the model.

| Param | Type | Description |
| --- | --- | --- |
| serializedModel | string | The serialized model. |

# LICENSE
BSD-3-Clause

[npm-image]: https://img.shields.io/npm/v/libsvm-js.svg?style=flat-square
[npm-url]: https://npmjs.org/package/libsvm-js
[download-image]: https://img.shields.io/npm/dm/libsvm-js.svg?style=flat-square
[download-url]: https://npmjs.org/package/libsvm-js