https://github.com/jmhessel/fmpytorch

A PyTorch implementation of a Factorization Machine module in cython.
https://github.com/jmhessel/fmpytorch

Last synced: 5 months ago
JSON representation

A PyTorch implementation of a Factorization Machine module in cython.

Host: GitHub
URL: https://github.com/jmhessel/fmpytorch
Owner: jmhessel
License: mit
Created: 2017-08-15T03:30:10.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2018-03-12T18:12:44.000Z (almost 8 years ago)
Last Synced: 2024-11-27T04:31:05.920Z (about 1 year ago)
Language: Python
Size: 288 KB
Stars: 173
Watchers: 10
Forks: 18
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

Awesome-pytorch-list-CNVersion - fmpytorch
Awesome-pytorch-list - fmpytorch

README

          # fmpytorch

A library for factorization machines in pytorch. A factorization

machine is like a linear model, except multiplicative interaction

terms between the variables are modeled as well.

The input to a factorization machine layer is a vector, and the output

is a scalar. Batching is fully supported.

This is a work in progress. Feedback and bugfixes welcome! Hopefully you

find the code useful.

## Usage

The factorization machine layers in fmpytorch can be used just like any other built-in module. Here's a simple feed-forward model using a factorization machine that takes in a 50-D input, and models interactions using `k=5` factors.

```python

import torch

from fmpytorch.second_order.fm import FactorizationMachine

class MyModel(torch.nn.Module):

    def __init__(self):

        super(MyModel, self).__init__()

        self.linear = torch.nn.Linear(100, 50)

        self.dropout = torch.nn.Dropout(.5)

	# This makes a fm layer mapping from 50-D to 1-D.

	# The number of factors is 5.

        self.fm = FactorizationMachine(50, 5)

    def forward(self, x):

        x = self.linear(x)

        x = self.dropout(x)

        x = self.fm(x)

        return x

```

See examples/toy.py or examples/regression.py for fuller examples.

## Installation

This package requires pytorch, numpy, and cython.

To install, you can run:

```

cd fmpytorch

sudo python setup.py install

```

## Factorization Machine brief intro

A linear model, given a vector `x` models its output `y` as







where `w` are the learnable weights of the model.

However, the interactions between the input variables `x_i` are purely additive. In some cases, it might be useful to model the interactions between your variables, e.g., `x_i * x_j`. You could add terms into your model like







However, this introduces a large number of `w2` variables. Specifically, there are `O(n^2)` parameters introduced in this formulation, one for each interaction pair. A factorization machine approximates `w2` using low dimensional factors, i.e.,







where each `v_i` is a low-dimensional vector. This is the forward pass of a second order factorization machine. This low-rank re-formulation has reduced the number of additional parameters for the factorization machine to `O(k*n)`. Magically, the forward (and backward) pass can be reformulated so that it can be computed in `O(k*n)`, rather than the naive `O(k*n^2)` formulation above.

## Currently supported features

Currently, only a second order factorization machine is supported. The

forward and backward passes are implemented in cython. Compared to the

autodiff solution, the cython passes run several orders of magnitude

faster. I've only tested it with python 2 at the moment.

## TODOs

0. Support for sparse tensors.

1. More interesting useage examples

2. More testing, e.g., with python 3, etc.

3. Make sure all of the code plays nice with torch-specific stuff, e.g., GPUs

4. Arbitrary order factorization machine support

5. Better organization/code cleaning

## Thanks to

Vlad Niculae (@vene) for his sage wisdom.

The original factorization machine citation, which this layer is based off of, is

```

@inproceedings{rendle2010factorization,

	       title={Factorization machines},

    	       author={Rendle, Steffen},

      	       booktitle={ICDM},

               pages={995--1000},

	       year={2010},

	       organization={IEEE}

}

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jmhessel/fmpytorch

Awesome Lists containing this project

README