https://github.com/uliontse/mlgb

MLGB (「妙计包」) is a library of 50+ deep models for click-through rate (CTR) prediction and recommender systems, implemented in TensorFlow and PyTorch.
Host: GitHub
URL: https://github.com/uliontse/mlgb
Owner: UlionTse
License: apache-2.0
Created: 2024-01-02T18:18:10.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2025-03-02T02:41:11.000Z (8 months ago)
Last Synced: 2025-05-12T12:07:18.944Z (5 months ago)
Topics: autoint, ctr-prediction, dcn, deep-learning, deepfm, din, dsin, dssm, edcn, esmm, fibinet, machine-learning, masknet, mind, mmoe, pepnet, ple, pnn, recommender-system, xdeepfm
Language: Python
Homepage:
Size: 548 KB
Stars: 689
Watchers: 8
Forks: 22
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: change_log.txt
- License: LICENSE
* * *

**MLGB** means **M**achine **L**earning of the **G**reat **B**oss, and is called **「妙计包」**.  

**MLGB** is a library of 50+ CTR prediction and recommender system models, implemented in TensorFlow and PyTorch.

- [Advantages](#advantages)

- [Supported Models](#supported-models)

- [Installation](#installation)

- [Getting Started](#getting-started)

- [Code Examples](#code-examples)

- [Citation](#citation)

## Advantages

- **Easy!** Use `mlgb.get_model(model_name, **kwargs)` to get a complex model.

- **Fast!** Better performance through better code.

- **Enjoyable!** 50+ ranking and matching models to use, and 2 frameworks (TensorFlow and PyTorch) to deploy with.
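
The `get_model(model_name, **kwargs)` entry point follows a name-based factory pattern. The sketch below is a toy illustration of that pattern only, not MLGB's implementation: the registry `_TOY_MODELS`, the `register` decorator, the `ToyLR` class, and `get_toy_model` are all hypothetical names introduced here.

```python
from typing import Any, Callable, Dict

# Hypothetical registry mapping model names to constructors (not part of MLGB).
_TOY_MODELS: Dict[str, Callable[..., Any]] = {}


def register(name: str) -> Callable[[Callable[..., Any]], Callable[..., Any]]:
    """Decorator that records a constructor under a lookup name."""
    def decorator(cls: Callable[..., Any]) -> Callable[..., Any]:
        _TOY_MODELS[name] = cls
        return cls
    return decorator


@register("LR")
class ToyLR:
    """Stand-in model that only stores the arguments it receives."""
    def __init__(self, feature_names, task="binary", **kwargs):
        self.feature_names = feature_names
        self.task = task
        self.kwargs = kwargs


def get_toy_model(feature_names, model_name="LR", **kwargs):
    """Look up a constructor by name and forward the remaining kwargs to it."""
    try:
        model_cls = _TOY_MODELS[model_name]
    except KeyError:
        raise ValueError(f"unknown model_name: {model_name!r}") from None
    return model_cls(feature_names, **kwargs)


model = get_toy_model(feature_names=(), model_name="LR", task="regression", embed_dim=16)
print(type(model).__name__, model.task, model.kwargs)
```

The point of the pattern is that callers switch models by changing one string, while model-specific options travel through `**kwargs`.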

## Supported Models

| ID | Model Name | Paper Link | Paper Team | Paper Year |
| --- | --- | --- | --- | --- |
| :open_file_folder: | **Ranking-Model::Normal** :point_down: | | | |
| 1 | LR | [Predicting Clicks: Estimating the Click-Through Rate for New Ads](https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/predictingclicks.pdf) | Microsoft | 2007 |
| 2 | PLM/MLR | [Learning Piece-wise Linear Models from Large Scale Data for Ad Click Prediction](https://arxiv.org/pdf/1704.05194.pdf) | Alibaba | 2017 |
| 3 | MLP/DNN | [Neural Networks for Pattern Recognition](http://diyhpl.us/~bryan/papers2/ai/ahuman-pdf-only/neural-networks/2005-Pattern%20Recognition.pdf) | Christopher M. Bishop(Microsoft, 1997-Present), Foreword by Geoffrey Hinton. | 1995 |
| 4 | DLRM | [Deep Learning Recommendation Model for Personalization and Recommendation Systems](https://arxiv.org/pdf/1906.00091.pdf) | Facebook(Meta) | 2019 |
| 5 | MaskNet | [MaskNet: Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided Mask](https://arxiv.org/pdf/2102.07619.pdf) | Weibo(Sina) | 2021 |
| | | | | |
| 6 | DCM/DeepCross | [Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features](https://www.kdd.org/kdd2016/papers/files/adf0975-shanA.pdf) | Microsoft | 2016 |
| 7 | DCN | [DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems](https://arxiv.org/pdf/2008.13535.pdf), [v1](https://arxiv.org/pdf/1708.05123.pdf) | Google(Alphabet) | 2017, 2020 |
| 8 | EDCN | [Enhancing Explicit and Implicit Feature Interactions via Information Sharing for Parallel Deep CTR Models](https://dlp-kdd.github.io/assets/pdf/DLP-KDD_2021_paper_12.pdf) | Huawei | 2021 |
| | | | | |
| 9 | FM | [Factorization Machines](https://cseweb.ucsd.edu/classes/fa17/cse291-b/reading/Rendle2010FM.pdf) | Steffen Rendle(Google, 2013-Present) | 2010 |
| 10 | FFM | [Field-aware Factorization Machines for CTR Prediction](https://www.csie.ntu.edu.tw/~cjlin/papers/ffm.pdf) | NTU | 2016 |
| 11 | HOFM | [Higher-Order Factorization Machines](https://arxiv.org/pdf/1607.07195v2.pdf) | NTT | 2016 |
| 12 | FwFM | [Field-weighted Factorization Machines for Click-Through Rate Prediction in Display Advertising](https://arxiv.org/pdf/1806.03514.pdf) | Junwei Pan(Yahoo), etc. | 2018, 2020 |
| 13 | FmFM | [FM^2: Field-matrixed Factorization Machines for Recommender Systems](https://arxiv.org/pdf/2102.12994v2.pdf) | Yahoo | 2021 |
| 14 | FEFM | [FIELD-EMBEDDED FACTORIZATION MACHINES FOR CLICK-THROUGH RATE PREDICTION](https://arxiv.org/pdf/2009.09931v2.pdf) | Harshit Pande(Adobe) | 2020, 2021 |
| 15 | AFM | [Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks](https://arxiv.org/pdf/1708.04617.pdf) | ZJU&NUS(Jun Xiao(ZJU), Xiangnan He(NUS), etc.) | 2017 |
| 16 | LFM | [Learning Feature Interactions with Lorentzian Factorization Machine](https://arxiv.org/pdf/1911.09821.pdf) | EBay | 2019 |
| 17 | IFM | [An Input-aware Factorization Machine for Sparse Prediction](https://www.ijcai.org/proceedings/2019/0203.pdf) | THU | 2019 |
| 18 | DIFM | [A Dual Input-aware Factorization Machine for CTR Prediction](https://www.ijcai.org/proceedings/2020/0434.pdf) | THU | 2020 |
| | | | | |
| 19 | FNN | [Deep Learning over Multi-field Categorical Data – A Case Study on User Response Prediction](https://arxiv.org/pdf/1601.02376.pdf) | UCL(Weinan Zhang(UCL, SJTU), etc.) | 2016 |
| 20 | PNN | [Product-based Neural Networks for User Response](https://arxiv.org/pdf/1611.00144.pdf) | SJTU&UCL(Yanru Qu(SJTU), Weinan Zhang(SJTU, UCL), etc.) | 2016 |
| 21 | PIN | [Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data](https://arxiv.org/pdf/1807.00311.pdf) | Huawei(Yanru Qu(Huawei(2017.3-2018.3), SJTU), Weinan Zhang(SJTU, UCL), etc.) | 2018 |
| 22 | ONN/NFFM | [Operation-aware Neural Networks for User Response Prediction](https://arxiv.org/pdf/1904.12579.pdf) | NJU | 2019 |
| 23 | AFN | [Adaptive Factorization Network: Learning Adaptive-Order Feature Interactions](https://arxiv.org/pdf/1909.03276v2.pdf) | SJTU | 2019, 2020 |
| | | | | |
| 24 | NFM | [Neural Factorization Machines for Sparse Predictive Analytics](https://arxiv.org/pdf/1708.05027.pdf) | NUS(Xiangnan He(NUS)) | 2017 |
| 25 | WDL | [Wide & Deep Learning for Recommender Systems](https://arxiv.org/pdf/1606.07792.pdf) | Google(Alphabet) | 2016 |
| 26 | DeepFM | [DeepFM: A Factorization-Machine based Neural Network for CTR Prediction](https://arxiv.org/pdf/1703.04247.pdf) | Huawei | 2017 |
| 27 | DeepFEFM | [FIELD-EMBEDDED FACTORIZATION MACHINES FOR CLICK-THROUGH RATE PREDICTION](https://arxiv.org/pdf/2009.09931v2.pdf) | Harshit Pande(Adobe) | 2020, 2021 |
| 28 | FLEN | [FLEN: Leveraging Field for Scalable CTR Prediction](https://arxiv.org/pdf/1911.04690v4.pdf) | Meitu | 2019, 2020 |
| | | | | |
| 29 | CCPM | [A Convolutional Click Prediction Model](http://wnzhang.net/share/rtb-papers/cnn-ctr.pdf) | CASIA | 2015 |
| 30 | FGCNN | [Feature Generation by Convolutional Neural Network for Click-Through Rate Prediction](https://arxiv.org/pdf/1904.04447.pdf) | Huawei | 2019 |
| 31 | XDeepFM | [xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems](https://arxiv.org/pdf/1803.05170v3.pdf) | Microsoft(Jianxun Lian(USTC, Microsoft(2018.7-Present)), etc.) | 2018 |
| 32 | FiBiNet | [FiBiNET: Combining Feature Importance and Bilinear feature Interaction for Click-Through Rate Prediction](https://arxiv.org/pdf/1905.09433.pdf) | Weibo(Sina) | 2019 |
| 33 | AutoInt | [AutoInt: Automatic Feature Interaction Learning via Self-Attentive Neural Networks](https://arxiv.org/pdf/1810.11921v2.pdf) | PKU | 2018, 2019 |
| :open_file_folder: | **Ranking-Model::Sequential** :point_down: | | | |
| 34 | GRU4Rec | [Session-based Recommendations with Recurrent Neural Networks](https://arxiv.org/pdf/1511.06939.pdf) | Telefonica | 2015, 2016 |
| 35 | Caser | [Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding](https://arxiv.org/pdf/1809.07426.pdf) | SFU | 2018 |
| 36 | SASRec | [Self-Attentive Sequential Recommendation](https://arxiv.org/pdf/1808.09781.pdf) | UCSD | 2018 |
| 37 | BERT4Rec | [BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer](https://arxiv.org/pdf/1904.06690.pdf) | Alibaba | 2019 |
| 38 | BST | [Behavior Sequence Transformer for E-commerce Recommendation in Alibaba](https://arxiv.org/pdf/1905.06874.pdf) | Alibaba | 2019 |
| 39 | DIN | [Deep Interest Network for Click-Through Rate Prediction](https://arxiv.org/pdf/1706.06978v4.pdf), [v1](https://arxiv.org/pdf/1706.06978v1.pdf) | Alibaba | 2017, 2018 |
| 40 | DIEN | [Deep Interest Evolution Network for Click-Through Rate Prediction](https://arxiv.org/pdf/1809.03672.pdf) | Alibaba | 2018 |
| 41 | DSIN | [Deep Session Interest Network for Click-Through Rate Prediction](https://arxiv.org/pdf/1905.06482.pdf) | Alibaba | 2019 |
| :open_file_folder: | **Ranking-Model::Multitask** :point_down: | | | |
| 42 | SharedBottom | [An Overview of Multi-Task Learning in Deep Neural Networks](https://arxiv.org/pdf/1706.05098.pdf) | Sebastian Ruder(InsightCentre) | 2017 |
| 43 | ESMM | [Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate](https://arxiv.org/pdf/1804.07931.pdf) | Alibaba | 2018 |
| 44 | MMoE | [Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts](https://dl.acm.org/doi/pdf/10.1145/3219819.3220007) | Google(Alphabet) | 2018 |
| 45 | PLE | [Progressive Layered Extraction (PLE): A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations](https://www.sci-hub.se/10.1145/3383313.3412236) | Tencent | 2020 |
| 46 | PEPNet | [PEPNet: Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information](https://arxiv.org/pdf/2302.01115.pdf) | Kuaishou | 2023 |
| :open_file_folder: | **Matching-Model** :point_down: | | | |
| 47 | NCF | [Neural Collaborative Filtering](https://arxiv.org/pdf/1708.05031.pdf) | NUS(Xiangnan He(NUS), etc) | 2017 |
| 48 | MatchFM | [Factorization Machines](https://cseweb.ucsd.edu/classes/fa17/cse291-b/reading/Rendle2010FM.pdf) | Steffen Rendle(Google, 2013-Present) | 2010 |
| 49 | DSSM | [Learning deep structured semantic models for web search using clickthrough data](https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/cikm2013_DSSM_fullversion.pdf) | Microsoft | 2013 |
| 50 | EBR | [Embedding-based Retrieval in Facebook Search](https://browse.arxiv.org/pdf/2006.11632.pdf) | Facebook(Meta) | 2020 |
| 51 | YoutubeDNN | [Deep Neural Networks for YouTube Recommendations](https://static.googleusercontent.com/media/research.google.com/zh-CN//pubs/archive/45530.pdf) | Google(Alphabet) | 2016 |
| 52 | MIND | [Multi-Interest Network with Dynamic Routing for Recommendation at Tmall](https://arxiv.org/pdf/1904.08030.pdf) | Alibaba | 2019 |
| | | | | |

## Installation

```sh
# PyPI
pip install --upgrade mlgb

# Conda
conda install conda-forge::mlgb
```
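
After installing, one way to confirm that the package is visible to the current interpreter is to query its distribution metadata. This is a minimal sketch using the standard library's `importlib.metadata` (Python 3.8+); `installed_version` is a hypothetical helper written for this example and is not part of MLGB.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("mlgb")
print("mlgb version:", version if version is not None else "not installed")
```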

## Getting Started

```python
import mlgb

# parameters of get_model:
help(mlgb.get_model)

"""
get_model(feature_names, model_name='LR', task='binary', aim='ranking', lang='TensorFlow', device=None, seed=None, **kwargs)
    :param feature_names: tuple(tuple(dict)), must. Embedding need vocabulary size and custom embed_dim of features.
    :param model_name: str, default 'LR'. Union[`mlgb.ranking_models`, `mlgb.matching_models`, `mlgb.mtl_models`]
    :param task: str, default 'binary'. Union['binary', 'regression', 'multiclass:{int}']
    :param aim: str, default 'ranking'. Union['ranking', 'matching', 'mtl']
    :param lang: str, default 'TensorFlow'. Union['TensorFlow', 'PyTorch', 'tf', 'torch']
    :param device: Optional[str, int], default None. Only for PyTorch.
    :param seed: Optional[int], default None.
    :param **kwargs: more model parameters by `mlgb.get_model_help(model_name)`.
"""

# parameters of model:
mlgb.get_model_help(model_name='LR', lang='tf')

"""
 class LR(tf.keras.src.models.model.Model)
 |  LR(feature_names, task='binary', seed=None, inputs_if_multivalued=False, inputs_if_sequential=False, inputs_if_embed_dense=False, embed_dim=32, embed_2d_dim=None, embed_l2=0.0, embed_initializer=None, pool_mv_mode='Pooling:average', pool_mv_axis=2, pool_mv_l2=0.0, pool_mv_initializer=None, pool_seq_mode='Pooling:average', pool_seq_axis=1, pool_seq_l2=0.0, pool_seq_initializer=None, linear_if_bias=True, linear_l1=0.0, linear_l2=0.0, linear_initializer=None)
 |
 |  Methods defined here:
 |
 |  __init__(self, feature_names, task='binary', seed=None, inputs_if_multivalued=False, inputs_if_sequential=False, inputs_if_embed_dense=False, embed_dim=32, embed_2d_dim=None, embed_l2=0.0, embed_initializer=None, pool_mv_mode='Pooling:average', pool_mv_axis=2, pool_mv_l2=0.0, pool_mv_initializer=None, pool_seq_mode='Pooling:average', pool_seq_axis=1, pool_seq_l2=0.0, pool_seq_initializer=None, linear_if_bias=True, linear_l1=0.0, linear_l2=0.0, linear_initializer=None)
 |      Model Name: LR(LinearOrLogisticRegression)
 |      Paper Team: Microsoft
 |      Paper Year: 2007
 |      Paper Name: 
 |      Paper Link: https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/predictingclicks.pdf
 |
 |      Task Inputs Parameters:
 |          :param feature_names: tuple(tuple(dict)), must. Embedding need vocabulary size and custom embed_dim of features.
 |          :param task: str, default 'binary'. Union['binary', 'regression']
 |          :param seed: Optional[int], default None.
 |          :param inputs_if_multivalued: bool, default False.
 |          :param inputs_if_sequential: bool, default False.
 |          :param inputs_if_embed_dense: bool, default False.
 |          :param embed_dim: int, default 32.
 |          :param embed_2d_dim: Optional[int], default None. When None, each field has own embed_dim by feature_names.
 |          :param embed_l2: float, default 0.0.
 |          :param embed_initializer: Optional[str], default None. When None, activation judge first, xavier_normal end.
 |          :param pool_mv_mode: str, default 'Pooling:average'. Pooling mode of multivalued inputs. Union[
 |                              'Attention', 'Weighted', 'Pooling:max', 'Pooling:average', 'Pooling:sum']
 |          :param pool_mv_axis: int, default 2. Pooling axis of multivalued inputs.
 |          :param pool_mv_l2: float, default 0.0. When pool_mv_mode is in ('Weighted', 'Attention'), it works.
 |          :param pool_mv_initializer: Optional[str], default None. When None, activation judge first,
 |                              xavier_normal end. When pool_mv_mode is in ('Weighted', 'Attention'), it works.
 |          :param pool_seq_mode: str, default 'Pooling:average'. Pooling mode of sequential inputs. Union[
 |                              'Attention', 'Weighted', 'Pooling:max', 'Pooling:average', 'Pooling:sum']
 |          :param pool_seq_axis: int, default 1. Pooling axis of sequential inputs.
 |          :param pool_seq_l2: float, default 0.0. When pool_seq_mode is in ('Weighted', 'Attention'), it works.
 |          :param pool_seq_initializer: Optional[str], default None. When None, activation judge first,
 |                              xavier_normal end. When pool_seq_mode is in ('Weighted', 'Attention'), it works.
 |
 |      Task Model Parameters:
 |          :param linear_if_bias: bool, default True.
 |          :param linear_l1: float, default 0.0.
 |          :param linear_l2: float, default 0.0.
 |          :param linear_initializer: Optional[str], default None. When None, activation judge first, xavier_normal end.
"""
```
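
The `task` and `lang` parameters documented above accept a small set of string spellings, including the `'multiclass:{int}'` form and the `'tf'`/`'torch'` aliases. The sketch below shows one way such strings could be validated before calling `get_model`. It is an illustration under assumptions, not MLGB's own parsing: `normalize_lang` and `parse_task` are hypothetical helpers, and the requirement of at least 2 classes is this sketch's choice.

```python
from typing import Optional, Tuple

# Accepted spellings come from the `lang` docstring; the canonical names on
# the right are this sketch's own choice.
_LANG_ALIASES = {
    "TensorFlow": "TensorFlow",
    "tf": "TensorFlow",
    "PyTorch": "PyTorch",
    "torch": "PyTorch",
}


def normalize_lang(lang: str) -> str:
    """Map an accepted `lang` spelling to one canonical framework name."""
    try:
        return _LANG_ALIASES[lang]
    except KeyError:
        raise ValueError(f"unsupported lang: {lang!r}") from None


def parse_task(task: str) -> Tuple[str, Optional[int]]:
    """Split a task string into (kind, n_classes); n_classes is None unless multiclass."""
    if task in ("binary", "regression"):
        return task, None
    kind, sep, count = task.partition(":")
    # Assumption of this sketch: a multiclass task needs at least 2 classes.
    if kind == "multiclass" and sep and count.isdecimal() and int(count) >= 2:
        return kind, int(count)
    raise ValueError(f"unsupported task: {task!r}")


print(normalize_lang("torch"))
print(parse_task("multiclass:5"))
```

Failing early with a `ValueError` keeps a misspelled option from surfacing later as a less readable framework error.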

## Code Examples

| Code Examples                                                          |
| ---------------------------------------------------------------------- |
| [TensorFlow](https://github.com/UlionTse/mlgb/tree/main/mlgb/examples) |
| [PyTorch](https://github.com/UlionTse/mlgb/tree/main/mlgb/examples)    |

## Citation

If you use this for research, please cite it using the following BibTeX entry. Thanks.

```bibtex
@misc{uliontse2020mlgb,
  author = {UlionTse},
  title = {MLGB is a library that includes many models of CTR Prediction \& Recommender System by TensorFlow \& PyTorch},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub Repository},
  howpublished = {\url{https://github.com/UlionTse/mlgb}},
}
```