Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/chengang95/UnKD
https://github.com/chengang95/UnKD
Last synced: 3 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/chengang95/UnKD
- Owner: chengang95
- Created: 2022-11-03T05:26:26.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-06-15T08:41:54.000Z (over 1 year ago)
- Last Synced: 2024-08-01T22:38:07.180Z (6 months ago)
- Language: Python
- Size: 81.1 KB
- Stars: 14
- Watchers: 1
- Forks: 1
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# UnKD
This is an implemention for our WSDM 2023 paper "Unbiased Knowledge Distillation for Recommendation" based on Pytorch.## Requirements
+ pytorch == 1.8.0
+ Numpy
+ python3
+ pandas == 0.25.0
+ tqdm## Datasets
Data preprocessing through python preprocess_xxx.py
+ Movielens:[original data](https://grouplens.org/datasets/movielens/); Data preprocessing code:preprocess_ml1m.py
+ CiteULike:[original data](https://github.com/changun/CollMetric); Data preprocessing code:preprocess_cite.py
+ Apps:[original data](http://jmcauley.ucsd.edu/data/amazon/links.html); Data preprocessing code:preprocess_app.py## Parameters
Key parameters in distill_new_api.py:
+ --split_group: The number of divided groups.
+ --sample_num: The number of samples.
+ --recdim: Model dimensions.
+ -- model: MF or LightGCN.
+ --teacher_model: MF or LightGCN.
+ --teacher_dim: Teacher model dimensions.
+ --sampler: Distillation method, such as: UnKD.
+ --datasetpkl: Dataset save location.
+ others: read help, or "python xxx.py --help"## Commands
We provide following commands for our method and baselines.The following two steps are required.
#### 1. Train a teacher model:
+ First, we need to train a teacher model.
```
python -u train_new_api.py --dataset=Apps --datasetpkl=$location --recdim=100 --model=MF --decay=0.01 --epochs=1000 --lr=0.01 --seed=2022
```
#### 2. Simply Reproduce the Results:
+ Then you can run the following commands to reproduce the results in our paper. Note that the name of the teacher model trained in the previous step needs to be modified in the code...
```
python -u distill_new_api.py --split_group=4 --sample_num=30 --dataset=Apps --datasetpkl=$location --recdim=10 --model=MF --teacher_model=MF --decay=0.01 --epochs=1000 --lr=0.01 --seed=2022 --stopflag=ndcg --teacher_dim=100 --kd_weight=1.0 --sampler=UnKD
```
## Citation
If you use our codes in your research, please cite our paper.