https://github.com/zzy979/cdl-lda
Cross-Domain Labeled LDA (CDL-LDA)
https://github.com/zzy979/cdl-lda
latent-dirichlet-allocation natural-language-processing topic-model
Last synced: 9 months ago
JSON representation
Cross-Domain Labeled LDA (CDL-LDA)
- Host: GitHub
- URL: https://github.com/zzy979/cdl-lda
- Owner: ZZy979
- Created: 2020-09-07T11:40:56.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-10-29T01:47:12.000Z (over 5 years ago)
- Last Synced: 2025-03-11T21:59:20.468Z (about 1 year ago)
- Topics: latent-dirichlet-allocation, natural-language-processing, topic-model
- Language: Python
- Homepage:
- Size: 48.8 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Cross-Domain Labeled LDA (CDL-LDA)
原论文:
## [CDL-LDA](models/cdllda.py)
原论文模型
## [CDL-LDA-un](models/cdllda_un.py)
原论文中的无监督版本的CDL-LDA模型
* 初始化过程中源域文档单词的主题组不使用文档的标签,而是随机生成
* 预测文档标签使用逻辑回归而不是文档中单词的主题组
## [CDL-LDA-soft](models/cdllda_soft.py)
使用soft prior的CDL-LDA模型
* 单词主题组可以使用soft prior
## [CDL-LDA-LR](models/cdllda_lr.py)
嵌入逻辑回归的CDL-LDA模型
* 单词的主题组不再对应文档标签,主题组的数量和标签数量可以不同
* 生成过程嵌入逻辑回归模型,和模型本身同时训练
* 预测文档标签使用训练得到的逻辑回归模型
## [CDL-LDA-soft-LR](models/cdllda_soft_lr.py)
CDL-LDA-soft + CDL-LDA-LR