https://github.com/gatenlp/topicllm_granularity_hallucination
Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling
- Host: GitHub
- URL: https://github.com/gatenlp/topicllm_granularity_hallucination
- Owner: GateNLP
- Created: 2024-05-01T10:41:22.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-06-09T23:01:22.000Z (7 months ago)
- Last Synced: 2024-11-13T18:43:40.102Z (2 months ago)
- Language: Jupyter Notebook
- Size: 6.79 MB
- Stars: 1
- Watchers: 5
- Forks: 0
- Open Issues: 0
- Metadata Files:
  - Readme: README.md
# TopicLLM_Granularity_Hallucination
Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling

```text
# TRL setup: https://huggingface.co/docs/trl/example_overview
python == 3.10.9
torch == 2.1.2+cu121
transformers == 4.37.0.dev0
trl == 0.7.9
```

## Set conda env:
```bash
conda env create -f topicllm.yml
```

## Set Accelerate config file:
```bash
# To use Hugging Face TRL, you'll need to generate an Accelerate config file
#also see: https://huggingface.co/docs/trl/example_overview
accelerate config
```

## Develop DPO Training Sample (i.e., Reconstruction Pipeline)
[Create DPO Training Samples Pipeline.ipynb](https://github.com/GateNLP/TopicLLM_Granularity_Hallucination/blob/main/Create%20DPO%20Training%20Samples%20Pipeline.ipynb)

## DPO Mistral 7B:
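The notebook linked above reconstructs the preference pairs used for DPO. Its exact output fields are not documented in this README, but TRL's DPO training expects each sample to pair one prompt with a preferred (`chosen`) and a dispreferred (`rejected`) completion. A minimal sketch of that format — all topic strings and the helper name are hypothetical, not taken from the notebook:

```python
# Hypothetical sketch: build one DPO preference sample for topic labelling.
# The "chosen" answer uses a topic at the desired granularity; the
# "rejected" answer is overly fine-grained (or hallucinated).
def build_dpo_sample(document, chosen_topic, rejected_topic):
    prompt = (
        "Assign a single high-level topic label to the document below.\n"
        f"Document: {document}\nTopic:"
    )
    # prompt / chosen / rejected is the column layout TRL's DPOTrainer reads.
    return {
        "prompt": prompt,
        "chosen": f" {chosen_topic}",
        "rejected": f" {rejected_topic}",
    }

samples = [
    build_dpo_sample(
        "The Bank of England raised interest rates by 0.25 points...",
        chosen_topic="Economy",                        # desired granularity
        rejected_topic="UK monetary policy, Q3 2023",  # overly fine-grained
    )
]
```

A dataset of such dicts can be passed to the training script in the next section.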
```bash
# code from Huggingface TRL
CUDA_VISIBLE_DEVICES=[your_device] accelerate launch finetune_Mistral7b.py \
    --model_name_or_path="mistralai/Mistral-7B-Instruct-v0.1" \
    --output_dir="mistral_new_Adapter"
```

## Merge Adapter Mistral 7B:
```bash
# code from Huggingface TRL
CUDA_VISIBLE_DEVICES=[your_device] python merge_peft_adapter.py \
    --base_model_name="mistralai/Mistral-7B-Instruct-v0.1" \
    --adapter_model_name="mistral_new_Adapter" \
    --output_dir="mistral_new_checkpoint"
```

## To extract topics via LLMs:
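`Seed_Topic_Dynamic.py` itself is not reproduced in this README. Purely as an illustration of the idea, a topic-extraction prompt seeded with already-known topics might be assembled like this — the prompt wording and function name are assumptions, not the repo's actual prompt:

```python
def make_topic_prompt(document, seed_topics):
    # Offering existing (seed) topics first discourages the model from
    # hallucinating near-duplicate labels at a different granularity.
    seeds = ", ".join(seed_topics) if seed_topics else "none yet"
    return (
        "You are labelling documents with concise, general topics.\n"
        f"Existing topics: {seeds}\n"
        "Reuse an existing topic if it fits; otherwise propose one new topic.\n"
        f"Document: {document}\n"
        "Topic:"
    )

prompt = make_topic_prompt(
    "Stocks fell after the central bank's announcement.",
    ["Economy", "Sport"],
)
```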
```bash
# Note: more than one GPU may be required to run LLaMA 13B models!
CUDA_VISIBLE_DEVICES="Your_GPU" python3 Seed_Topic_Dynamic.py
```

## To evaluate topics generated by LLMs:
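The metrics inside `topic_evaluation.py` are not reproduced here. One simple check in this spirit compares generated topics against gold labels and flags generated topics that never match any reference as potential hallucinations — the case-insensitive matching rule below is an assumption for illustration:

```python
def hallucination_rate(generated, reference):
    """Fraction of generated topics with no (case-insensitive) match
    in the reference topic set -- a crude proxy for hallucination."""
    refs = {t.lower() for t in reference}
    unmatched = [t for t in generated if t.lower() not in refs]
    return len(unmatched) / len(generated) if generated else 0.0

rate = hallucination_rate(
    ["Economy", "Sport", "Quantum Finance"],
    ["economy", "sport", "politics"],
)
# "Quantum Finance" matches nothing in the reference set, so 1 of 3
# generated topics counts as a potential hallucination.
```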
```bash
python3 topic_evaluation.py
```

## Bibtex:
```bibtex
@article{mu2024addressing,
title={Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling},
author={Mu, Yida and Bai, Peizhen and Bontcheva, Kalina and Song, Xingyi},
journal={arXiv preprint arXiv:2405.00611},
year={2024}
}
```