An open API service indexing awesome lists of open source software.

https://github.com/tonmoytalukder/rank-your-summaries-enhancing-bengali-text-summarization-via-ranking-based-approach

Enhancinng Bengali Text Summarization via Ranking based Approach
https://github.com/tonmoytalukder/rank-your-summaries-enhancing-bengali-text-summarization-via-ranking-based-approach

bangla-summarization banglabert bengali bengali-natural-language-generation bengali-summarization bengali-summary-ranker bengali-summary-ranking fine-tuning mt5 multilingual-t5 natural-language-processing summarization summary-ranker summary-ranking text-ranker textrank transformers

Last synced: about 1 month ago
JSON representation

Enhancinng Bengali Text Summarization via Ranking based Approach

Awesome Lists containing this project

README

        

# Rank-Your-Summaries-Enhancing-Bengali-Text-Summarization-via-Ranking-based-Approach

## Pretrained Models
We have used bellow pre-trained models for summarization:
- mT5 (mT5 XLSum, mT5 CrossSum, mT5 Shahidul)
- BERT (scibert uncased)

We have used BERT (BanglaBERT) for summary ranking purposes.

## Code
You will find the codes of this project inside the "Code" folder. You need to install specific libraries mentioned in the Notebook to run the code.
- There is a folder named 'Summary Generation with Pre-trained Models' inside the "Code" folder. It contains codes of four summary generator models, all of which generate a summary of Bengali text.
- And a notebook named "Summary_Ranker.ipynb" contains codes for:
- merging datasets
- ranking summaries
- generating the final data frame for evaluation
- and all evaluations.

## Data
You will find both of the datasets used in this project inside the "Data" folder. We have used one Huggingface Dataset and another from Kaggle. For the second one, you need to download it first.