https://github.com/tonmoytalukder/rank-your-summaries-enhancing-bengali-text-summarization-via-ranking-based-approach
Enhancinng Bengali Text Summarization via Ranking based Approach
https://github.com/tonmoytalukder/rank-your-summaries-enhancing-bengali-text-summarization-via-ranking-based-approach
bangla-summarization banglabert bengali bengali-natural-language-generation bengali-summarization bengali-summary-ranker bengali-summary-ranking fine-tuning mt5 multilingual-t5 natural-language-processing summarization summary-ranker summary-ranking text-ranker textrank transformers
Last synced: about 1 month ago
JSON representation
Enhancinng Bengali Text Summarization via Ranking based Approach
- Host: GitHub
- URL: https://github.com/tonmoytalukder/rank-your-summaries-enhancing-bengali-text-summarization-via-ranking-based-approach
- Owner: TonmoyTalukder
- Created: 2023-04-30T16:26:24.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-09-08T14:16:13.000Z (over 1 year ago)
- Last Synced: 2025-01-31T19:44:15.962Z (3 months ago)
- Topics: bangla-summarization, banglabert, bengali, bengali-natural-language-generation, bengali-summarization, bengali-summary-ranker, bengali-summary-ranking, fine-tuning, mt5, multilingual-t5, natural-language-processing, summarization, summary-ranker, summary-ranking, text-ranker, textrank, transformers
- Language: Jupyter Notebook
- Homepage: https://link.springer.com/chapter/10.1007/978-981-99-8937-9_11
- Size: 1.74 MB
- Stars: 1
- Watchers: 2
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Rank-Your-Summaries-Enhancing-Bengali-Text-Summarization-via-Ranking-based-Approach
## Pretrained Models
We have used bellow pre-trained models for summarization:
- mT5 (mT5 XLSum, mT5 CrossSum, mT5 Shahidul)
- BERT (scibert uncased)We have used BERT (BanglaBERT) for summary ranking purposes.
## Code
You will find the codes of this project inside the "Code" folder. You need to install specific libraries mentioned in the Notebook to run the code.
- There is a folder named 'Summary Generation with Pre-trained Models' inside the "Code" folder. It contains codes of four summary generator models, all of which generate a summary of Bengali text.
- And a notebook named "Summary_Ranker.ipynb" contains codes for:
- merging datasets
- ranking summaries
- generating the final data frame for evaluation
- and all evaluations.## Data
You will find both of the datasets used in this project inside the "Data" folder. We have used one Huggingface Dataset and another from Kaggle. For the second one, you need to download it first.