An open API service indexing awesome lists of open source software.

https://github.com/semanticclimate/docalr


https://github.com/semanticclimate/docalr

Last synced: 11 months ago
JSON representation

Awesome Lists containing this project

README

          

# docALR
# Scientific Paper Analysis Pipeline

A modular and extensible pipeline for **automated retrieval**, **named entity recognition (NER)**, **summarization**, and **question-answering** from scientific papers using **Pygetpapers**, **spaCy/transformers**, and **LLM/RAG-based models**.

---

## Features

- **Retrieve scientific papers** from open-access sources using [Pygetpapers](https://github.com/petermr/pygetpapers)
- **Extract named entities** using pre-trained spaCy or transformer-based models
- **Summarize full texts** or abstracts using transformer-based summarization models (e.g., BART, T5)
- **Ask questions** and get answers with RAG-based pipelines or custom LLMs

---