https://github.com/rensvandeschoot/foras_scripts_hunt
Scripts for the paper 'the hunt for the last relevant paper' of the FORAS project
https://github.com/rensvandeschoot/foras_scripts_hunt
Last synced: 6 months ago
JSON representation
Scripts for the paper 'the hunt for the last relevant paper' of the FORAS project
- Host: GitHub
- URL: https://github.com/rensvandeschoot/foras_scripts_hunt
- Owner: Rensvandeschoot
- License: mit
- Created: 2024-11-12T08:24:47.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2025-02-05T10:35:43.000Z (8 months ago)
- Last Synced: 2025-02-15T01:44:54.217Z (8 months ago)
- Language: Jupyter Notebook
- Size: 81.1 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[](https://doi.org/10.5281/zenodo.14615846)
# Overview
This repository is part of the **Hunt for the Last Relevant Paper** project,
pre-registered as "[Trajectories of PTSD Following Traumatic Events: A
Systematic and Multi-database Review]
(https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=494027)".The repository is focusing on data quality validation and running the analysis
for the paper. It includes Jupyter Notebooks designed to automate tests and
analyses for ensuring dataset consistency, logical integrity, and
reproducibility.# Contents
## `hunt_tests.ipynb`
- **Purpose**: Performs comprehensive data quality tests to validate the consistency of the labels across the different datasets.
- **Features**:
- Validates the uniqueness and completeness of key identifiers (`MID` values).
- Ensures logical consistency across related columns and datasets.
- Checks for synchronization between primary datasets and second-screener data.
- Reports detailed insights into any violations or inconsistencies.## `hunt_analysis.ipynb`:
- **Purpose**: Conducts analyses for the paper.
- **Features**:
- Implements statistical tests and visualizations.
- Analyzes dataset trends and patterns to support the systematic review findings.
- Provides clear outputs for inclusion in the manuscript.## Required Input Datasets
The following datasets are required to run the notebooks:
1. **`PTSS_Data_Foras.xlsx`**
- The primary dataset with all labels resulted from screening.2. **`PTSS_Data_Foras_Fulltext.xlsx`**
- Contains full-text screening information for records identified as TI-AB relevant in the primary dataset.3. **`PTSS_Data_Foras_Fulltext_2ndscreener.xlsx`**
- Contains second-screener evaluations for the same full-text records.4. **`PTSS_Data_Synergy.xlsx`**
- The initial dataset used for comparison and cross-checks.5. **`PTSS_Data_Synergy_Fulltext.xlsx`**
- Contains full-text screening information for the `PTSS_Data_Synergy` dataset.## Usage
Place these files in a `data/` directory at the root of the repository to ensure the notebooks can locate them.
Clone the repository:
```bash
git clone https://github.com/yourusername/hunt-for-the-last-relevant-paper.git
cd hunt-for-the-last-relevant-paper
```Install dependencies: Ensure you have Python and Jupyter installed, along with necessary Python libraries:
```bash
pip install pandas numpy
```Place the required input datasets in the data/ directory and open the notebooks:
```bash
jupyter notebook hunt_tests.ipynb
jupyter notebook hunt_analysis.ipynb
```Follow the steps in the notebook.
## Funding
The research is supported by the Dutch Research Council under grant number 406.22.GO.048## Contact
For questions contact Rens van de Schoot (a.g.j.vandeschoot@uu.nl)