https://github.com/alankritaaaa/side-effect-prediction-ml
AI/ML project for predicting side effect risk score & recommending drug substitutions
https://github.com/alankritaaaa/side-effect-prediction-ml
bioinformatics data-science drug-safety healthcare healthcare-ai machine-learning predictive-modeling streamlit
Last synced: 3 months ago
JSON representation
AI/ML project for predicting side effect risk score & recommending drug substitutions
- Host: GitHub
- URL: https://github.com/alankritaaaa/side-effect-prediction-ml
- Owner: alankritaaaa
- Created: 2025-06-22T12:50:02.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-06-22T14:12:52.000Z (4 months ago)
- Last Synced: 2025-06-22T14:26:16.381Z (4 months ago)
- Topics: bioinformatics, data-science, drug-safety, healthcare, healthcare-ai, machine-learning, predictive-modeling, streamlit
- Language: Python
- Homepage:
- Size: 16.6 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
💊 AI-Driven Drug Side Effect Prediction & Smart Substitution Recommender
This project presents a Clinical Decision Support System (CDSS) designed to predict adverse drug reactions (ADRs) and recommend safer therapeutic alternatives using machine learning. Built as part of my Master’s thesis at Pondicherry University, it aims to improve drug safety and reduce harmful side effects in clinical settings.
Key Features
- Predicts drug risk scores using an FDA-weighted side effect severity system.
- Recommends safer substitutes within the same therapeutic class.
- Streamlit web app for interactive use — drug input, risk output, and substitute recommendations.
- Explainable AI: Feature importance is derived using interpretable ML techniques.Tech Stack
- Python, Scikit-learn, Pandas, NumPy
- Matplotlib & Seaborn for visualization
- Streamlit for deployment
- Random Forest Regressor (best performing model with R² ≈ 0.81)Model Performance
- R² Score: 0.81
- RMSE: 3.2
- MAE: 2.4
- Feature Importance:
- SideEffectScore = 0.58
- Chemical Class = 0.18
- Action Class = 0.15Dataset
- Sourced from Kaggle: Includes drug names, classes, side effects, and substitutes.
- Features: Action Class, Therapeutic Class, Habit Forming, Side Effect Severity.
- Weighted scores based on FDA classifications (e.g., heart attack = 10, rash = 4).How It Works
1. Drug name is input via Streamlit UI.
2. Model outputs a risk score using learned pharmacological patterns.
3. The system retrieves and recommends lower-risk substitutes from the same category.Streamlit App Preview
Not live-hosted yet. Run locally using:
```VSCode
streamlit run streamlit_app.py