https://github.com/angchekar28/lung-cancer-prediction
This project builds and compares multiple machine learning models to predict lung cancer based on patient attributes. It evaluates classification models like Logistic Regression, Decision Tree, Random Forest, and SVM for early diagnosis.
https://github.com/angchekar28/lung-cancer-prediction
data-science data-visualization jupyter-notebook lung-cancer-detection machine-learning model-comparison python
Last synced: about 2 months ago
JSON representation
This project builds and compares multiple machine learning models to predict lung cancer based on patient attributes. It evaluates classification models like Logistic Regression, Decision Tree, Random Forest, and SVM for early diagnosis.
- Host: GitHub
- URL: https://github.com/angchekar28/lung-cancer-prediction
- Owner: angchekar28
- License: mit
- Created: 2025-02-04T18:11:38.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-02-04T18:19:05.000Z (4 months ago)
- Last Synced: 2025-02-04T19:27:34.376Z (4 months ago)
- Topics: data-science, data-visualization, jupyter-notebook, lung-cancer-detection, machine-learning, model-comparison, python
- Language: Jupyter Notebook
- Homepage:
- Size: 79.1 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# 📌 Lung Cancer Prediction: Model Comparison
## 📝 Project Overview
This project builds and compares multiple machine learning models to predict lung cancer presence based on patient data. The objective is to determine the most effective model for early diagnosis.## 📊 Dataset
- **Source:** Medical dataset with patient attributes
- **Columns:**
- `Age`, `Gender`, `Smoking`, `Yellow Fingers`, `Anxiety`, `Peer Pressure`, `Chronic Disease`
- `Wheezing`, `Alcohol`, `Coughing`, `Shortness of Breath`
- `Swallowing Difficulty`, `Chest Pain`
- `Lung Cancer` (Target variable: Yes/No)## ⚙️ Methodology
1. **Data Preprocessing**
- Handling missing values
- Encoding categorical variables
- Feature scaling (if required)2. **Exploratory Data Analysis (EDA)**
- Correlation analysis of symptoms
- Class distribution visualization3. **Model Implementation**
- Logistic Regression
- Decision Tree
- Random Forest
- Support Vector Machine (SVM)
- K-Nearest Neighbors (KNN)4. **Model Evaluation**
- Accuracy, Precision, Recall, and F1-Score
- ROC Curve and AUC Score for classification performance## 🛠️ Installation & Usage
```bash
# Install dependencies
pip install pandas numpy matplotlib seaborn scikit-learn# Clone the repository
git clone
cd# Open Jupyter Notebook
jupyter notebook lung-cancer-analyze-predict-model-comparison.ipynb
```
## Conclusion:
- This study helps in identifying an optimal predictive model for lung cancer diagnosis.
- It can assist in early detection and improve patient outcomes.