https://github.com/allanotieno254/cancer-classification-support_vector_machine-svm
machine learning project focused on classifying cancer types using Support Vector Machine (SVM) algorithms.
https://github.com/allanotieno254/cancer-classification-support_vector_machine-svm
data-preprocessing data-science libraries machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas predictive-modeling python python3 support-vector-machines svm-classifier svm-model
Last synced: 6 months ago
JSON representation
machine learning project focused on classifying cancer types using Support Vector Machine (SVM) algorithms.
- Host: GitHub
- URL: https://github.com/allanotieno254/cancer-classification-support_vector_machine-svm
- Owner: AllanOtieno254
- Created: 2024-10-17T16:23:33.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-17T17:27:44.000Z (about 1 year ago)
- Last Synced: 2025-02-17T08:33:56.819Z (9 months ago)
- Topics: data-preprocessing, data-science, libraries, machine-learning, machine-learning-algorithms, matplotlib, matplotlib-pyplot, pandas, predictive-modeling, python, python3, support-vector-machines, svm-classifier, svm-model
- Language: Jupyter Notebook
- Homepage: https://colab.research.google.com/drive/1qvLMUeKS0mriS4GK2yd4YCDZhc4RbDKB?authuser=1#scrollTo=w5d4WH4a_q_R
- Size: 1.87 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Cancer Classification using SVM
This repository contains a machine learning project aimed at classifying cancer types based on features extracted from cell samples using a Support Vector Machine (SVM) model.


**Introduction to Support Vector Machine(SVM):** Used SVM to build and train a model using human cell records, and classify cells to whether the samples are benign (mild state) or malignant (evil state).
# Cancer images



SVM works by mapping data to a high-dimensional feature space so that data points can be categorized, even when the data are not otherwise linearly separable (This gets done by kernel function of SVM classifier). A separator between the categories is found, then the data is transformed in such a way that the separator could be drawn as a hyperplane.**
## Project Structure
- `data/`: Contains datasets used for model training and predictions.
- `models/`: Contains the saved SVM model for making predictions.
- `notebooks/`: Jupyter notebooks for exploratory data analysis and modeling.
- `src/`: Scripts for data preprocessing, model training, and making predictions.
- `requirements.txt`: Required packages for the project.
- `README.md`: Project documentation.
- `LICENSE`: License information.
## Dataset
The dataset used for training and testing can be found in the `data/` folder. Ensure that the feature columns are properly formatted for predictions.
# confusion matrix


# field names and descriptions

# Usage
1Preprocess the data using data_preprocessing.py.
2.Train the model using model_training.py.
3.Make predictions on new data using predictions.py.
### License
Consider using the **MIT License** for your project. You can include a `LICENSE` file with the following content:
markdown
MIT License
Copyright (c) [2024] [Allan Otieno]
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
[...]
## Installation
To install the required packages, run:
```bash
pip install -r requirements.txt