An open API service indexing awesome lists of open source software.

https://github.com/allanotieno254/cancer-classification-support_vector_machine-svm

machine learning project focused on classifying cancer types using Support Vector Machine (SVM) algorithms.
https://github.com/allanotieno254/cancer-classification-support_vector_machine-svm

data-preprocessing data-science libraries machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas predictive-modeling python python3 support-vector-machines svm-classifier svm-model

Last synced: 6 months ago
JSON representation

machine learning project focused on classifying cancer types using Support Vector Machine (SVM) algorithms.

Awesome Lists containing this project

README

          

# Cancer Classification using SVM
This repository contains a machine learning project aimed at classifying cancer types based on features extracted from cell samples using a Support Vector Machine (SVM) model.

svm class

svm classification1

**Introduction to Support Vector Machine(SVM):** Used SVM to build and train a model using human cell records, and classify cells to whether the samples are benign (mild state) or malignant (evil state).

# Cancer images
![lung cancer](https://github.com/user-attachments/assets/a74111ba-294b-487c-87d0-b2bcd8f02788)

![breast cancer](https://github.com/user-attachments/assets/ca400554-7b01-4f9d-8d42-1434bcc1acce)

![Visualization-of-breast-cancer-a-benign-and-malignant-tumor-cells-b-benign-and](https://github.com/user-attachments/assets/42fe1671-8ca9-4594-94f5-77779ad3c830)

SVM works by mapping data to a high-dimensional feature space so that data points can be categorized, even when the data are not otherwise linearly separable (This gets done by kernel function of SVM classifier). A separator between the categories is found, then the data is transformed in such a way that the separator could be drawn as a hyperplane.**

## Project Structure

- `data/`: Contains datasets used for model training and predictions.
- `models/`: Contains the saved SVM model for making predictions.
- `notebooks/`: Jupyter notebooks for exploratory data analysis and modeling.
- `src/`: Scripts for data preprocessing, model training, and making predictions.
- `requirements.txt`: Required packages for the project.
- `README.md`: Project documentation.
- `LICENSE`: License information.

## Dataset

The dataset used for training and testing can be found in the `data/` folder. Ensure that the feature columns are properly formatted for predictions.

# confusion matrix
confusion matrix

confusion matrix2

# field names and descriptions

Screenshot 2024-10-16 084625

# Usage
1Preprocess the data using data_preprocessing.py.
2.Train the model using model_training.py.
3.Make predictions on new data using predictions.py.

### License
Consider using the **MIT License** for your project. You can include a `LICENSE` file with the following content:
markdown
MIT License

Copyright (c) [2024] [Allan Otieno]

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

[...]

## Installation

To install the required packages, run:
```bash
pip install -r requirements.txt