https://github.com/mariarodr1136/frauddetectionsystem

FraudWatch is a machine learning-based credit card fraud detection system that uses a Random Forest classifier. It visualizes model performance with an interactive confusion matrix heatmap. The system is deployed as a user-friendly Flask web application. 📊

Host: GitHub
URL: https://github.com/mariarodr1136/frauddetectionsystem
Owner: mariarodr1136
Created: 2025-01-07T19:59:11.000Z (10 months ago)
Default Branch: main
Last Pushed: 2025-01-07T23:56:29.000Z (10 months ago)
Last Synced: 2025-01-17T05:12:14.810Z (10 months ago)
Topics: creditcardfraud, dataanalysis, datascience, financialsecurity, flaskapp, frauddetection, fraudprevention, interactivedashboard, machinelearning, randomforest
Language: HTML
Homepage:
Size: 2.6 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# FraudWatch: Credit Card Fraud Detection System 🚨

FraudWatch is a credit card fraud detection system that uses machine learning to flag fraudulent transactions. It is built with **Python** and **Flask** and classifies transactions with a **Random Forest classifier**, an ensemble learning algorithm that typically performs well on tabular classification tasks. The backend uses **scikit-learn** for model training and evaluation, and the Flask web application presents the results in an interactive form.

FraudWatch incorporates robust data preprocessing steps such as feature scaling using **StandardScaler** to ensure optimal model performance. For visualization, the system employs **Plotly**, a data visualization library that creates an interactive chart for the model's performance metrics, allowing users to explore how well the model classifies legitimate and fraudulent transactions. The chart enhances user experience by offering an intuitive, interactive view of model accuracy, false positives, false negatives, true positives, and true negatives.

![Python](https://img.shields.io/badge/Python-Programming%20Language-blue)
![Flask](https://img.shields.io/badge/Flask-Web%20Framework-lightgreen)
![Pandas](https://img.shields.io/badge/Pandas-Data%20Manipulation-orange)
![scikit-learn](https://img.shields.io/badge/scikit--learn-Machine%20Learning-blue)
![Plotly](https://img.shields.io/badge/Plotly-Data%20Visualization-brightgreen)

---

## Table of Contents
- [Technologies Used](#technologies-used)
- [Features](#features)
- [Dataset](#dataset)
- [Future Enhancements](#future-enhancements)
- [Installation](#installation)
- [Usage](#usage)
- [Example](#example)
- [Contributing](#contributing)
- [Contact](#contact-)

---

## Technologies Used
- **Python**: The primary programming language for the backend logic.
- **Flask**: Web framework for creating the web application and serving the results.
- **Pandas**: Data manipulation and analysis library used for loading and preparing the dataset.
- **scikit-learn**: Machine learning library used for building and evaluating the Random Forest model.
- **Plotly**: Data visualization library for generating interactive charts for model performance metrics.

---


## Features
- **Fraud Detection**: The system predicts whether a credit card transaction is fraudulent or legitimate using a Random Forest classifier.
- **Feature Scaling**: Data is preprocessed with **StandardScaler** to normalize the features.
- **Classification Report**: Detailed classification report with precision, recall, and F1-score for both classes (legitimate and fraudulent transactions).
- **Model Performance Metrics Chart**: Visualizes the performance of the model through an interactive line chart for accuracy, false positives, false negatives, true positives, and true negatives using Plotly.
- **Sample Transaction Table**: Displays a table with a few sample transactions, showing their true class and predicted class, with color-coded indicators for correct or incorrect predictions.
- **User-friendly Web Application**: The system is packaged in a Flask web application to display results in an accessible and interactive format.
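
The scaling, training, and reporting steps listed above can be sketched as follows. This is a minimal illustration rather than the project's actual code: it uses a synthetic, imbalanced dataset from `make_classification` as a stand-in for `creditcard.csv`, and the split ratio, number of trees, and random seed are assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for creditcard.csv: roughly 5% of samples are positives.
X, y = make_classification(
    n_samples=2000, n_features=10, weights=[0.95, 0.05], random_state=42
)

# Stratify so both splits keep the same share of the rare class.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# The pipeline fits the scaler on the training split only, then trains the forest.
model = make_pipeline(
    StandardScaler(),
    RandomForestClassifier(n_estimators=100, random_state=42),
)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# Per-class precision, recall, and F1-score as a text report.
report = classification_report(
    y_test, y_pred, target_names=["Legitimate", "Fraudulent"], zero_division=0
)
print(report)
```

Wrapping the scaler and the classifier in one pipeline keeps the test split out of the scaler's statistics, which avoids a subtle form of data leakage.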

---


## Dataset

The data used in this project is from [Kaggle - Credit Card Fraud Detection](https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud?resource=download).

It is important that credit card companies are able to recognize fraudulent credit card transactions to prevent customers from being charged for items they did not purchase.

### Content

The dataset contains transactions made by credit cards in September 2013 by European cardholders. It presents transactions that occurred over two days, with 492 frauds out of 284,807 transactions. The dataset is highly unbalanced, with frauds accounting for 0.172% of all transactions.

The dataset contains only numerical input variables. Due to confidentiality concerns, the original features are not provided. The features V1, V2, … V28 are the principal components obtained via PCA. The only two features that have not been transformed are:

- **'Time'**: The seconds elapsed between each transaction and the first transaction in the dataset.
- **'Amount'**: The transaction amount, which can be used for example-dependent cost-sensitive learning.

The target variable, **'Class'**, indicates whether a transaction is fraudulent (1) or not (0).
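
To make the class imbalance concrete, the short calculation below uses only the two counts quoted above. It also shows the accuracy of a trivial model that labels every transaction as legitimate, which is why precision and recall on the fraudulent class are more informative than accuracy alone. The constant names are illustrative.

```python
# Figures quoted in the dataset description above.
TOTAL_TRANSACTIONS = 284_807
FRAUD_TRANSACTIONS = 492

# Share of transactions labeled as fraud (Class == 1).
fraud_share = FRAUD_TRANSACTIONS / TOTAL_TRANSACTIONS

# Accuracy of a trivial model that predicts "legitimate" for every transaction.
baseline_accuracy = 1 - fraud_share

print(f"Fraud share: {fraud_share:.3%}")
print(f"Always-legitimate accuracy: {baseline_accuracy:.3%}")
```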

---

## Future Enhancements

- **Model Optimization**: Explore and integrate additional machine learning models such as **XGBoost** or **Gradient Boosting** to improve classification accuracy and model efficiency.
- **User Authentication**: Implement a user authentication system using **OAuth** or **JWT** for secure, personalized access to the application.
- **Real-Time Fraud Detection**: Integrate **real-time transaction data** for live fraud detection and alert generation.
- **Model Interpretability**: Incorporate **SHAP** or **LIME** for better understanding and visualization of how the model makes decisions.
- **Additional Data Sources**: Expand the dataset to include more features such as **transaction location** or **merchant information** to enhance fraud prediction.
- **Frontend Enhancements**: Improve the user interface by integrating more interactive elements and visualizations, such as **line charts** or **bar graphs**, for deeper insights into the model's performance.

---

## Installation

### Prerequisites
- Python 3.x
- pip (Python package installer)

### Setup Instructions

1. Clone the repository to your local machine:

   ```bash
   git clone https://github.com/mariarodr1136/frauddetectionsystem.git
   cd frauddetectionsystem
   ```
2. Install the required dependencies:

   ```bash
   pip install -r requirements.txt
   ```
3. Download the dataset file `creditcard.csv` from Kaggle (see [Dataset](#dataset)) and place it in the project directory.
4. Run the Flask application:

   ```bash
   python app.py
   ```
5. Open your browser and go to http://127.0.0.1:5000/ to view the application.

## Usage

Once the application is running, you can access the following features:

1. **Classification Report**: View the precision, recall, and F1-score for both classes (legitimate and fraudulent).
2. **Interactive Model Metrics Line Chart**: See the model performance metrics (accuracy, false positives, false negatives, true positives, and true negatives) visualized interactively in a line chart, where you can hover over the points for detailed values.
3. **Sample Transactions Table**: View the first few sample transactions, with true and predicted classes, color-coded to indicate correct or incorrect predictions.

---

## Example

### Classification Report

| Class | Precision | Recall | F1-Score |
|--------------|-----------|--------|----------|
| Legitimate | 0.99 | 0.99 | 0.99 |
| Fraudulent | 0.98 | 0.95 | 0.97 |
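
For readers unfamiliar with these columns, the sketch below shows how precision, recall, and F1-score follow from the confusion counts of a single class. The function name and the counts are hypothetical and are not taken from the project's results.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Return (precision, recall, F1) for one class from its confusion counts."""
    # Precision: of everything predicted as this class, how much was right.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of everything truly in this class, how much was found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical counts for the fraudulent class, for illustration only.
precision, recall, f1 = precision_recall_f1(tp=95, fp=2, fn=5)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```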

---

### Model Performance Metrics Line Chart

The line chart visualizes the model's performance metrics, such as accuracy, false positives, false negatives, true positives, and true negatives.

---

### Sample Transactions Table

| Transaction (V1, V2, V3, V4) | True Class | Prediction |
|----------------------------------------|------------|-------------|
| -0.1348, 0.5127, -0.1256, -0.3432 | Legitimate | Fraudulent |
| -0.0321, 0.3548, -0.2156, -0.2342 | Fraudulent | Fraudulent |
| ... | ... | ... |
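
The rows of such a table could be assembled as in the sketch below. The helper name `build_sample_rows` and the `correct` flag are hypothetical; a template could use the flag to pick the color for each row.

```python
# Class codes follow the dataset: 0 = legitimate, 1 = fraudulent.
LABELS = {0: "Legitimate", 1: "Fraudulent"}


def build_sample_rows(features, y_true, y_pred, n_features=4):
    """Pair each transaction's first features with its labels and a correctness flag."""
    rows = []
    for values, truth, pred in zip(features, y_true, y_pred):
        rows.append(
            {
                "transaction": ", ".join(f"{v:.4f}" for v in values[:n_features]),
                "true_class": LABELS[truth],
                "prediction": LABELS[pred],
                "correct": truth == pred,  # drives the color-coded indicator
            }
        )
    return rows


# Feature values copied from the example table above.
rows = build_sample_rows(
    features=[
        [-0.1348, 0.5127, -0.1256, -0.3432],
        [-0.0321, 0.3548, -0.2156, -0.2342],
    ],
    y_true=[0, 1],
    y_pred=[1, 1],
)
for row in rows:
    print(row)
```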

---

## Contributing
Feel free to submit issues or pull requests for improvements or bug fixes. You can also open issues to discuss potential changes or enhancements. All contributions are welcome to enhance the app’s features or functionality!

To contribute, please follow these steps:

1. Fork the repository.
2. Create a new branch for your feature or bug fix:

   ```bash
   git checkout -b feat/your-feature-name
   ```
   Alternatively, for bug fixes:

   ```bash
   git checkout -b fix/your-bug-fix-name
   ```
3. Make your changes and run all tests, making sure they pass before you commit.
4. Commit your changes with a descriptive message:

   ```bash
   git commit -m 'add your commit message'
   ```
5. Push your changes to your forked repository:

   ```bash
   git push origin feat/your-feature-name
   ```
6. Submit a pull request to the main repository, explaining your changes and providing any necessary details.

## Contact 🌐
If you have any questions or feedback, feel free to reach out at [mrodr.contact@gmail.com](mailto:mrodr.contact@gmail.com).
