Projects in Awesome Lists tagged with data-classification
A curated list of projects in awesome lists tagged with data-classification .
https://github.com/mthh/jenkspy
Compute Natural Breaks in Python (Fisher-Jenks algorithm)
data-classification jenks-fisher python-library
Last synced: 15 May 2025
https://github.com/openraven/mockingbird
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
data-classification synthetic-data-generation
Last synced: 23 Jan 2026
https://github.com/nightfallai/nightfall-python-sdk
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
api data-classification data-loss-prevention data-privacy data-protection nightfall pii python sdk secrets-detection
Last synced: 16 Jan 2026
https://github.com/chgl16/data-mining-algorithm
:bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法
apriori-algorithm correlation-analysis data-classification data-mining-algorithms k-means-clustering
Last synced: 12 May 2025
https://github.com/arpitnarechania/binguru
BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
binning cartography choropleth-map data-binning data-classification geospatial-visualization geovisualization gis open-source resiliency typescript-library visualization
Last synced: 21 Mar 2025
https://github.com/gabfr/truck-data-wrangler
ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB
data-classification spark stream timescaledb
Last synced: 03 Aug 2025
https://github.com/mrseanryan/data-type-predictor
Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...
ai data-classification data-types nlp stemming
Last synced: 08 Nov 2025
https://github.com/mthh/classif
Library for one-dimensional data classification and simple statistics in Rust
data-classification rust-library statistics
Last synced: 12 Jul 2025
https://github.com/qeeqbox/data-classification
Data classification defines and categorizes data according to its type, sensitivity, and value
classification data data-classification infosecsimplified qeeqbox
Last synced: 05 Mar 2025
https://github.com/arpitnarechania/resiliency-app
Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.
cartography choropleth choropleth-map data-binning data-classification geographical-information-system gis visualization
Last synced: 27 Feb 2025
https://github.com/dsadriel/inf01124-cpd-tf
Trabalho final da disciplina Classificação e Pesquisa de Dados, ministrada pelo Prof. Leandro Krug Wives
computer-science data-classification data-search graduation-project ufrgs
Last synced: 05 Jul 2025
https://github.com/melvinmo/ropac-rule-optimized-aggregation-classifier
Python code for the ROPAC data classification algorithm
data-classification data-mining machine-learning ropac rule-based-classifier
Last synced: 25 Feb 2025
https://github.com/debopriya2320/classification-of-biological-databases
A project for categorizing and analyzing various biological databases based on their type and content.
bioinformatics biological-databases computational-biology data-analysis data-classification databases genomics proteomics research-tools systems-biology
Last synced: 09 Oct 2025
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 Apr 2025
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 25 Aug 2025
https://github.com/rubyyy1118/machine_learning_optimization_study
The Learning From Data - Assignment in my MSc Business Analytics course
data-classification data-cleaning data-science data-visualization hyperparameter-tuning label-encoding neural-network python support-vector-machine tensorflow
Last synced: 29 Jun 2025