An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-classification

A curated list of projects in awesome lists tagged with data-classification .

https://github.com/mthh/jenkspy

Compute Natural Breaks in Python (Fisher-Jenks algorithm)

data-classification jenks-fisher python-library

Last synced: 15 May 2025

https://github.com/openraven/mockingbird

A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.

data-classification synthetic-data-generation

Last synced: 23 Jan 2026

https://github.com/chgl16/data-mining-algorithm

:bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法

apriori-algorithm correlation-analysis data-classification data-mining-algorithms k-means-clustering

Last synced: 12 May 2025

https://github.com/arpitnarechania/binguru

BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.

binning cartography choropleth-map data-binning data-classification geospatial-visualization geovisualization gis open-source resiliency typescript-library visualization

Last synced: 21 Mar 2025

https://github.com/gabfr/truck-data-wrangler

ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB

data-classification spark stream timescaledb

Last synced: 03 Aug 2025

https://github.com/mrseanryan/data-type-predictor

Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...

ai data-classification data-types nlp stemming

Last synced: 08 Nov 2025

https://github.com/mthh/classif

Library for one-dimensional data classification and simple statistics in Rust

data-classification rust-library statistics

Last synced: 12 Jul 2025

https://github.com/qeeqbox/data-classification

Data classification defines and categorizes data according to its type, sensitivity, and value

classification data data-classification infosecsimplified qeeqbox

Last synced: 05 Mar 2025

https://github.com/arpitnarechania/resiliency-app

Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.

cartography choropleth choropleth-map data-binning data-classification geographical-information-system gis visualization

Last synced: 27 Feb 2025

https://github.com/dsadriel/inf01124-cpd-tf

Trabalho final da disciplina Classificação e Pesquisa de Dados, ministrada pelo Prof. Leandro Krug Wives

computer-science data-classification data-search graduation-project ufrgs

Last synced: 05 Jul 2025

https://github.com/monish-nallagondalla/universal-bank

Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.

classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn

Last synced: 05 Apr 2025

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 25 Aug 2025