An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with train-test-split

A curated list of projects in awesome lists tagged with train-test-split.

https://github.com/odancona/bboxconverter

This library allows reading and converting bounding box annotations in many popular formats

bounding-boxes computer-vision image-recognition numpy object-detection pandas python pytorch tensorflow train-test-split

Last synced: 29 Dec 2025

https://github.com/ODAncona/bboxconverter

This library allows reading and converting bounding box annotations in many popular formats

bounding-boxes computer-vision image-recognition numpy object-detection pandas python pytorch tensorflow train-test-split

Last synced: 09 Jul 2025

https://github.com/yu9824/kennard_stone

An algorithm for evenly partitioning data.

kfold-cross-validation python scikit-learn train-test-split

Last synced: 16 Mar 2025

https://github.com/camilajaviera91/bagging-with-kaggle

A first approach to decision trees and bagging, with the aim of training the model on any dataset from Kaggle (reusing the 'connect with Kaggle' project for this).

accuracy-score bagging-classifier curses decision-tree-classifier kaggle labelencoder pandas python simpleimputer sklearn-library train-test-split

Last synced: 07 Sep 2025

https://github.com/elifftosunn/bert-bank-model

A Turkish BERT-based model that analyzes people's bank complaints and classifies each one into one of eight categories.

countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer

Last synced: 15 Mar 2025

https://github.com/bhattbhavesh91/random_state_train_test_split

A simple example of random state in a train-test split using Python.

random-state train-test-split train-test-using-sklearn

Last synced: 05 Dec 2025
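
For readers unfamiliar with the idea behind the entry above, here is a minimal sketch of what a fixed random state does in a train-test split. It uses only the Python standard library. It is not the repository's code and not scikit-learn's implementation; the helper name `split_with_seed` and its parameters are hypothetical, chosen for illustration.

```python
import random


def split_with_seed(items, test_fraction=0.25, seed=None):
    """Hypothetical helper: shuffle a copy of `items`, then cut it into train/test."""
    # A dedicated generator keeps the global random state untouched.
    rng = random.Random(seed)
    shuffled = list(items)
    rng.shuffle(shuffled)
    # Hold out at least one item for the test set.
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


data = list(range(20))

# Two calls with the same seed produce the same partition.
train_a, test_a = split_with_seed(data, seed=42)
train_b, test_b = split_with_seed(data, seed=42)
print(test_a == test_b)

# Without a seed, the partition may differ from run to run.
train_c, test_c = split_with_seed(data)
print(len(train_c), len(test_c))
```

Fixing the seed makes an experiment repeatable: the same rows land in the test set every time, so changes in a metric come from the model rather than from a different shuffle.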

https://github.com/aarryasutar/linear_multivariate_regression_on_football_statistics

Linear regression models are used to predict football player attacking stats based on attributes like finishing and passing, with the model trained, evaluated, and applied for predictions. Multiple features improve accuracy, and performance is assessed using metrics like MSE and R-squared.

datasets feature-selection football-stats linear-reg machine-learning mae mse multivariate-regression numpy pandas rmse scatter-plot train-test-split

Last synced: 27 Jun 2025

https://github.com/camilajaviera91/prediction-of-housing-prices-using-linear-regression

This project provides tools to search for datasets on Kaggle, download and preprocess them, and perform predictions using a Linear Regression model. It includes interactive text-based user interfaces built with `curses`.

curses kaggle linear-regression matplotlib-pyplot mean-absolute-error mean-square-error numpy pandas pathlib python scikit-learn train-test-split

Last synced: 30 Dec 2025

https://github.com/yashrajgithub/crop-recommendation

KrishiGyaan is a web app designed to help farmers make informed decisions on crop selection. By analyzing soil and environmental factors, the app provides personalized crop recommendations, enhancing agricultural productivity and promoting sustainable farming practices.

api artificial-intelligence crop-recommendation-system data-preprocessing data-visualization json machine-learning-algorithms pickle python random-forest-classifier scikit-learn streamlit supervised-learning train-test-split user-interface

Last synced: 30 Dec 2025

https://github.com/harmanveer-2546/predicting-schizophrenia-disorder

The positive symptoms typical of schizophrenia – such as delusions, hallucinations or formal thought disorders – often first appear in an attenuated or transient form during the initial prodromal phase

boxplot decisiontreeregressor disorder linearregression matplotlib mean-squared-error numpy pairplot pandas prediction randomforestregressor schizophrenia seaborn train-test-split visualization

Last synced: 24 Nov 2025

https://github.com/harmanveer-2546/prediction-of-ticket-cancellation

The objective is to develop a model that accurately predicts whether users will cancel their tickets. Each cancellation incurs a fine for the ticket registration site from the passenger company.

datetime evaluation gridsearchcv labelencoder numpy pandas standardscaler stratified-k-fold train-test-split xgboost-model

Last synced: 28 Feb 2025

https://github.com/guslovesmath/o3_aqi_emission_ml

Analyzing O3 Air Quality Index trends (2000-2023) in the U.S., this project identifies regions with rising pollution. Utilizing exploratory data analysis and time-series modeling, it offers actionable insights for informed policy decisions on urgent O3 pollution issues.

forecasting machine-learning statsmodels time-series train-test-split

Last synced: 08 Oct 2025

https://github.com/pb319/california_house-price-prediction

My first end-to-end ML project implementation, covering all required stages and taking guidance from the book "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 29 Dec 2025

https://github.com/jbizzlefoshizzle/linear-and-ridge-regression

The purpose of this project was to analyze and predict housing prices using attributes or features such as square footage, number of bedrooms, number of floors, and so on.

linear-regression machine-learning machine-learning-algorithms regression-analysis regression-models ridge-regression scikit-learn scikitlearn-machine-learning train-test-split train-test-using-sklearn

Last synced: 20 Mar 2025

https://github.com/venkat-0706/titanic-survival-prediction

A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.

classification-report confusion-matrix gridsearchcv matplotlib numpy onehot-encoder pandas sckit-learn seaborn train-test-split xgboost

Last synced: 04 Apr 2025

https://github.com/sithu-khant/train-valid-test

Code for the blog post "why (how) we split train, valid and test?"

machine-learning train-test-split

Last synced: 24 Feb 2025

https://github.com/mindlessmuse666/train-test-splitter

Analysis of Titanic passenger data and splitting it into training and test sets. A practical assignment for the course "Fundamentals of Applying Artificial Intelligence Methods in Programming".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 31 Dec 2025

https://github.com/eddex/signals-dataset

A dataset with 5077 images of numbered signals and a script to create a train-test-split

annotations dataset hslu machine-learning train-test-split

Last synced: 29 Apr 2025