Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-10 00:07:12 UTC
https://github.com/mwoss/mlflow-stock-market-example
Stock market prediction - machine learning pipeline using MLFlow.
anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial
Last synced: 24 Jan 2025
https://github.com/reddyprasade/pandas-practice
Pandas
daat data-analysis data-science flexible labeling missing-data missing-values pandas pandas-profiling
Last synced: 19 Jan 2025
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 24 Jan 2025
https://github.com/hyperspy/holospy-demos
HoloSpy Jupyter Notebook demos
data-analysis data-visualization electron-holography hyperspy materials-science multi-dimensional physical-sciences tutorial
Last synced: 19 Jan 2025
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 30 Jan 2025
https://github.com/mynenik/xyplot-32
Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux
cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows
Last synced: 24 Jan 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 08 Feb 2025
https://github.com/chrdek/linqdatacalc
📈 🎲 A set of LINQ-based extensions for data statistics.
calculations calculator data-analysis data-analytics data-science data-statictics extension-methods extensions linq linq-extensions set-theory statistical-analysis statistics
Last synced: 29 Jan 2025
https://github.com/duart38/sponge
Quickly make endpoints for testing
cms data-analysis deno developer-tools development-tools helper-tool mock server sponge testing testing-tools toolkit tools
Last synced: 06 Feb 2025
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 23 Dec 2024
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn
Last synced: 09 Dec 2024
https://github.com/gustavohnsv/teamwork_mqa
Repository for a group project based on the study of data analysis methods in the course Métodos Quantitativos para Análise Multivariada (Quantitative Methods for Multivariate Analysis).
data-analysis group-project r team-repo
Last synced: 16 Dec 2024
https://github.com/yahia3200/become-an-independent-data-scientist
My final project for the Applied Plotting, Charting & Data Representation in Python Course
data-analysis data-science data-visualization matplotlib
Last synced: 22 Jan 2025
https://github.com/thennen/py-ivtools
A package for flexible and reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 24 Jan 2025
https://github.com/depressioncenter/data-and-design-core
Code developed by the EFDC Data and Design Core team to support mental health research.
data-analysis data-science efdc inference r statistical-analysis umich
Last synced: 25 Jan 2025
https://github.com/narius2030/sakila-datawarehouse-ssis
Implements a simple data warehouse to store Sakila data, builds pipelines that extract, transform, and load data from the source into the warehouse, and queries the warehouse for exploration and analysis.
data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis
Last synced: 07 Feb 2025
https://github.com/ziaeemehr/itng_nest
NEST Simulator quick guides and examples, including adding a new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 07 Feb 2025
https://github.com/tathithienthanh/datamining-banking-dataset
Implement some learned data mining techniques and predict if the client will subscribe to a term deposit
apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization
Last synced: 25 Jan 2025
https://github.com/atxtechbro/flightradar24
Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.
api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping
Last synced: 25 Jan 2025
https://github.com/antononcube/wl-outlieridentifiers-paclet
Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.
data-analysis hampel outlier-detection outliers
Last synced: 08 Feb 2025
https://github.com/elhaban3ro/thewildtool
TheWildTool is a tool whose main objective is to save time when working with audio datasets, whether preparing them, obtaining them, or training a model with them. 🤖
ai audio audio-processing data-analysis data-science dataset deeplearning python
Last synced: 30 Jan 2025
https://github.com/adirthaborgohain/community-data-analysis
Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.
Last synced: 05 Feb 2025
https://github.com/sn2606/global-temperature-time-series
Time series analysis is performed on the Berkeley Earth Surface Temperature dataset.
arima arima-forecasting arima-model climate-change data-analysis data-visualization forecasting-model global-temperature series-analysis singular-spectrum-analysis time-series time-series-analysis time-series-forecasting
Last synced: 25 Jan 2025
https://github.com/rajshrestha86/police-brutality-data-analysis
In this project, we analyze the events after George Floyd’s death: the protests and riots across the United States and the sentiments of news articles from three news sources with different political leanings. We examine how these media outlets reacted after Floyd’s death and the effect of media bias on news sentiment for the #BlackLivesMatter and #AllLivesMatter movements. We also test for a correlation between police budgets and the number of protests, which helps assess whether defunding the police is really needed to reduce police brutality and casualties. Finally, we examine the correlation between partisan segregation and the number of deaths to see whether political preference has an effect on the number of deaths by police.
data-analysis matplotlib pandas python sentiment-analysis web-scraping
Last synced: 07 Feb 2025
https://github.com/njoyedevs/chatgpt3_riskanalyzer
In this project, ChatGPT3 was fine-tuned on 9 data series spanning 40 years to train it to provide a market risk score. To view, visit: https://www.aimarketrisk.com
chatgpt3 data-analysis flask fred-api full-stack-web-development pandas python
Last synced: 30 Jan 2025
https://github.com/zachlagden/spotify-listening-analyzer
A comprehensive Python tool for analyzing your Spotify listening history data.
analytics data-analysis pandas python spotify-web-api spotipy
Last synced: 07 Feb 2025
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 05 Feb 2025
https://github.com/cego669/datathonengopevi
Team: Embrapeiros. Proposed solution for the Datathon of the VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Note: WE WERE THE CHAMPIONS!
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 08 Dec 2024
https://github.com/shivamswarnkar/tesla-stock-prediction
Predicting the closing prices of Tesla stock using different regression methods.
data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction
Last synced: 26 Jan 2025
https://github.com/winter000boy/dsa-practice
This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.
data-analysis data-science leetcode leetcode-python pandas-python python3
Last synced: 30 Jan 2025
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 09 Jan 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 12 Jan 2025
https://github.com/al-ghaly/airline-company-data-warehouse
Data Warehouse modeling, design, implementation, and analysis for an Airline Company.
data-analysis data-warehousing database-modeling sql-server
Last synced: 22 Jan 2025
https://github.com/manmolecular/http-response-clustering
📉 Clustering of HTTP responses using k-means++ and the elbow method
data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3
Last synced: 16 Jan 2025
https://github.com/nafiealhilaly/analyze-coderhub-sa
A simple web app to analyze and explore coderhub.sa API data. This project was my first real React app.
backend data-analysis eda frontend python react reactjs
Last synced: 08 Feb 2025
https://github.com/wittline/data-analytics-with-r
Repository for data analytics course using R
cassandra-database cql data-analysis genetic-algorithm pentaho-data-integration r
Last synced: 29 Jan 2025
https://github.com/c0deta1ker/matbase
MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.
cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps
Last synced: 30 Nov 2024
https://github.com/zrkhadija/data-analysis-for-financial-time-series
In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.
autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization
Last synced: 09 Feb 2025
https://github.com/thecoderpinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 09 Feb 2025
https://github.com/hvignolo87/ortex-programming-challenge
Coding challenges required for the Python Developer and Data Engineer job positions.
challenge data-analysis finance pandas python scripting sql sqlalchemy
Last synced: 02 Jan 2025
https://github.com/janheinrichmerker/song-analysis
Analysing the Million Song Dataset.
big-data data-analysis data-science hadoop hadoop-mapreduce java kotlin songs
Last synced: 24 Dec 2024
https://github.com/louislefevre/sstubs-miner
Data mining and analysis for the ManySStuBs4J dataset.
data-analysis data-mining manysstubs4j-dataset msr
Last synced: 05 Feb 2025
https://github.com/jgekko99/portfolio-optimization-and-backtesting-using-python-a-pragmatic-approach
Modern Portfolio Theory (MPT) and Monte Carlo simulations to optimize and backtest a portfolio of various financial assets
asset-management data-analysis data-cleaning jupyter-notebook modern-portfolio-theory monte-carlo-simulation multiprocessing multithreading numba numba-jit-compiler perfomance-python python
Last synced: 29 Jan 2025
https://github.com/leonism/customer-predictive-analysis
Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.
data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling
Last synced: 03 Feb 2025
https://github.com/yogeshnile/nifty50-index-time-series-analysis
In this repo I analyzed five years of Nifty50 data, from 01-04-2015 to 31-03-2020. The data was downloaded from the official NSE website.
data-analysis matplotlib nifty numpy pandas plotly python3 time-series-analysis
Last synced: 10 Jan 2025
https://github.com/yogeshnile/covid-19-time-series-data-analysis
This repo contains a COVID-19 time series data analysis in Python.
covid-19 covid19 data-analysis folium folium-maps pandas plotly time-series-analysis visualization
Last synced: 10 Jan 2025
https://github.com/mindlessmuse666/client-data-analysing-tool
Industrial internship project: a data analysis tool built with Python (backend, PyQt6 frontend), Pandas, Matplotlib, and SQLite. The application lets users load data in CSV format, filter it, visualize key metrics with charts, and generate reports.
data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project
Last synced: 23 Dec 2024
https://github.com/zelosleone/finncorr
A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.
aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization
Last synced: 06 Feb 2025
https://github.com/tirendazacademy/data-sets
Data sets for Tirendaz Akademi YouTube
Last synced: 01 Jan 2025
https://github.com/virajbhutada/tableau-data-vizzes
Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.
business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights
Last synced: 10 Jan 2025
https://github.com/juliusmarkwei/titanic-data-analysis
Data analysis, data visualization, feature scaling, feature transformation, model selection and model optimization.
data-analysis data-science data-visualization linear-regression model-selection regression
Last synced: 01 Jan 2025
https://github.com/kinshuk-code-1729/data-visualisation-using-python
This repository consists of several Python snippets for creating two-dimensional (2D) graphics.
data-analysis data-science data-visualization matplotlib visualization
Last synced: 12 Jan 2025
https://github.com/birkkarlsen/beam_dynamics_tools
Repository filled with functions related to the analysis of longitudinal beam dynamics measurements and simulations
accelerator-physics beam-dynamics data-analysis
Last synced: 12 Jan 2025
https://github.com/riju18/advanced-data-analysis-and-visualization
Advanced data preparation, level-of-detail calculations, animation, table calculations, etc. for data analysis and visualization.
data-analysis data-science data-visualization tableau
Last synced: 28 Jan 2025
https://github.com/tnleite/projeto_king_lift
This project presents a detailed analysis of the financial data of King Lift, a forklift rental company. Using Microsoft Excel, Power Query, and Power Pivot, I developed an interactive dashboard, also in Excel, that helps the company gain valuable insights to improve operational efficiency and increase revenue.
data-analysis data-science data-visualization excel
Last synced: 04 Feb 2025
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 12 Jan 2025
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 09 Jan 2025
https://github.com/sufiyanahmed4566/sql-musicmaven
"This Music Store Database Project showcases SQL skills through comprehensive database design, query optimization, and data analysis. Includes ER diagram, database file, query questions (Easy, Medium, Hard), answered queries, and CSV table data. Ideal for recruiters seeking skilled SQL developers for music store management and data analysis.
data-analysis database insights mysql-database oracle-database relational-databases sql
Last synced: 24 Jan 2025
https://github.com/manishjanky/analyse-ford-gobike-dataset
Analysis of the Ford GoBike dataset
data-analysis ford-gobike udacity udacity-data-analyst-nanodegree udacity-nanodegree
Last synced: 12 Jan 2025
https://github.com/michenriksen/inspectra
A simple web app for data inspection.
data-analysis decoding web-tool
Last synced: 14 Jan 2025
https://github.com/m-faizan-mahmood/house-price-prediction-machine-learning-model
Implemented a Multiple Linear Regression model to predict house prices based on square footage, number of bedrooms, and age of the house.
artificial-intelligence data-analysis data-science data-visualization machine-learning machine-learning-algorithms matplotlib neural-network numpy pandas predictive-modeling python regression-models seaborn sklearn
Last synced: 18 Jan 2025
https://github.com/ifibla/adsdb-project
Algorithms, Data Structures and Databases Project
data-analysis data-engineering python
Last synced: 28 Jan 2025
https://github.com/jfjlaros/spreadscript
SpreadScript: Use a spreadsheet as a function.
automation command-line data-analysis evaluation function interface spreadsheet
Last synced: 12 Jan 2025
https://github.com/ajimaulana123/e-commerce-data-analis
Analysis of an e-commerce dataset to determine which products customers buy the most.
Last synced: 28 Jan 2025
https://github.com/edikedik/lxtractor
Library for analysing protein structures and sequences
bioinfomatics computational-biology data-analysis data-mining feature-extraction python structural-biology
Last synced: 16 Nov 2024
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 16 Jan 2025
https://github.com/nysportsfan/Gun-Violence-in-the-US
This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.
data-analysis data-science data-visualization machine-learning python3 statistics
Last synced: 16 Nov 2024
https://github.com/eesunmoon/on-device_multimodal_er
[Research] Multimodal Emotion Recognition for On-device AI
artificial-intelligence data-analysis deep-learning embedded-systems emotion-recognition heart-rate-analysis multimodal-fusion npu on-device python speech-processing speech-recognition tensorflow wearable-devices
Last synced: 22 Dec 2024
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 24 Nov 2024
https://github.com/dhairyac/customer-churn-prediction
Analyze, visualize and predict customer churn using Machine Learning
data-analysis data-visualization ensemble-classifier machine-learning performance-metrics python-3 random-forest-classifier softmax-regression svm-classifier
Last synced: 22 Jan 2025
https://github.com/anilkumarteegala/aspiration.ai-ml-internship
This repo contains the internship project by Career Launcher.
data-analysis data-science financial internship machine-learning python3 stock-analysis stock-market visualization
Last synced: 13 Nov 2024
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 20 Jan 2025
https://github.com/sumidcyber/dataviz-master
This Python application provides a user-friendly interface to load and visualize the contents of a CSV file. Users can choose from various types of graphs and perform analyses on the dataset.
data-analysis data-analysis-project data-analysis-python database databases python python3
Last synced: 22 Jan 2025
https://github.com/aravind-selvam/bikeshare-company-analysis
Capstone project for the Google Data Analytics Professional Certificate program, analyzing a bike-sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 14 Jan 2025
https://github.com/bgr8/bokeh-ile-veri-gorsellestirme
Data visualization with Bokeh Library
bokeh color data-analysis data-visualization hbar html python vbar
Last synced: 08 Feb 2025
https://github.com/vkbo/osirisanalysis
Matlab toolbox for analysing simulation results from Osiris 3
data-analysis matlab matlab-gui physics-simulation
Last synced: 16 Nov 2024
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 22 Jan 2025
https://github.com/jxareas/de-zoomcamp-2024
Solutions for @datatalksclub's Data Engineering Zoomcamp 2024.
data-analysis data-engineering data-science database datascience de-zoomcamp docker docker-compose etl etl-pipeline mage-ai orchestration python workflow
Last synced: 20 Jan 2025
https://github.com/colburncodes/se_pudding_2023
This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app uses React and React-Chartjs-2.
chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs
Last synced: 22 Jan 2025
https://github.com/raad07/sql_project-world_layoffs_dataset
This is a SQL project comprising data cleaning in the first part and exploratory data analysis (EDA) in the second part.
data-analysis database mysql sql
Last synced: 22 Jan 2025
https://github.com/maskedsyntax/budgetpie
Android app to manage monthly budgets
android dart data-analysis data-visualization finance-management firebase flutter
Last synced: 05 Feb 2025
https://github.com/phammings/sales-management-analysis
Sales management analysis and Power BI dashboard for sample business request and user stories
data-analysis excel powerbi sql
Last synced: 15 Jan 2025
https://github.com/maskedsyntax/taskit
A simple web based Task Tracker for better focus
charts data-analysis python3 streamlit task-tracker-app todo-list
Last synced: 05 Feb 2025
https://github.com/namratha2301/best-selling-books
Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance. This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.
data-analysis data-visualization matplotlib pandas sckiit-learn seaborn
Last synced: 28 Jan 2025
https://github.com/bilal-belli/personalacademicdocuments
This repository contains some personal academic assignments; maybe it will help someone!
compilation computer-architecture data-analysis data-structures-and-algorithms database front-end hpc networking operating-systems signal-processing
Last synced: 17 Jan 2025
https://github.com/walidalsafadi/titanic-disaster
In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (i.e. name, age, gender, socio-economic class, etc.).
data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction
Last synced: 22 Jan 2025
https://github.com/phomint/udacity_dataanalysis
All projects and activities
data-analysis python udacity-nanodegree
Last synced: 15 Jan 2025
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 15 Jan 2025
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 17 Jan 2025
https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost
Titanic Survival Prediction Project (93% Accuracy) 🛳️ In this notebook, the goal is to correctly predict whether someone survived the Titanic shipwreck using different machine learning models and hyperparameter tuning.
classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost
Last synced: 17 Jan 2025
https://github.com/jpcadena/solid-principles-machine-learning
S.O.L.I.D. Principles for Machine Learning project.
clean-code data-analysis data-engineering data-science deep-learning dependency-inversion-principle design-patterns design-principles interface-segregation-principle liskov-substitution-principle machine-learning machine-learning-models mlops models open-closed-principle pylint python single-responsibility-principle software-engineering solid-principles
Last synced: 15 Jan 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of data analysis techniques using NumPy, Pandas, Matplotlib, and Seaborn.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn
Last synced: 15 Jan 2025
https://github.com/shriram-vibhute/digit_classification
This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.
classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn
Last synced: 15 Jan 2025
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 09 Jan 2025
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 28 Jan 2025
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 28 Jan 2025
https://github.com/jen-uis/la-crime-data-analysis
This repository contains project materials for the Fall 2023 MGT 256 class. This project was completed with assistance from Professor Adem Orsdemir.
business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization
Last synced: 21 Jan 2025
https://github.com/noodleslove/house-of-representative-analysis-i
This project uses public data about the stock trades made by members of the US House of Representatives.
data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading
Last synced: 28 Jan 2025
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 21 Jan 2025