Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-06 00:07:05 UTC
- JSON Representation
https://github.com/ivanildobarauna-dev/data-consumer-api
ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.
business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python
Last synced: 19 Dec 2024
https://github.com/w-edward/youtube-keyword-popularity-analyzer
An effort to discover the top trending keywords on Youtube.
data-analysis node-js numpy python webscraping youtube-api
Last synced: 16 Jan 2025
https://github.com/hongbo-wei/global-status-of-cc-security-certification
Data visualization of CC Security Certification using VUE, Django, and MySQL.
big-date common-criteria data-analysis data-visualisation data-visualization
Last synced: 14 Jan 2025
https://github.com/iguptashubham/online-retail-sales
This Power BI dashboard, designed for marketing strategists, analyzes sales trends and customer behavior. It provides key insights empowering them to identify sales opportunities and optimize marketing campaigns, ultimately boosting business sales.
dashboard data data-analysis data-analysis-project data-analysis-project-powerbi data-analysis-python data-project data-science powerbi project
Last synced: 14 Jan 2025
https://github.com/justsecret123/twitter-sentiment-analysis
A sentiment analysis model trained with Kaggle GPU on 1.6M examples, used to make inferences on 220k tweets about Messi and draw insights from their results.
classification data-analysis data-science deep-learning deep-neural-networks docker glove-embeddings kaggle lstm lstm-neural-networks machine-learning natural-language-processing nlp python rnn scikit-learn sentiment-analysis sentiment-classification tensorflow word-embeddings
Last synced: 05 Nov 2024
https://github.com/jhrcook/100daysofpython
100 days, at least 1 hour a day, of learning the Python programming language.
100-days-of-code 100daysofcode continued-learning data-analysis data-science decision-trees deep-learning keras keras-tensorflow machine-learning neural-network neural-networks plots python python3 scikit-learn tensorflow
Last synced: 13 Nov 2024
https://github.com/ivanildobarauna-dev/data-pipeline-sync-ingest
ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.
business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python
Last synced: 26 Jan 2025
https://github.com/dataopstix/modelt
Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 02 Feb 2025
https://github.com/coumbacoulibaly/adventureworkscycles
Repository for Adventure Works Sample Database Analysis
adventureworks data-analysis data-analytics mssql-database mssqlserver sql ssms
Last synced: 17 Nov 2024
https://github.com/saadarazzaq/excel-merger
Merge multiple Excel and CSV files into a single dataset with the Excel Merger Streamlit app. 📊🔄🚀
data-analysis excel pandas python streamlit-webapp
Last synced: 23 Jan 2025
https://github.com/michaelcurrin/water-crisis-scraper
Scrape and explore data related to Cape Town's water crisis (Python3 application)
cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping
Last synced: 28 Oct 2024
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 14 Jan 2025
https://github.com/1994nikunj/nlp-toolkit-desktop-app
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator
Last synced: 25 Nov 2024
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 15 Jan 2025
https://github.com/mindlessmuse666/client-data-analysing-tool
Проект производственной практики: Инструмент для анализа данных, построенный с использованием Python (бэкэнд, фронтэнд PyQt6), Pandas, Matplotlib и SQLite. Это приложение позволяет пользователям загружать данные в формате CSV, фильтровать их, визуализировать ключевые показатели с помощью графиков и создавать отчеты.
data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project
Last synced: 23 Dec 2024
https://github.com/duart38/sponge
Quickly make endpoints for testing
cms data-analysis deno developer-tools development-tools helper-tool mock server sponge testing testing-tools toolkit tools
Last synced: 13 Dec 2024
https://github.com/thennen/py-ivtools
A package for flexible and reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 24 Jan 2025
https://github.com/elhaban3ro/thewildtool
TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖
ai audio audio-processing data-analysis data-science dataset deeplearning python
Last synced: 30 Jan 2025
https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena
This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.
artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning
Last synced: 11 Jan 2025
https://github.com/tirendazacademy/data-sets
Data sets for Tirendaz Akademi Youtube
Last synced: 01 Jan 2025
https://github.com/joanacmbarros/ardm-website
Website to support the R in Pharma 2023 workshop on the ARDM
analysis-results automation clinical-data data-analysis data-model r-in-pharma
Last synced: 16 Dec 2024
https://github.com/stoverc/slots
A collection of slots-related code (initially in Python3, but perhaps more later)
data-analysis data-science monte-carlo-simulation probabilistic-programming probability probability-theory python3 slot-machine slots statistical-analysis statistics
Last synced: 11 Jan 2025
https://github.com/louislefevre/sstubs-miner
Data mining and analysis for the ManySStuBs4J dataset.
data-analysis data-mining manysstubs4j-dataset msr
Last synced: 05 Feb 2025
https://github.com/depressioncenter/data-and-design-core
Code developed by the EFDC Data and Design Core team to support mental health research.
data-analysis data-science efdc inference r statistical-analysis umich
Last synced: 25 Jan 2025
https://github.com/virajbhutada/tableau-data-vizzes
Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.
business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights
Last synced: 10 Jan 2025
https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring
A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.
data-analysis patient-monitoring
Last synced: 17 Dec 2024
https://github.com/farahibrar/kpmg-job-simulation
This repository showcases my work from the KPMG Technology Job Simulation by Forage, focusing on Data Analytics and Cloud Engineering. Explore how I tackled real-world business challenges through sales data analysis, regional growth strategies, and AWS architecture design, highlighting my analytical and technical expertise.
aws-architecture business-intelligence cloud-engineering cloud-strategy-and-design data-analysis data-visualization fintech-solutions forage kpmg kpmg-careers python-for-data-analysis sales-data-insights sustainable-retail-analysis
Last synced: 25 Jan 2025
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 01 Jan 2025
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 04 Dec 2024
https://github.com/jamesquinlan/intro-stats-mat150
Introduction to Statistics
data-analysis statistics university-course
Last synced: 18 Dec 2024
https://github.com/gjbex/python-dashboards
Repository that contains material for training sessions on creating dashboards using Python.
dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization
Last synced: 22 Nov 2024
https://github.com/iamgmujtaba/scholar_search
This project provides a tool for extracting and analyzing the quantity and distribution of scholarly articles related to a particular topic or field over a desired time span, using Google Scholar search results and built-in data visualization functionality.
academia academic academic-papers data-analysis data-visualization google google-scholar scholarly-articles
Last synced: 16 Dec 2024
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 05 Feb 2025
https://github.com/cego669/datathonengopevi
Equipe: Embrapeiros. Solução proposta para o Datathon do VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Obs: FOMOS CAMPEÕES!!!!!!!!
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 08 Dec 2024
https://github.com/al-ghaly/airline-company-data-warehouse
Data Warehouse modeling, design, implementation, and analysis for an Airline Company.
data-analysis data-warehousing database-modeling sql-server
Last synced: 22 Jan 2025
https://github.com/andr3w03/bike-sharing-dashboard
Bike Sharing Data Analysis Streamlit Dashboard
dashboard data-analysis data-visualization python streamlit
Last synced: 29 Jan 2025
https://github.com/shivamswarnkar/tesla-stock-prediction
Making prediction of close prices of Tesla Stocks using different regression methods.
data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction
Last synced: 26 Jan 2025
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and latex files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 05 Feb 2025
https://github.com/deep-diver/enron-data-analysis
Data Analysis and Machine Learning on Enron Data
data-analysis enron-data exploratory-data-analysis machine-learning
Last synced: 05 Feb 2025
https://github.com/rapidsurveys/oldr
An Implementation of the Rapid Assessment Method for Older People (RAM-OP)
assessment data-analysis epidata estimate odk older-people r ram-op ranalyticflow rapid-assessment
Last synced: 24 Dec 2024
https://github.com/viper373/baidutieba
爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)
baidutieba-crawler bert data-analysis deep-learning python spider
Last synced: 05 Feb 2025
https://github.com/jimbrig/eda
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 23 Jan 2025
https://github.com/antononcube/wl-outlieridentifiers-paclet
Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.
data-analysis hampel outlier-detection outliers
Last synced: 15 Dec 2024
https://github.com/winter000boy/dsa-practice
This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.
data-analysis data-science leetcode leetcode-python pandas-python python3
Last synced: 30 Jan 2025
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 09 Jan 2025
https://github.com/prajwalchapke055/cognizant-artificial-intelligence-job-simulation-forage
Advise one of Cognizant’s clients on a supply chain issue by applying knowledge of machine learning models.
artificial-intelligence cognizant communication data-analysis data-modeling data-visualization development evaluation forage job-simulation machine-learning machine-learning-algorithms machine-learning-engineering model-interpretation presentation problem-statement python quality-assurance skills virtual-internship
Last synced: 22 Jan 2025
https://github.com/leewannacott/datacamp-projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 30 Jan 2025
https://github.com/poga/dat-ipynb-demo
use ipython notebook to analyze data in dat archive
dat data-analysis distributed jupyter-notebook
Last synced: 15 Dec 2024
https://github.com/juliusmarkwei/concrete-data
Data analysis, machine learning, model evaluation and optimization on the Concret_ Dataset
data-analysis data-science data-visualization ensemble-learning machine-learning modeling
Last synced: 01 Jan 2025
https://github.com/leonism/customer-predictive-analysis
Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.
data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling
Last synced: 03 Feb 2025
https://github.com/worst001/note_machine_learning
整理了机器学习相关资料与手册,包括数学基础、机器学习模型实现示例、神经网络。
ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook
Last synced: 12 Jan 2025
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 04 Dec 2024
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 14 Jan 2025
https://github.com/sn2606/global-temperature-time-series
Time series analysis is performed on the Berkeley Earth Surface Temperature dataset.
arima arima-forecasting arima-model climate-change data-analysis data-visualization forecasting-model global-temperature series-analysis singular-spectrum-analysis time-series time-series-analysis time-series-forecasting
Last synced: 25 Jan 2025
https://github.com/vandita2020/merra2_nasa_wind_speed_analysis
In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms
data-analysis data-science machine-learning-algorithms python
Last synced: 11 Jan 2025
https://github.com/jshinm/web-scrapper
Web Scrapper used to extract NeuroData github repo stats
Last synced: 17 Dec 2024
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 16 Dec 2024
https://github.com/prakhar-ff13/yellow-taxi-demand-prediction
Predicting Taxi Demand in various regions of New York City
data-analysis data-analytics data-science data-visualization machine-learning python3 time-series
Last synced: 28 Jan 2025
https://github.com/atxtechbro/flightradar24
Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.
api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping
Last synced: 25 Jan 2025
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 16 Dec 2024
https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes
In this repo, I have the course contents of Data Modelling Techniques training, which will be given to Innova Technology by the cooperation of Academy Peak Information Technologies Training and Consultancy between 25 - 26 January 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization
Last synced: 29 Jan 2025
https://github.com/yahia3200/become-an-independent-data-scientist
My final project for the Applied Plotting, Charting & Data Representation in Python Course
data-analysis data-science data-visualization matplotlib
Last synced: 22 Jan 2025
https://github.com/ziaeemehr/itng_nest
Nest Simulator quick guides and examples, adding new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 01 Jan 2025
https://github.com/mynenik/xyplot-32
Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux
cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows
Last synced: 24 Jan 2025
https://github.com/nunesma/Health-analytics
Data analysis focusing on health problems
data-analysis epidemiology-analysis health-analytics python r-programming
Last synced: 04 Dec 2024
https://github.com/hyperspy/holospy-demos
HoloSpy Jupyter Notebook demos
data-analysis data-visualization electron-holography hyperspy materials-science multi-dimensional physical-sciences tutorial
Last synced: 19 Jan 2025
https://github.com/uts58/international-student-job-insights-usa
Data-driven insights on job hunting for international students in the USA, analyzing listings, roles, and trends.
career-insights cpt data-analysis eb1 eb2 eb3 h1b handshake job-analytics job-trends jobs jupyter-notebook opt python work-visa
Last synced: 25 Dec 2024
https://github.com/arv-anshul/campusx-project-notebooks
Capstone project by Campusx in DSMP course.
campusx campusx-dsmp data-analysis data-science eda jupyter-notebook machine-learning ml-project nlp project python3 recommender-system regression streamlit
Last synced: 25 Dec 2024
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 24 Jan 2025
https://github.com/quantumudit/uk-student-accommodation-analysis
This project focuses on scraping student properties related data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-goodreads-famous-quotes
This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-quotes
This project focuses on scraping all the quotes and their related data from the "Quotes To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-suez-services
This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/praju-1/pandas
The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.
Last synced: 15 Dec 2024
https://github.com/quantumudit/demographic-data-analysis
This project focuses on analyzing and finding correlations between the three important metrics by 195 countries,i.e., birth rate, internet users, and income group.
data-analysis jupyter-notebook power-bi python
Last synced: 26 Dec 2024
https://github.com/tathithienthanh/datamining-banking-dataset
Implement some learned data mining techniques and predict if the client will subscribe to a term deposit
apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization
Last synced: 25 Jan 2025
https://github.com/kenvilar/data-analysis-using-python
Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3
bs4 data-analysis jupyter pandas python python3 requests xlrd
Last synced: 18 Nov 2024
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-gamerevolution-games
This project focuses on scraping data related to video games from the GameRevolution website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/verbasik/yandex.practicum.datascience
Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.
data-analysis data-science machine-learning yandex-praktikum
Last synced: 10 Jan 2025
https://github.com/reddyprasade/pandas-practice
Pandas
daat data-analysis data-science flexible labeling missing-data missing-values pandas pandas-profiling
Last synced: 19 Jan 2025
https://github.com/casualcomputer/sql.mechanic
Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).
data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server
Last synced: 04 Dec 2024
https://github.com/iraikov/chicken-dataframe
Tabular data structure for data analysis in Scheme
chicken-scheme chicken-scheme-eggs data-analysis dataframe linear-regression scheme scheme-programming-language
Last synced: 30 Jan 2025
https://github.com/mwoss/mlflow-stock-market-example
Stock market prediction - machine learning pipeline using MLFlow.
anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial
Last synced: 24 Jan 2025
https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company
The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.
advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics
Last synced: 01 Feb 2025
https://github.com/supertetelman/kaggle-public
A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.
cnn data-analysis deep-learning machine-learning matlab python
Last synced: 28 Jan 2025
https://github.com/yusufcinarci/covid-19-data-analysis-visualization
The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.
covid-19-data-visualization data-analysis data-science data-visualization
Last synced: 26 Dec 2024
https://github.com/goggle/dataisbeautiful
Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.
data-analysis data-visualisation jupyter-notebook notebook reddit
Last synced: 26 Dec 2024
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 30 Jan 2025
https://github.com/stimulsoft/samples-dashboards.js-for-html
JavaScript samples for Dashboards.JS data visualization tool for HTML and native JavaScript applications
analytics automation components dashboard-application dashboard-designer dashboard-viewer data-analysis embedded html5 indicators javascript js json-database native-javascript onepage panels pivot-tables simple-dashboard transformation website
Last synced: 30 Jan 2025
https://github.com/mohd-faizy/07p_tumor-diagnosis-exploratory-data-analysis-on-breast-cancer-wisconsin-dataset
Tumor Diagnosis: Exploratory Data Analysis With Seaborn
data-analysis data-visualization eda exploratory-data-analysis knn-classification pca-analysis python random-forest random-forest-classifier statistics support-vector-machines tumor-detection visualization
Last synced: 12 Jan 2025
https://github.com/chrdek/linqdatacalc
📈 🎲 Linq based data statistics set of extensions.
calculations calculator data-analysis data-analytics data-science data-statictics extension-methods extensions linq linq-extensions set-theory statistical-analysis statistics
Last synced: 29 Jan 2025
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 23 Dec 2024
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn
Last synced: 09 Dec 2024
https://github.com/gustavohnsv/teamwork_mqa
Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.
data-analysis group-project r team-repo
Last synced: 16 Dec 2024
https://github.com/njoyedevs/chatgpt3_riskanalyzer
In this project, ChatGPT3 was fine tuned on 9 data series spanning 40 years. This helped train ChatGPT3 to provide a market risk score. To view, visit: https://www.aimarketrisk.com
chatgpt3 data-analysis flask fred-api full-stack-web-development pandas python
Last synced: 30 Jan 2025