Projects in Awesome Lists tagged with dataanalysis
A curated list of projects in awesome lists tagged with dataanalysis .
https://github.com/tanu-n-prabhu/python
This repository helps you understand python from the scratch.
data dataanalysis datascraping google-colab google-colab-notebook jupyter-notebook machine-learning numpy numpy-arrays pandas-dataframe prediction python python-3 python3
Last synced: 14 May 2025
https://github.com/Tanu-N-Prabhu/Python
This repository helps you understand python from the scratch.
data dataanalysis datascraping google-colab google-colab-notebook jupyter-notebook machine-learning numpy numpy-arrays pandas-dataframe prediction python python-3 python3
Last synced: 15 Apr 2025
https://github.com/prateekiiest/code-sleep-python
Awesome Projects in Python - Machine Learning Applications, Games, Desktop Applications all in Python :snake:
analysis caesar-cipher case-study cipher classification dataanalysis hacktoberfest hamlet kharagpur-winter-of-code projects python python-script social-network-analysis translation
Last synced: 12 Apr 2025
https://github.com/prateekiiest/Code-Sleep-Python
Awesome Projects in Python - Machine Learning Applications, Games, Desktop Applications all in Python :snake:
analysis caesar-cipher case-study cipher classification dataanalysis hacktoberfest hamlet kharagpur-winter-of-code projects python python-script social-network-analysis translation
Last synced: 09 May 2025
https://github.com/mtahiraslan/data-analyst-roadmap
Based on my own experience, I think this roadmap will answer all the questions of how to become a data analyst from zero, which technologies and programming languages are better to know, what kind of soft skills do we need, how do I start my professional career in this field.
blogs businessintelligence courses data dataanalysis dataanalyst excel interview mtahiraslan powerbi programming python resources resume roadmap softskills sql statistics tableau tutorials
Last synced: 14 Mar 2025
https://github.com/akabe/ocaml-jupyter
An OCaml kernel for Jupyter (IPython) notebook
dataanalysis datascience functional-programming jupyter jupyter-kernels jupyter-notebook machine-learning ocaml ocaml-kernel ocaml-repl
Last synced: 28 Dec 2025
https://github.com/cs-mohamedayman/data-science-case-studies
Data Science Case Studies for computer science students.
casestudies dashboard dataanalysis datacamp datascience datascienceindustries deeplearning excel googlesheets hackerrank kaggle leetcode machinelearning powerbi powerpoint sql tableau
Last synced: 23 Feb 2025
https://github.com/ptyadana/mysql-tableau-for-data-analytics-and-business-intelligence
collection of SQL - Tableau integration projects for Data Analytics and Business Intelligence
business-analytics business-intelligence csv data-analysis data-analytics data-visualizations dataanalysis datavisualization integration mysql mysqlworkbench sql tableau tableau-desktop tableau-public
Last synced: 04 Sep 2025
https://github.com/caioricciuti/duck-ui
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, and keyboard shortcuts, all running seamlessly in the browser using DuckDB's WebAssembly (WASM) capabilities.
data-science data-visualization dataanalysis datanalytics duckdb local
Last synced: 04 Apr 2025
https://github.com/geodacenter/rgeoda
R library for spatial data analysis based on libgeoda and GeoDa
dataanalysis geoda geospatial r
Last synced: 04 Apr 2025
https://github.com/GeoDaCenter/rgeoda
R library for spatial data analysis based on libgeoda and GeoDa
dataanalysis geoda geospatial r
Last synced: 13 Jul 2025
https://github.com/cis-team/datascience-squad
Data Science Squad Roadmap
cis-team computer-science data-science dataanalysis
Last synced: 20 Feb 2025
https://github.com/akabe/docker-ocaml-jupyter-datascience
Dockerfiles for data science in OCaml on Jupyter
dataanalysis datascience docker dockerfile functional-programming jupyter-notebook machine-learning ocaml
Last synced: 10 Apr 2025
https://github.com/gher-uliege/diva
DIVA (Data-Interpolating Variational Analysis) is a software tool dedicated to the spatial interpolation of in situ data in oceanography.
analysis dataanalysis emodnet interpolation ocean-data ocean-sciences oceanography odv seadatacloud seadatanet
Last synced: 30 Mar 2025
https://github.com/georgehanymilad/data-analysis-and-bi-resources
Data Analysis and BI Resources 📊
business-intelligence data-visualization dataanalysis database excel powerbi python sql tableau
Last synced: 14 Apr 2025
https://github.com/yahoo/cubed
Data Mart As A Service
bigdata businessintelligence dataanalysis datamart etl funnel-analysis saas
Last synced: 03 Aug 2025
https://github.com/hexastack/eazychart
EazyChart is a reactive chart library 📈, it allows you to easily add SVG charts in your React and Vue web applications.
chart charts d3 data dataanalysis dataviz graphs hacktoberfest hacktoberfest2022 javascript library react typescript visualization vue web
Last synced: 20 Oct 2025
https://github.com/tushar2704/sql-portfolio
Collection of personal SQL projects and queries I've worked on, showcasing my skills and expertise in database management, data analysis, and data manipulation using SQL.
data data-analytics data-science dataanalysis datamanipulation machine-learning mysql postgresql sql streamlit-tushar2704 tushar2704
Last synced: 07 May 2025
https://github.com/hemansnation/python-for-beginners
Course for Python Beginners
dataanalysis python pythonforbeginner
Last synced: 15 Apr 2025
https://github.com/harrystaley/open-source-data-science-degree-python
A fully curated, open-source Data Science curriculum focused on Python. Includes top-tier university courses (MIT, Stanford, Princeton) covering essential topics in computer science, data analysis, machine learning, and statistics — everything you need to build a solid foundation in Data Science, 100% free.
data data-science dataanalysis datasci ds open open-source py python python3 science source statistics
Last synced: 13 Apr 2025
https://github.com/uditmahato/heart-attack-analysis
"Heart Attack Analysis" - A data science project for predicting heart attacks using machine learning on health-related data.
dataanalysis heartattack jupyternotebook python
Last synced: 13 Apr 2025
https://github.com/pingcap/dbt-tidb
A dbt adapter for TiDB
dataanalysis database dbt mysql python sql tidb
Last synced: 05 Aug 2025
https://github.com/adritpal08/eda-and-ml-model-training-of-student-performance-data
The Exploratory Data Analysis and Machine Learning Model Training for the Student Performance Data
dataanalysis exploratory-data-analysis machine-learning python student-performance-analysis
Last synced: 06 Oct 2025
https://github.com/arm-university/arduino-projects-for-schools
Arduino MKR Projects for Schools is a colourful entry-level resource, which introduces learners to the exciting world of microcontrollers, the Internet of Things and Data Science. Learners use both simulators and physical devices to build systems and solve real-life problems.
arduino arduinomkr computerscience computing cs dataanalysis dataanalysisusingpython datascience education embeddedsystems iot pbl physicalcomputing
Last synced: 23 Apr 2025
https://github.com/developer-student-clubs/dataxchange
Welcome to DataScience Collaborative, a community-driven data science project where data enthusiasts, analysts, and machine learning practitioners come together to collaborate on data analysis tasks and projects. Whether you're a seasoned data scientist or just getting started with data analysis, this is the place to learn, contribute, and grow you
data-sciense dataanalysis hacktoberfest machine-learning
Last synced: 23 Jun 2025
https://github.com/cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas
Análise de dados sobre cotas de gênero e seu impacto nas eleições e proposições legislativas da Câmara dos Deputados Federais entre 1934 e 2021. Parte do TCC da pós-graduação em Inteligência Artificial e Aprendizado de Máquina na @pucminas
dataanalysis pandas preprocessing-data python randomforestclassifier
Last synced: 20 Sep 2025
https://github.com/tushar2704/data-portfolio
This repository showcases my skills and experience in the field of data analysis. Here, you will find a collection of projects and analyses that demonstrate my ability to extract insights and make data-driven decisions.
artificial-intelligence data-science dataanalysis postgresql python r sql streamlit-tushar2704 tushar2704
Last synced: 07 May 2025
https://github.com/MohammedSardar/Bive
Bive is a Kurdish profanity language processing project.
data dataanalysis kurdish kurdish-corpus kurdish-dataset kurdish-language-processing kurdishdata kurdishnlp
Last synced: 07 May 2025
https://github.com/tanishq-ctrl/consumer-personality-analysis
This project focuses on analyzing customer behavior and spending patterns using a comprehensive dataset. Through advanced data visualization and analysis techniques, we aim to uncover actionable insights to improve marketing strategies, optimize product targeting, and enhance customer engagement.
dataanalysis dataanalytics matplotlib numpy pandas python seaborn
Last synced: 14 Jun 2025
https://github.com/karan-malik/prepdata
Automating the process of Data Preprocessing for Data Science
classification data dataanalysis dataframe datapreprocessing datascience machine-learning numpy pandas pip preprocessing pypi-package python python3 random-forest regress sklearn
Last synced: 13 Apr 2025
https://github.com/ahammadnafiz/predicta
Predicta: Simplify your workflow with our powerful data analysis and machine learning tool.
analytics data-science data-visualization dataanalysis machine-learning pandas project python streamlit streamlit-webapp webapp
Last synced: 28 Jul 2025
https://github.com/akashkobal/data-science
I'm excited to share my data science project🚀, where I've applied various techniques and insights to solve a specific problem. The project follows best practices for maintainability and reproducibility, using the Data Science Project Template. Dive into the project to explore the code, datasets, documentation, and resources that showcase MyJourney
akash akash-kobal akashkobal applied-data-science artificial-intelligence classification data-science dataanalysis dataanalytics datascienceproject datascientist deep-learning kobal machine-learning prediction regression
Last synced: 26 Jul 2025
https://github.com/sondosaabed/data-analyst-nanodegree
I aquired a full scholarship from Google Launchpad. Advanced data wrangling skills to work with messy, complex real-world datasets. Highly customized visualizations using the Matplotlib Python library
data-science dataanalysis datawrangling nanodegree python udacity-nanodegree
Last synced: 09 Apr 2025
https://github.com/clarifai/clarifai-python-datautils
Extract Transform and Load unstructured data into the Clarifai's AI platform
dataanalysis dataengineering ingestion ingestion-pipeline unstructured-data unstructured-data-analysis unstructured-image unstructured-text
Last synced: 18 Oct 2025
https://github.com/adritpal08/customer-churn-analysis-report-using-power-bi
Customer Churn Analysis Report using powerbi
customer-churn-analysis dataanalysis datamodeling datavisualization powerbi
Last synced: 03 Jan 2026
https://github.com/farahibrar/programming-in-python
Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.
beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow
Last synced: 16 Jul 2025
https://github.com/adritpal08/export-and-import-data-analysis-dashborad-using-power-bi
Export and Import Data Analysis Dashboard using Power BI
dataanalysis datamodeling datavisualization export-import powerbi
Last synced: 17 Aug 2025
https://github.com/praveendecode/youtube-data-harvesting-warehousing
Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies
api apiintegration dataanalysis dataharvesting datawarehousing eda mongodb postgres python sql
Last synced: 16 Aug 2025
https://github.com/sevdanurgenc/pythonsamples
The scope of this project includes examples of data analysis related to python.
artifical-intelligence data dataanalysis datascience machine-learning oop python
Last synced: 11 Oct 2025
https://github.com/tirendazacademy/hands-on-data-science-with-gcp
Google BigQuery Tutorial
big-data big-data-analytics bigdata bigquery bigquery-ml bigqueryml cloud-computing data-analysis data-analytics data-engineering data-science dataanalysis dataengineering google-bigquery google-cloud-platform machienlearning machine-learning
Last synced: 06 Oct 2025
https://github.com/adritpal08/e-commerce-sales-analysis-dashboard-using-power-bi
E-Commerce Sales Analysis Dashboard using Power BI
dataanalysis datamodeling datavisualization e-commerce-project powerbi
Last synced: 03 Jul 2025
https://github.com/vidhi1290/brain-tumor-detection
Brain Tumor Detection using CNN: Achieving 96% Accuracy with TensorFlow: Highlights the main focus of your project, which is brain tumor detection using a Convolutional Neural Network (CNN) implemented in TensorFlow. It also emphasizes the impressive achievement of reaching 96% accuracy, which showcases the effectiveness of your model.
artificial-intelligence artificial-neural-networks braintumorclassification braintumour convolutional-neural-networks data-science dataanalysis datavisualization deep-learning deep-neural-networks kaggle machine-learning machine-learning-algorithms python tensorflow
Last synced: 10 Apr 2025
https://github.com/shervinnd/bazar_app_store_eda
Bazar App Data analysis code to find the most downloaded category and most popular installed apps
data data-analysis data-science dataanalysis eda python
Last synced: 15 Apr 2025
https://github.com/varunbanka/data-insights
Data Insights is a user-friendly tool for analyzing large CSV files. Its advanced analytics helps uncover hidden patterns and trends, making it perfect for data scientists and analysts.
artificial-intelligence automation data-analysis data-science dataanalysis datahive numpy pandas python
Last synced: 22 Jun 2025
https://github.com/ashishpatel26/parul-university-sttp-on-data-analysis-with-python
Data Analysis with Python STTP at Parul University
data-science dataanalysis python
Last synced: 14 May 2025
https://github.com/easonlai/databricks_odbc_connection_to_azure_sql_db_with_azure_ad_user_access_token
Making ODBC connection from Databricks (Azure Databricks) to Azure SQL Database with Azure AD User Access Token.
azure azuread azuredatabricks azuresql azuresqldb bigdata data-analysis dataanalysis dataanalytics databricks databricks-notebooks datascience microsoft microsoft-azure microsoftazure odbc odbc-driver pandas pyodbc spark
Last synced: 25 Feb 2025
https://github.com/huseyincenik/data_science
Data Science materials
data data-science data-structures data-visualization dataanalysis dataengineering datapreparation dataprocessing datascience dataset time-series time-series-analysis timeline timeseries timeseries-analysis timeseriesforecasting
Last synced: 25 Jul 2025
https://github.com/rajkhanke/pune_houseprice_prediction_r
The repository contains a Pune house price prediction system build using R programming Language. The System efficiently calculates and analyze house prices in multiple areas across Pune using machine learning models and Data science and analytical tools
algorithms data-science data-visualization dataanalysis eda machine-learning r
Last synced: 07 Oct 2025
https://github.com/mchenryspagg/google-play-store-apps-analysis-visualization
An analysis and visualization of google play store apps scraped data for the period of 2010 - 2018 . This project aims at cleaning the dataset, analyzing the given dataset, and mining informational quality insights. This project also involves visualizing the data to better and easily understand trends and different categories.
dataanalysis datacleaning datavisualization documentation mysql powerbi preprocessing python sql
Last synced: 20 Feb 2025
https://github.com/ersinaksar/make-your-jarvis-usin-gpt-3-and-python
Python to convert audio input from the microphone to text, generate a response from GPT-3 using the OpenAI API, convert the response to speech using the gTTS library, and save the audio to a file.
artificial-intelligence data-science dataanalysis gpt-3 jarvis machine-learning neural-network python smartassistant voice-assistant voice-commands
Last synced: 13 Apr 2025
https://github.com/tommaso-dognini/astropi_f2d2
RaspberryPi camera experiment promoted by ESA to be run on the ISS, written in python.
dataanalysis image-classification image-processing matplotlib python raspberry-pi skyfield
Last synced: 11 Apr 2025
https://github.com/nafisalawalidris/blockchain-transaction-analysis-and-fund-tracing
A project for analysing blockchain transactions and tracing fund movements using Tronscan APIs. Verifies recipient addresses, checks transaction accuracy, and maps fund flows to detect discrepancies, ensuring transparency and integrity in cryptocurrency transactions.
apiintegration apitools blockchainanalysis blockchaindevelopment cryptoanalysis cryptoinvestigation dataanalysis dataanalysisusingpython datascience fundtracing machinelearning python webscraping
Last synced: 08 Apr 2025
https://github.com/vrm-piyush/python-projects
Open source Python Projects. Feel Free to contribute!
data dataanalysis games open-source pygame-games python python-app
Last synced: 30 Oct 2025
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 04 Mar 2025
https://github.com/pavankethavath/microsoft-classifying-cybersecurity-incidents-with-ml
A machine learning pipeline for classifying cybersecurity incidents as True Positive(TP), Benign Positive(BP), or False Positive(FP) using the Microsoft GUIDE dataset. Features advanced preprocessing, XGBoost optimization, SMOTE, SHAP analysis, and deployment-ready models. Tools: Python, scikit-learn, XGBoost, LightGBM, SHAP and imbalanced-learn
classificationreport correlation-analysis dataanalysis decision-tree-classifier exploratory-data-analysis feature-engineering feature-selection gradientboosting hyperparameter-tuning joblib lgbmclassifier logistic-regression machine-learning modelselection pandas randomforestclassifier randomsearchcv shap smote xgboost-classifier
Last synced: 23 Apr 2025
https://github.com/sumitgirwal/google-play-store-data-analysis
This Google Play Store dataset from Kaggle, analysis using Python, NumPy , Pandas , and Matplotlib.In dataset analysis, you can view the best app 5-star rating app, most review app, or most download app, etc.
dataanalysis matplotlib numpy pandas playstore-data-analysis python3
Last synced: 08 May 2025
https://github.com/newking9088/sql-guide-to-solve-complex-data-science-problems
This is a comprehensive SQL guide for both MySQL users and PostgreSQL users, covering topics from basic `SELECT` statements to advanced window functions. My SQL learning journey and the suggestions from my mentees, colleagues and seniors to document this were the motivations to write this document.
dataanalysis datascience-machinelearning mysql postgresql sql
Last synced: 28 Oct 2025
https://github.com/yassin522/data-sceince-projects
data-science dataanalysis machine-learning python streamlit
Last synced: 21 Jun 2025
https://github.com/sumitgirwal/super-market-research
This is simple research on supermarket.its must you know how your organization or corporation growth is increasing or decreasing.
dataanalysis matplotlib numpy pandas python3 supermarket-dataset
Last synced: 03 Oct 2025
https://github.com/sumitgirwal/pokemon-dataset-data-analysis
This is a simple analysis of Pokemon dataset from Kaggle using Python , NumPy, Pandas, and Matplotlib.
dataanalysis kaggledatasets matplotlib numpy pandas-python pokemon-dataset python3
Last synced: 30 Dec 2025
https://github.com/code-str8/income-prediction-challange
The goal of this task is to develop a machine learning model capable of forecasting if a person's income surpasses or falls below a predetermined threshold.
classification dataanalysis machine-learning streamlit
Last synced: 16 Aug 2025
https://github.com/waldohidalgo/desafioclase2inmersiondedatos
Repositorio con el notebook y el código con el cual resuelvo de modo completo los desafíos dejados por los instructores en la segunda clase de la semana de Inmersión de Datos
dataanalysis inmersionaluralatamdatos python
Last synced: 08 Oct 2025
https://github.com/sukanyabag/alexa-reviews-sentiment-analysis
This repository's notebook provides the insight of customer reviews on Amazon's kid Alexa, by data visualization, sentiment analysis and classification using NLP techniques.
dataanalysis datavisualization kaggle-dataset natural-language-processing sentiment-analysis sentiment-classification
Last synced: 19 Jul 2025
https://github.com/trainingbypackt/beginning-data-science-with-python-and-jupyter-elearning
Perform reproducible data analyses with these data exploration tools
dataanalysis datascience html jupyter jupyter-notebook python
Last synced: 01 Sep 2025
https://github.com/arya920/wasibi_bank_atm_transaction_project
This data analytics project focuses on analyzing ATM transactions for Wisabi Bank, utilizing Power BI for comprehensive insights.
data-visualization dataanalysis dax-languague powerbi powerquery
Last synced: 06 Jan 2026
https://github.com/sarincr/data-visualizations-and-dashboards
Data visualization is an interdisciplinary field that deals with the graphic representation of data. It is a particularly efficient way of communicating when the data is numerous as for example a Time Series.
analytics artificial-intelligence big-data bokeh businessintelligence dashboard data-science dataanalysis dataanalytics datavisualization dataviz deep-learning machine-learning matplotlib plotly python python3 seaborn visualization
Last synced: 14 Mar 2025
https://github.com/bhavik-jikadara/movie-recommendation-system
Recommendation systems are among the most popular applications of data science. They are used to predict the Rating or Preference that a user would give to an item.
data-science dataanalysis deep-learning machine-learning movie-recommendation movierecommendationsystem python
Last synced: 14 Jul 2025
https://github.com/ctroupin/ctroupin.github.io
Personal webpage
data-visualization dataanalysis julia leaflet oceanography python remote-sensing running science trail-running
Last synced: 13 Oct 2025
https://github.com/tushar2704/pizza-sales-analysis
This repository contains valuable insights and visualizations derived from an extensive Pizza dataset with over 48,000 rows.
dashboard data-science dataanalysis excel portfolio postgresql powerbi sql streamlit-tushar2704 tushar2704
Last synced: 04 Nov 2025
https://github.com/billy-enrizky/sales-analysis
"Sales Data Analysis Project: Analyzing sales data, cleaning, and exploring insights. Python and Pandas used for data analysis."
dataanalysis exploratory-data-analysis jupyter-notebook pandas python
Last synced: 16 Oct 2025
https://github.com/faizanmohd5/web-scraping-iphone-11-reviews
This is a web scraping project that extracts customer reviews for the iPhone 11 from Flipkart.com using Python and BeautifulSoup. The extracted data is saved in a CSV file for further analysis. Use it as a starting point for your own web scraping projects or for analyzing customer reviews of the iPhone 11.
beautifulsoup csv data-visualization dataanalysis dataextraction datainsights datamining datapreprocessing ecommerce-website ipython-notebook jupyter-notebook python reviews reviewscrapper webscraping
Last synced: 13 Jun 2025
https://github.com/kingabzpro/kaggle-competition-2020
In the past decade, computer science has evolved and the importance of Data Science has become the new norm, every company s looking to invest more in Machine learning. This is where someone like me who never had an Interest before got curious and started learning more about this world of Kaggle, it took me 3 months to get hold of some of the basic and advanced tools used by Data Scientists. I am using those same tools to evaluate this data set and come up with the best conclusion. In this Notebook, I will be telling you the story of data and I will be sharing my own experience so that any beginner can learn from my mistake and get ahead.
beginner-project data-visualization dataanalysis kaggle-competition learning python3
Last synced: 11 Mar 2025
https://github.com/yashmistry-24/ytcomment-iq
YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.
analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube
Last synced: 06 Oct 2025
https://github.com/coderixc/pricefeedx
This Module will load NSE Top 100 Stocks and Its Price (Current LTP and 5 Day Previos Price) will store all NSE Stocks Symbol in Database(using Mysql) .
algorithms algotrading bhavcopy bse dataanalysis equity kiteconnect multicharts nse quant-dev stock-market
Last synced: 21 Mar 2025
https://github.com/akashash01/restaurant-analysis
Power BI report created for Restaurant sales analysis under certain conditions and requirements.
dashboard dataanalysis dax grouping-and-summarizing powerbi sales
Last synced: 01 Apr 2025
https://github.com/elfgk/kc-house-data-analysis
KC House Data Analysis
classification dataanalysis dataanalysis-projects dataanalysisusingpython dataanalystportfolio jupyter-notebook machine-learning python regression-models xgbregressor
Last synced: 24 Jun 2025
https://github.com/pavankethavath/car_dekho_car_price_prediction
A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.
dataanalysis datacleaning datapreprocessing eda encoding feature-extraction feature-selection featureimportance fine-tuning machine-learning minmaxscaling normalization pandas pickle prediction-model python random-forest randomsearch-cv regression streamlit
Last synced: 23 Apr 2025
https://github.com/vidhi1290/python-course
Welcome to the Python Mastery Course Repository! Discover Python's depth and power through our meticulously crafted modules. From foundational concepts to advanced techniques, master loops, functions, data analysis with Pandas, and more. Join us in this exciting journey to Python proficiency! 🐍🚀
artificial-intelligence data-science dataanalysis debugging functions machine-learning merging-algorithms mongodb multithreading mysql-database numpy numpy-arrays object-oriented-programming pandas python pythonbasics regex
Last synced: 27 Aug 2025
https://github.com/thecoderpinar/ml-perceptron-project
This project demonstrates the implementation of the Perceptron algorithm for binary classification tasks. It includes various advanced features such as data augmentation, feature engineering, and deep learning techniques to enhance model performance and robustness.
artificialintelligence binaryclassification dataanalysis datascience deep-learning jupyter-notebook machinelearning opensource perceptronalgorithm programming python
Last synced: 10 Jun 2025
https://github.com/samridhisainii/searchdata-analyzer
Investigating if search traffic predicts stock success, analyzing MercadoLibre's data, and forecasting revenue using Prophet.
Last synced: 23 Mar 2025
https://github.com/omkarpattnaik8080/pandas
Using dataset from kaggle I am implementing basic and advanced panda operations to analyze data from the dataset
colab-notebook data-science dataanalysis jupyter-notebook machine-learning pandas python
Last synced: 03 Mar 2025
https://github.com/emso-exe/venda_de_medicamentos_controlados_e_antimicrobianos_-_industrializados
Projeto de análise de vendas de medicamentos controlados por um período de 12 meses e perfil dos consumidores com base nos dados disponibilizados pela Anvisa.
analise-de-dados anvisa dataanalysis dataanalyst dataanalytics medicament medicamento medicamentos medicaments python python-3 python3
Last synced: 16 Jun 2025
https://github.com/davidemiceli/social-analytics-platform
Open Source Web Platform for Social Media Analytics
analytics artificial-intelligence bigdata dataanalysis datascience hapijs marketing monitoring nodejs socialmedia vuejs2
Last synced: 23 Aug 2025
https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data
This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.
aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python
Last synced: 03 Jan 2026
https://github.com/yassin522/data_science_intern_at_shai_for_ai
The Data Science responsible for: Assisting with data collection, cleaning and preparation, Building and testing predictive models, Developing data visualizations to communicate insights, Staying up-to-date with the latest advancements in data science and machine learning.
data-science dataanalysis machine-learning python
Last synced: 07 Apr 2025
https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis
Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.
data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn
Last synced: 17 Aug 2025
https://github.com/nagipragalathan/animal_prediction
Animal Prediction is a web application developed using Python and Django that allows users to predict the animal species based on uploaded photos. The application utilizes machine learning algorithms to analyze the image data and provide accurate predictions.
animalclassification animalprediction animalrecognition computervision dataanalysis datascience deeplearning django githubproject imageclassification imageprocessing machinelearning opensource predictiveanalysis python speciesprediction webapplication
Last synced: 11 Nov 2025
https://github.com/kingabzpro/digital-learning-during-covid19-eda
In this project, we will be using data analysis tools to figure out trends in digital learning and how it is effective towards improvised communities. We will be comparing districts and states on factors like demography, internet access, learning product access, and finance.
covid-19 data-science dataanalysis eda education learnplatform usa
Last synced: 10 Aug 2025
https://github.com/fortunewalla/airportdb
We used the http://flughafendb.cc/ airport database (small) and converted most of the fields to English.
airport airport-data airport-db airportdb dataanalysis database datascience flughafen flughafendb heatwave machinelearning mysql oracle sample-database sql sqlquery
Last synced: 31 Jul 2025
https://github.com/georgehanymilad/heart-disease-detection
Machine Learning Project
anaconda classification classification-algorithm dataanalysis datascience kaggle machine-learning machine-learning-algorithms matplotlib numpy pandas python python3 seaborn
Last synced: 31 Jul 2025
https://github.com/codewithjaspreet/hr_viz
Human Resource Data Visualisation
dataanalysis datavisualization sql tableau
Last synced: 12 Aug 2025
https://github.com/sumitgirwal/drinks-dataset-data-analysis
A drinks dataset from Kaggle. Applying analysis using Python, NumPy, Pandas, Matplotlib.
dataanalysis drinks-dataset kaggle matplotlib numpy pandas python3
Last synced: 16 Oct 2025
https://github.com/1sumer/power-bi
Explore a diverse collection of Power BI projects that delve into various facets of data analysis and visualization. This repository features four distinct projects, each demonstrating advanced techniques in data analysis, visualization, and business intelligence using Power BI.
dashboard dataanalysis powerbi powerquery schema
Last synced: 19 Jul 2025
https://github.com/tanishq-ctrl/cyberattack-analysis-and-insights
This repository contains an in-depth analysis of a cybersecurity dataset. The primary goal is to identify patterns, vulnerabilities, and trends in cyberattacks by leveraging various visualizations and statistical insights. The project provides actionable insights for enhancing cybersecurity measures.
cyberattack cyberattacks data-science data-visualization dataanalysis dataanalysisusingpython dataanalytics
Last synced: 28 Jun 2025
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 17 Jul 2025
https://github.com/dpgitaccount/data-analytics-repo
"Comprehensive data analytics toolkit: In this Python , SQL, Excel, Jupyter and visualizations is contain for data exploration, cleaning, modeling, and insights."
Last synced: 05 Mar 2025
https://github.com/arfazrll/data-mining-competition
Repository ini berisi partisipasi saya dalam kompetisi ADIKARA 2024 - Data Mining Competition. Repository ini terkait mengembangkan model prediksi Food Price Index menggunakan dataset spatiotemporal.
dataanalysis datamining kaggle-competition machine-learning predictive-modeling spatiotemporal-forecasting
Last synced: 24 Jul 2025
https://github.com/hectorta1989/802.15.4-wireless-mac-level-performance
MAC-level performance and parameters dataset for 802.15.4 wireless networks.
802154 dataanalysis datasets layer2 lr-wpan macprotocol python3 wireless
Last synced: 28 Jul 2025
https://github.com/syedfaiqueali/the-sparks-foundation
Data Science and Business Analytics Tasks
data-science dataanalysis sparksfoundation-intern
Last synced: 13 Aug 2025