Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with dataanalysis

A curated list of projects in awesome lists tagged with dataanalysis .

https://github.com/prateekiiest/code-sleep-python

Awesome Projects in Python - Machine Learning Applications, Games, Desktop Applications all in Python :snake:

analysis caesar-cipher case-study cipher classification dataanalysis hacktoberfest hamlet kharagpur-winter-of-code projects python python-script social-network-analysis translation

Last synced: 18 Dec 2024

https://github.com/prateekiiest/Code-Sleep-Python

Awesome Projects in Python - Machine Learning Applications, Games, Desktop Applications all in Python :snake:

analysis caesar-cipher case-study cipher classification dataanalysis hacktoberfest hamlet kharagpur-winter-of-code projects python python-script social-network-analysis translation

Last synced: 15 Nov 2024

https://github.com/mtahiraslan/data-analyst-roadmap

Based on my own experience, I think this roadmap will answer all the questions of how to become a data analyst from zero, which technologies and programming languages are better to know, what kind of soft skills do we need, how do I start my professional career in this field.

blogs businessintelligence courses data dataanalysis dataanalyst excel interview mtahiraslan powerbi programming python resources resume roadmap softskills sql statistics tableau tutorials

Last synced: 25 Oct 2024

https://github.com/GeoDaCenter/rgeoda

R library for spatial data analysis based on libgeoda and GeoDa

dataanalysis geoda geospatial r

Last synced: 22 Nov 2024

https://github.com/gher-uliege/diva

DIVA (Data-Interpolating Variational Analysis) is a software tool dedicated to the spatial interpolation of in situ data in oceanography.

analysis dataanalysis emodnet interpolation ocean-data ocean-sciences oceanography odv seadatacloud seadatanet

Last synced: 11 Dec 2024

https://github.com/hexastack/eazychart

EazyChart is a reactive chart library 📈, it allows you to easily add SVG charts in your React and Vue web applications.

chart charts d3 data dataanalysis dataviz graphs hacktoberfest hacktoberfest2022 javascript library react typescript visualization vue web

Last synced: 17 Dec 2024

https://github.com/uditmahato/heart-attack-analysis

"Heart Attack Analysis" - A data science project for predicting heart attacks using machine learning on health-related data.

dataanalysis heartattack jupyternotebook python

Last synced: 15 Nov 2024

https://github.com/pingcap/dbt-tidb

A dbt adapter for TiDB

dataanalysis database dbt mysql python sql tidb

Last synced: 06 Nov 2024

https://github.com/developer-student-clubs/dataxchange

Welcome to DataScience Collaborative, a community-driven data science project where data enthusiasts, analysts, and machine learning practitioners come together to collaborate on data analysis tasks and projects. Whether you're a seasoned data scientist or just getting started with data analysis, this is the place to learn, contribute, and grow you

data-sciense dataanalysis hacktoberfest machine-learning

Last synced: 23 Nov 2024

https://github.com/arm-university/arduino-projects-for-schools

Arduino MKR Projects for Schools is a colourful entry-level resource, which introduces learners to the exciting world of microcontrollers, the Internet of Things and Data Science. Learners use both simulators and physical devices to build systems and solve real-life problems.

arduino arduinomkr computerscience computing cs dataanalysis dataanalysisusingpython datascience education embeddedsystems iot pbl physicalcomputing

Last synced: 29 Nov 2024

https://github.com/cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas

Análise de dados sobre cotas de gênero e seu impacto nas eleições e proposições legislativas da Câmara dos Deputados Federais entre 1934 e 2021. Parte do TCC da pós-graduação em Inteligência Artificial e Aprendizado de Máquina na @pucminas

dataanalysis pandas preprocessing-data python randomforestclassifier

Last synced: 13 Dec 2024

https://github.com/ahammadnafiz/predicta

Predicta: Simplify your workflow with our powerful data analysis and machine learning tool.

analytics data-science data-visualization dataanalysis machine-learning pandas project python streamlit streamlit-webapp webapp

Last synced: 03 Dec 2024

https://github.com/29dch/pythondataanalysis

网易云课堂py数据分析demo代码

dataanalysis python

Last synced: 11 Nov 2024

https://github.com/akashkobal/data-science

I'm excited to share my data science project🚀, where I've applied various techniques and insights to solve a specific problem. The project follows best practices for maintainability and reproducibility, using the Data Science Project Template. Dive into the project to explore the code, datasets, documentation, and resources that showcase MyJourney

akash akash-kobal akashkobal applied-data-science artificial-intelligence classification data-science dataanalysis dataanalytics datascienceproject datascientist deep-learning kobal machine-learning prediction regression

Last synced: 05 Dec 2024

https://github.com/praveendecode/youtube-data-harvesting-warehousing

Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies

api apiintegration dataanalysis dataharvesting datawarehousing eda mongodb postgres python sql

Last synced: 17 Dec 2024

https://github.com/ersinaksar/make-your-jarvis-usin-gpt-3-and-python

Python to convert audio input from the microphone to text, generate a response from GPT-3 using the OpenAI API, convert the response to speech using the gTTS library, and save the audio to a file.

artificial-intelligence data-science dataanalysis gpt-3 jarvis machine-learning neural-network python smartassistant voice-assistant voice-commands

Last synced: 12 Dec 2024

https://github.com/ashishpatel26/parul-university-sttp-on-data-analysis-with-python

Data Analysis with Python STTP at Parul University

data-science dataanalysis python

Last synced: 19 Nov 2024

https://github.com/farahibrar/programming-in-python

Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.

beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow

Last synced: 06 Dec 2024

https://github.com/sevdanurgenc/pythonsamples

The scope of this project includes examples of data analysis related to python.

artifical-intelligence data dataanalysis datascience machine-learning oop python

Last synced: 30 Nov 2024

https://github.com/sondosaabed/data-analyst-nanodegree

I aquired a full scholarship from Google Launchpad. Advanced data wrangling skills to work with messy, complex real-world datasets. Highly customized visualizations using the Matplotlib Python library

data-science dataanalysis datawrangling nanodegree python udacity-nanodegree

Last synced: 06 Nov 2024

https://github.com/thecoderpinar/ml-perceptron-project

This project demonstrates the implementation of the Perceptron algorithm for binary classification tasks. It includes various advanced features such as data augmentation, feature engineering, and deep learning techniques to enhance model performance and robustness.

artificialintelligence binaryclassification dataanalysis datascience deep-learning jupyter-notebook machinelearning opensource perceptronalgorithm programming python

Last synced: 16 Dec 2024

https://github.com/coderixc/pricefeedx

This Module will load NSE Top 100 Stocks and Its Price (Current LTP and 5 Day Previos Price) will store all NSE Stocks Symbol in Database(using Mysql) .

algorithms algotrading bhavcopy bse dataanalysis equity kiteconnect multicharts nse quant-dev stock-market

Last synced: 28 Nov 2024

https://github.com/sumitgirwal/google-play-store-data-analysis

This Google Play Store dataset from Kaggle, analysis using Python, NumPy , Pandas , and Matplotlib.In dataset analysis, you can view the best app 5-star rating app, most review app, or most download app, etc.

dataanalysis matplotlib numpy pandas playstore-data-analysis python3

Last synced: 25 Nov 2024

https://github.com/nagipragalathan/animal_prediction

Animal Prediction is a web application developed using Python and Django that allows users to predict the animal species based on uploaded photos. The application utilizes machine learning algorithms to analyze the image data and provide accurate predictions.

animalclassification animalprediction animalrecognition computervision dataanalysis datascience deeplearning django githubproject imageclassification imageprocessing machinelearning opensource predictiveanalysis python speciesprediction webapplication

Last synced: 18 Dec 2024

https://github.com/newking9088/sql-guide-to-solve-complex-data-science-problems

This is a comprehensive SQL guide for both MySQL users and PostgreSQL users, covering topics from basic `SELECT` statements to advanced window functions. My SQL learning journey and the suggestions from my mentees, colleagues and seniors to document this were the motivations to write this document.

dataanalysis datascience-machinelearning mysql postgresql sql

Last synced: 27 Oct 2024

https://github.com/faizanmohd5/web-scraping-iphone-11-reviews

This is a web scraping project that extracts customer reviews for the iPhone 11 from Flipkart.com using Python and BeautifulSoup. The extracted data is saved in a CSV file for further analysis. Use it as a starting point for your own web scraping projects or for analyzing customer reviews of the iPhone 11.

beautifulsoup csv data-visualization dataanalysis dataextraction datainsights datamining datapreprocessing ecommerce-website ipython-notebook jupyter-notebook python reviews reviewscrapper webscraping

Last synced: 13 Nov 2024

https://github.com/omkarpattnaik8080/pandas

Using dataset from kaggle I am implementing basic and advanced panda operations to analyze data from the dataset

colab-notebook data-science dataanalysis jupyter-notebook machine-learning pandas python

Last synced: 14 Nov 2024

https://github.com/sarincr/data-visualizations-and-dashboards

Data visualization is an interdisciplinary field that deals with the graphic representation of data. It is a particularly efficient way of communicating when the data is numerous as for example a Time Series.

analytics artificial-intelligence big-data bokeh businessintelligence dashboard data-science dataanalysis dataanalytics datavisualization dataviz deep-learning machine-learning matplotlib plotly python python3 seaborn visualization

Last synced: 20 Nov 2024

https://github.com/kingabzpro/kaggle-competition-2020

In the past decade, computer science has evolved and the importance of Data Science has become the new norm, every company s looking to invest more in Machine learning. This is where someone like me who never had an Interest before got curious and started learning more about this world of Kaggle, it took me 3 months to get hold of some of the basic and advanced tools used by Data Scientists. I am using those same tools to evaluate this data set and come up with the best conclusion. In this Notebook, I will be telling you the story of data and I will be sharing my own experience so that any beginner can learn from my mistake and get ahead.

beginner-project data-visualization dataanalysis kaggle-competition learning python3

Last synced: 17 Nov 2024

https://github.com/sukanyabag/alexa-reviews-sentiment-analysis

This repository's notebook provides the insight of customer reviews on Amazon's kid Alexa, by data visualization, sentiment analysis and classification using NLP techniques.

dataanalysis datavisualization kaggle-dataset natural-language-processing sentiment-analysis sentiment-classification

Last synced: 10 Nov 2024

https://github.com/sumitgirwal/pokemon-dataset-data-analysis

This is a simple analysis of Pokemon dataset from Kaggle using Python , NumPy, Pandas, and Matplotlib.

dataanalysis kaggledatasets matplotlib numpy pandas-python pokemon-dataset python3

Last synced: 25 Nov 2024

https://github.com/emso-exe/venda_de_medicamentos_controlados_e_antimicrobianos_-_industrializados

Projeto de análise de vendas de medicamentos controlados por um período de 12 meses e perfil dos consumidores com base nos dados disponibilizados pela Anvisa.

analise-de-dados anvisa dataanalysis dataanalyst dataanalytics medicament medicamento medicamentos medicaments python python-3 python3

Last synced: 15 Nov 2024

https://github.com/arm-university/asp_smart-schools-on-arduino

Our Smart Schools resource provides accessible and engaging projects for teachers and learners that utilise the more advanced features of Arduino in real-world contexts.

arduino computerscience computing computingscience cs dataanalysis electronics embeddedsystems iot physicalcomputing pthon

Last synced: 29 Nov 2024

https://github.com/fortunewalla/airportdb

We used the http://flughafendb.cc/ airport database (small) and converted most of the fields to English.

airport airport-data airport-db airportdb dataanalysis database datascience flughafen flughafendb heatwave machinelearning mysql oracle sample-database sql sqlquery

Last synced: 04 Dec 2024

https://github.com/solrikk/datadigger

DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.

automation competitive-analysis content-analysis data data-automation data-processing dataanalysis excel excelize go golang goquery http scraper-tools web-crawler webscraping

Last synced: 08 Dec 2024

https://github.com/camara94/data-science

Bienvenu dans ce tutorie, aucours duquel nous allons découvrir la librairie pandas qui est l'une des libraire les plus importantes en python, lorsque nous voulons découvrir la data science. Avec cette librairie nous pouvons faire tout ce dont nous pouvons imaginer en data science en python

data-science dataanalysis pandas-dataframe python

Last synced: 05 Nov 2024

https://github.com/syedfaiqueali/the-sparks-foundation

Data Science and Business Analytics Tasks

data-science dataanalysis sparksfoundation-intern

Last synced: 17 Dec 2024

https://github.com/rushilsharma1/financialdataset_frauddetection_analysis

Analysis of the fraud detection from the financial dataset using sql workbench and command line

command-line dataanalysis datascience insights sql sqlquery sqlworkbench

Last synced: 09 Nov 2024

https://github.com/keanteng/kaggle_notebook

📚A repository that stores my notebooks on Kaggle and other learnings.

dataanalysis datascience kaggle learning notebook python r

Last synced: 02 Dec 2024

https://github.com/joyalshaji135/r-workshop

R programming is a popular language for data analysis and statistical computing. It offers a rich ecosystem of packages and libraries for a wide range of data-related tasks. In R, you can manipulate data frames, perform statistical analysis, create data visualizations, and generate reports with ease. The language's open-source nature and active com

dataanalysis datavisualization r

Last synced: 28 Nov 2024

https://github.com/shuklayash02/complete_data_analysis_project

A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 05 Nov 2024

https://github.com/sumitgirwal/super-market-research

This is simple research on supermarket.its must you know how your organization or corporation growth is increasing or decreasing.

dataanalysis matplotlib numpy pandas python3 supermarket-dataset

Last synced: 25 Nov 2024

https://github.com/mart-dore/datascience-portfolio

Complete data science analysis + API deployemeny of 'MPG car' dataset

api dataanalysis datascience flask machinelearning

Last synced: 09 Nov 2024

https://github.com/pavankethavath/microsoft-classifying-cybersecurity-incidents-with-ml

A machine learning pipeline for classifying cybersecurity incidents as True Positive(TP), Benign Positive(BP), or False Positive(FP) using the Microsoft GUIDE dataset. Features advanced preprocessing, XGBoost optimization, SMOTE, SHAP analysis, and deployment-ready models. Tools: Python, scikit-learn, XGBoost, LightGBM, SHAP and imbalanced-learn

classificationreport correlation-analysis dataanalysis decision-tree-classifier exploratory-data-analysis feature-engineering feature-selection gradientboosting hyperparameter-tuning joblib lgbmclassifier logistic-regression machine-learning modelselection pandas randomforestclassifier randomsearchcv shap smote xgboost-classifier

Last synced: 10 Dec 2024

https://github.com/aisurjyasamantaray/customer-purchase-analysis

This project involves an in-depth analysis of customer purchasing behavior and sales performance to drive business insights and strategies.

customer-purchasing customer-segmentation data-visualization dataanalysis dataanalysis-projects marketing-insights matplotlib numpy pandas python revenue-and-performance-analysis seaborn-plots

Last synced: 18 Nov 2024

https://github.com/sumitgirwal/drinks-dataset-data-analysis

A drinks dataset from Kaggle. Applying analysis using Python, NumPy, Pandas, Matplotlib.

dataanalysis drinks-dataset kaggle matplotlib numpy pandas python3

Last synced: 25 Nov 2024

https://github.com/reddyprasade/bicycle-sharing-system-in-us

A bicycle-sharing system, public bicycle system, or bike-share scheme, is a service in which bicycles are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" and return it at another dock belonging to the same system. Docks are special bike racks that lock the bike, and only release it by computer control. The user enters payment information, and the computer unlocks a bike. The user returns the bike by placing it in the dock, which locks it in place. Other systems are dockless. For many systems, smartphone mapping apps show nearby available bikes and open docks.

dataanalysis jupyter maplotlib numpy pandas python r

Last synced: 06 Dec 2024

https://github.com/sarincr/basics-of-julia-programming-language

Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.

data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning

Last synced: 20 Nov 2024

https://github.com/harmanveer2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

dataanalysis datetime matplotlib numpy pandas python seaborn

Last synced: 13 Nov 2024

https://github.com/kingabzpro/digital-learning-during-covid19-eda

In this project, we will be using data analysis tools to figure out trends in digital learning and how it is effective towards improvised communities. We will be comparing districts and states on factors like demography, internet access, learning product access, and finance.

covid-19 data-science dataanalysis eda education learnplatform usa

Last synced: 17 Nov 2024

https://github.com/shibam120302/data-scraper-swiggy-restaurants

A simple scraper to scrape restaurant data from swiggy

dataanalysis scraper selenium webscraping

Last synced: 20 Nov 2024

https://github.com/lisakey/datacamp-data-analyst-python-sql-projects

Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.

analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali

Last synced: 06 Nov 2024

https://github.com/mchenryspagg/creating-a-dimensional-data-model

This project involves creating a dimensional data model using MySQL Workbench for a car repair shop’s operations in western Canada by examining a sample invoice,

dataanalysis dimensional-modeling erdiagram mysql mysql-database salesanalysis

Last synced: 06 Dec 2024

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 19 Nov 2024

https://github.com/mishaa931/aircovid

This repository contains Python code using Plotly and Pandas to analyze airport traffic data during the COVID-19 pandemic. The code aims to provide visual insights into the impact of the pandemic on airport operations .

data-science dataanalysis datavisualization interactive-visualizations pandas python

Last synced: 12 Nov 2024

https://github.com/montanaz0r/imdb-ratings-auto-inserter

A Python script that enables auto-inserting movie ratings into the IMDB profile.

data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping

Last synced: 14 Dec 2024

https://github.com/trainingbypackt/beginning-data-science-with-python-and-jupyter-elearning

Perform reproducible data analyses with these data exploration tools

dataanalysis datascience html jupyter jupyter-notebook python

Last synced: 14 Nov 2024

https://github.com/san089/black-friday-sales-analysis

This Project gives an insight into few statistics related to black Friday Sale.

custom data dataanalysis insights sales statistics

Last synced: 16 Nov 2024

https://github.com/bhavik-jikadara/movie-recommendation-system

Recommendation systems are among the most popular applications of data science. They are used to predict the Rating or Preference that a user would give to an item.

data-science dataanalysis deep-learning machine-learning movie-recommendation movierecommendationsystem python

Last synced: 09 Nov 2024

https://github.com/hectorta1989/802.15.4-wireless-mac-level-performance

MAC-level performance and parameters dataset for 802.15.4 wireless networks.

802154 dataanalysis datasets layer2 lr-wpan macprotocol python3 wireless

Last synced: 10 Nov 2024

https://github.com/anoopgeorge418/end2end-datascience

This repository is a treasure trove of data science projects that showcase my learning journey. Here, you'll find a variety of projects covering the entire data science lifecycle—from data cleaning and exploration to machine learning, model evaluation, and deployment. Each project is designed to tackle real-world problems, with detailed explanation

dataanalysis datacollection datascience flask machine-learning python

Last synced: 10 Nov 2024

https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis

Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.

data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn

Last synced: 21 Dec 2024

https://github.com/pavankethavath/car_dekho_car_price_prediction

A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.

dataanalysis datacleaning datapreprocessing eda encoding feature-extraction feature-selection featureimportance fine-tuning machine-learning minmaxscaling normalization pandas pickle prediction-model python random-forest randomsearch-cv regression streamlit

Last synced: 10 Dec 2024

https://github.com/shoaib-33/dhaka-city-rent-analysis

A project to visualize the rent analysis of entire Dhaka city

data-science dataanalysis datavisualization selenium tableau webscraping

Last synced: 13 Nov 2024

https://github.com/akashash01/hr-analytics

Power BI report created for Job application tracking(HR analysis) under certain requirements. note: Flat file converted into Star Schema(Data modelling).

dashboard dataanalysis powerbi starschema

Last synced: 14 Dec 2024

https://github.com/akashash01/restaurant-analysis

Power BI report created for Restaurant sales analysis under certain conditions and requirements.

dashboard dataanalysis dax grouping-and-summarizing powerbi sales

Last synced: 14 Dec 2024

https://github.com/tanay-dwivedi/netflix-dataset-data-analysis

This project involves analyzing a Netflix dataset to derive insights on show ratings, release trends, durations, geographical distribution, and content types, facilitating strategic decision-making and content curation efforts.

dataanalysis netflix python seaborn visualization

Last synced: 07 Nov 2024

https://github.com/akashash01/kpi-analysis

Power Bi report created for KPI analysis of an Industrial machines under certain requirements.

dashboard dataanalysis dax kpi-report powerbi

Last synced: 14 Dec 2024

https://github.com/akashash01/performance-report

This is an simple Data analysis report shows the Annual Performance rate of an Employee.

dataanalysis dax-languague excel performance-analysis powerbi

Last synced: 14 Dec 2024

https://github.com/tanay-dwivedi/marathon-running-data-analysis

The project involves exploratory data analysis of marathon race data to uncover patterns, explore relationships between variables, and generate insights aimed at optimizing athlete performance and race organization strategies.

dataanalysis dataset marathon matplotlib-pyplot python seaborn visualization

Last synced: 07 Nov 2024

https://github.com/akashash01/energy-consumption-report

Analyzed the energy consumption data on different attributes over the years on USA.

dataanalysis datavisualization dax energy-consumption excel powerbi

Last synced: 14 Dec 2024

https://github.com/akashash01/coffee_sales_analysis

Analyzed the coffee sales data using My SQL and retrieved Business related insights.

dataanalysis mysql sales-analysis

Last synced: 14 Dec 2024