Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/misaghmomenib/data-analysis-projects

A Repository Featuring a Collection of Data Analysis Projects, Showcasing Various Techniques and Tools for Extracting Insights From Data. Explore, Learn, and Utilize These Projects to Enhance Your Data Analysis Skills and Workflows.

data-analysis data-analysis-python data-visualization jupyter-notebook python

Last synced: 21 Jan 2025

https://github.com/quantumudit/analyzing-whiskyexchange-whisky

This project scrapes data on Japanese whisky from The Whisky Exchange website, applies the necessary transformations to the scraped data, and then analyzes and visualizes it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 17 Feb 2025

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 11 Nov 2024

https://github.com/scottgriv/river-charts

🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).

api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp

Last synced: 14 Dec 2024

https://github.com/theengineeringworld/numpy-data-science

NumPy Data Science Essential Training Course. Part of a YouTube course offered by TheEngineeringWorld.

data-analysis data-science numpy numpy-exercises numpy-library numpy-tutorial python python-3-6 python3 scipy2018

Last synced: 08 Nov 2024

https://github.com/abhash-rai/traffic-image-classifier

A web-based solution that uses a TensorFlow model to classify traffic conditions, built with ReactJS and a FastAPI backend.

cnn cnn-classification cnn-keras cnn-model data-analysis data-science data-visualization fastapi keras keras-tensorflow python python-3 python3 react reactjs tensorflow traffic traffic-classification transfer-learning

Last synced: 16 Jan 2025

https://github.com/davidzajac1/reptoro

A Data Visualization and Analytics Platform for the Reptile Industry

analytics data-analysis data-visualization plotly-dash python

Last synced: 11 Feb 2025

https://github.com/sondosaabed/introduction-to-data-analysis-with-pandas-and-numpy

Learning the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. Working with data in Python using libraries like NumPy and pandas.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 06 Nov 2024

https://github.com/srinivasrm/mutual-funds-analysis-and-prediction

In this project I analyzed and predicted 1-, 3-, and 5-year returns for 1064 mutual funds in India. I scraped the data from a website that is the most visited site for mutual fund investments. I tested several regression models: linear regression, SGD Regressor, Random Forest Regressor, Decision Tree Regressor, Ridge, MLP Regressor, and Lasso. I then selected the best-performing model, tuned its hyperparameters, and deployed an interactive application that generates the visualization and emails it to the user's email address.

beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication

Last synced: 12 Feb 2025

https://github.com/rikard-helgegren/leverage_analysis_tool

Analyst tool for portfolio construction. How can leveraged certificates be used to increase returns in a portfolio while keeping the risk as low as possible? Use the tool and find out.

cpp data-analysis investment kivy-framework python3

Last synced: 14 Oct 2024

https://github.com/negativenagesh/spam-ham_email_detection_machine_learning

This project classifies spam/ham emails using machine learning algorithms such as logistic regression (LGR), naive Bayes (NB), random forest (RF), and decision trees (DT). Based on the accuracy and precision scores, I chose logistic regression for the classification. The frontend is built with Streamlit.

app data-analysis data-cleaning data-engineering data-science data-visualization data-visualizations jupyter-notebook logistic-regression machine-learning modeling naive-bayes-classifier nlp python

Last synced: 23 Dec 2024

https://github.com/martinthoma/bad-stats

Examples of how not to do statistics / visualizations

data-analysis statistics visualizations

Last synced: 03 Feb 2025

https://github.com/alyssonmach/9-data-science-apps-with-python

[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.

data-analysis data-science data-visualization machine-learning streamlit-webapp

Last synced: 06 Nov 2024

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 11 Nov 2024

https://github.com/coderjolly/football-player-prediction

Intense transfer speculation surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. This project therefore predicts a player's market value using data provided in CSV format.

data-analysis data-visualization decision-tree-regression machine-learning xgboost-regression

Last synced: 02 Feb 2025

https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python

📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.

data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne

Last synced: 13 Dec 2024

https://github.com/globeandmail/startr-cli

A command-line scaffolder for the startr R project template

data-analysis data-journalism data-visualization journalism r

Last synced: 26 Dec 2024

https://github.com/jimmymugendi/email-sms-spam_classifier

Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.

data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library

Last synced: 16 Jan 2025

https://github.com/quantumudit/regional-sales-analysis

This project analyzes and visualizes United States regional sales for a fictitious company between 2018 and 2020 using Python and Power BI.

data-analysis data-visualization databases jupyter-notebook power-bi python sqlite

Last synced: 17 Feb 2025

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and data visualization of countries' GDP, life expectancy comparison across continents, GDP per capita relative growth, population relative growth comparison, etc., using Pandas and Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 15 Nov 2024

https://github.com/afsalashyana/whatsapp-chat-analyzer

Analyze WhatsApp chats with beautiful graphs. Written in JavaFX

data-analysis data-visualization javafx javafx-14 javafx-application whatsapp

Last synced: 31 Dec 2024

https://github.com/super-lou/exstat

🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series

Last synced: 23 Dec 2024

https://github.com/yusufcinarci/web-scraping-projects

In these project files, I will host the web scraping examples that I will make day by day.

data-analysis data-science jupyter-notebook python web-scraping

Last synced: 26 Dec 2024

https://github.com/depressioncenter/mden

Mobile technologies code from the University of Michigan's Mobile Data Experts Network (MDEN), featuring data cleaning automations, REDCap project templates, and links to useful external modules. [DOI: 10.6084/m9.figshare.25438714]

automation data-analysis data-cleaning fitness-tracker heart-rate-data mobile-data mobile-development mquery powerautomate powerbi powerquery python r sleep-data smartwatch-data tableau

Last synced: 25 Nov 2024

https://github.com/jonzeolla/lab-securitydataanalysis

An introductory lab to Security Data Analysis (using Apache Metron (incubating)).

apache-metron data-analysis lab metron security

Last synced: 18 Nov 2024

https://github.com/sonigarima/donation-management-system

A donation management system for NGOs and Donors. The project is designed for Cognizance IITR 2021 - Salesforce Codathon.

data-analysis donation-management reactjs

Last synced: 02 Jan 2025

https://github.com/sayakpaul/analysis-of-college-database-of-2017-passouts

Contains my analysis of a database containing information about the students of an engineering college.

data-analysis data-visualization matplotlib python-3

Last synced: 08 Feb 2025

https://github.com/thecoderpinar/earthquake_prediction_analysis_project

🌍 Welcome to the Earthquake Prediction Analysis Project! 🚀 This project aims to predict earthquake magnitudes using LSTM neural networks and analyze seismic data. Explore, analyze, and forecast earthquakes with ease! 📈🔮

analysis data-analysis data-science earthquake-prediction geocoding geology lstm lstm-neural-networks machine-learning matlab matlab-deep-learning open-source time-series visualization

Last synced: 16 Dec 2024

https://github.com/ahammadmejbah/different-types-of-data-splitting-methods

In order to prevent overfitting and guarantee that our model can generalize to new data, data splitting is essential in machine learning.

data data-analysis data-engineering data-mining data-science data-visualization

Last synced: 09 Jan 2025

https://github.com/psyteachr/quant-fun-v2

Fundamentals of Quantitative Analysis

data-analysis data-visualization datawrangling r statistics

Last synced: 29 Jan 2025

https://github.com/nafisalawalidris/911-call-analysis

The 911 Call Analysis project explores and visualizes emergency call data to uncover patterns and trends. It includes data preparation, exploratory analysis, visualization of call volume and reasons, and heatmap generation. Users can customize the code for their own dataset. The project relies on libraries like Pandas, NumPy, Matplotlib, Seaborn, and SciPy.

cluster-analysis data-analysis data-visualization decision-making emergency-calls emergency-services exploratory-data-analysis heatmaps matplotlib numpy pandas patterns-and-trends resource-allocation scipy seaborn

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/predicting-credit-card-approvals

Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.

approval-prediction credit-card data-analysis data-preprocessing feature-engineering hyperparameter-optimization libraries logistic-regression machine-learning model-evaluation model-training python python3

Last synced: 23 Jan 2025

https://github.com/sowinskibraeden/schedulegeneratorapp

The Desktop Application for my schedule-generator algorithm, allowing users to easily interact with the algorithm and its variables to generate schedules as documents for students individually as well as the master timetable

algorithm csv data-analysis dataclasses python-docx python-typing python311 xlsxwriter

Last synced: 20 Nov 2024

https://github.com/fatihilhan42/data-science-projects

This repo contains data science projects ranging from beginner to upper level. I will add the projects I complete in this field to the repo every day, in the hope that they will be useful to those who, like me, are interested in data science and are just starting out...

data-analysis data-engineering data-mining data-science data-structures data-visualization database datascience fatihilhan fortytwo fortytwofficial jupyter-notebook python

Last synced: 29 Jan 2025

https://github.com/maksimekin/umd_data_challange_2020

Ocean cleanup data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.

cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd

Last synced: 15 Dec 2024

https://github.com/ogoodness/vbreaker-js

CSC 483 Project - Ciphers: Caesar, Multiplicative, Affine, Vigenere, Hill, Columnar Transposition

affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher

Last synced: 14 Nov 2024

https://github.com/astrodynamic/retailanalitycs-in-postgresql

Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.

bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views

Last synced: 12 Jan 2025

https://github.com/stimulsoft/stimulsoft.dashboards.php

Dashboards.PHP is a complete software package for designing and viewing dashboards. It includes the JS data analysis engine, dashboard designer, and viewer. Supports PHP 5, PHP 7, and PHP 8.

charts dashboard-builder dashboards data-analysis data-grid data-visualization datatable dynamic-dashboard interactive-dashboards live-data mysql-data php php-bi-tools php-dashboard php-kpi php7 php8 pivot-tables sql-datasources statistics

Last synced: 30 Jan 2025

https://github.com/waveform80/structa

A small utility for analyzing data structures (e.g. JSON files)

csv data-analysis data-visualization datajournalism datawrangling json yaml

Last synced: 01 Jan 2025

https://github.com/orkunaktas/sofascore-webscraping

⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️

beautifulsoup data-analysis football-analytics football-data selenium webscraping

Last synced: 13 Feb 2025

https://github.com/shervinnd/bazar_app_store_eda

Bazar App Data analysis code to find the most downloaded category and most popular installed apps

data data-analysis data-science dataanalysis eda python

Last synced: 18 Jan 2025

https://github.com/ethan-wickstrom/rrrs

Welcome to RRRS, a rapid, hyper-optimized CSV random sampling tool designed with performance and efficiency at its core. Crafted meticulously in Rust, RRRS offers an unparalleled solution for extracting random data samples from CSV files swiftly and effortlessly.

analytics cli command-line command-line-tool data data-analysis data-science dataset rust rust-lang sample samples

Last synced: 19 Nov 2024

https://github.com/negativenagesh/whatsapp_chat_analyzer

This is an end-to-end WhatsApp chat analysis project. I used my hostel WhatsApp group's chat data.

data-analysis data-science frontend modeling python streamlit

Last synced: 23 Dec 2024

https://github.com/pythondeveloper6/store-sales-eda

Simple EDA with some insights on store sales

data-analysis eda matplotlib numpy pandas seaborn

Last synced: 19 Jan 2025

https://github.com/lit26/trump_tweet_analysis

Analysis of Trump's original tweets.

data-analysis lda-model topic-modeling

Last synced: 30 Jan 2025

https://github.com/mohammadkarbalaee/python-for-data-analysis-book

All the practice code I write while reading the book Python for Data Analysis

data-analysis data-science python

Last synced: 01 Feb 2025

https://github.com/super-lou/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 15 Feb 2025

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 24 Nov 2024

https://github.com/benjamindpb/wikidata-preprocessing

Wikidata dump preprocessing & analysis of georeferenced entities

data-analysis preprocessing wikidata wikidata-dump

Last synced: 19 Jan 2025

https://github.com/nmicht/food-language-variety

A map to display distribution of food names using their synonyms in different regions.

data-analysis language-distribution language-variety map

Last synced: 13 Feb 2025

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 15 Nov 2024

https://github.com/surajv311/data_analysis-food_recipes_ds

Data preprocessing, cleaning, analysis & plotting 📊 of the Food Recipes Dataset (from Kaggle). 🐍 Libraries used: Pandas, Matplotlib, Seaborn, Plotly. 📈

data-analysis kaggle-dataset matplotlib numpy pandas plotly seaborn

Last synced: 21 Jan 2025

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 23 Jan 2025

https://github.com/arnavk-09/world-population

🌏 Taipy demo to explore world population data...

data-analysis python taipy world-population

Last synced: 06 Feb 2025

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 08 Feb 2025

https://github.com/darsan-in/rumour-monger-spotter

Rumour Monger Spotter is a prototype developed during a national-level cyber hackathon to identify false information on Twitter. Using the Google Fact Check API and a Multinomial Naive Bayes classifier, the tool analyzes tweet content to assess the likelihood of misinformation. Despite a development window of less than 24 hours, the project won a t

ai data-analysis fact-checking hackathon india naive-bayes national-competition natural-language-processing prototype real-time-analysis social-media text-classification tweet-content twitter

Last synced: 12 Dec 2024

https://github.com/ndleah/self-quantified

😊 EDA, self-quantified data analysis

data-analysis personal-data-analysis r self-quantify

Last synced: 12 Jan 2025

https://github.com/gher-uliege/seadatacloud

Tools and interfaces for working with the DIVA interpolation software.

data-analysis data-visualization interpolation nco netcdf ocean-sciences oceanography

Last synced: 05 Feb 2025

https://github.com/andreip/twitter-authorities

Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

data-analysis mongodb python twitter-topics

Last synced: 12 Feb 2025

https://github.com/pnnl/archive_walker

Archive Walker Software to read and examine PMU data to detect events and conditions for further analysis.

data-analysis pmu synchrophasor

Last synced: 25 Jan 2025

https://github.com/c0deta1ker/arpescape

ARPEScape is a MATLAB-based app that contains a set of tools and functions for analysing the electronic structure of materials using photoelectron spectroscopy (PES) techniques, such as X-ray photoelectron spectroscopy (XPS) and angle-resolved photoelectron spectroscopy (ARPES).

analysis analysis-package angle-resolved-photoemission angle-resolved-spectroscopy arpes condensed-matter-physics data-analysis lcn matlab photoelectron-spectra photoelectron-spectroscopy photoemission psi sls ucl xps

Last synced: 30 Nov 2024

https://github.com/lussierc/foodborneillnessdataanalysis

A data analysis of foodborne illnesses using R Scripting methods.

data-analysis database foodborne-disease-outbreaks foodborne-illnesses rstudio

Last synced: 01 Feb 2025

https://github.com/splch/qbs

An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.

classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling

Last synced: 07 Feb 2025

https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql

The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.

data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server

Last synced: 30 Jan 2025

https://github.com/nasdin/drone-strike-visualization

Connecting to the Drone Strike API and analyzing the frequency of drone strikes, trends, and patterns

analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization

Last synced: 19 Feb 2025

https://github.com/neutrinoceros/gpgi

A Generic Interface for Grid + Particle data

data-analysis grid particles performance

Last synced: 16 Nov 2024

https://github.com/agungbudiwirawan/socioeconomic_analysis

The objective of this project is to analyze socio-economic data for Chicago.

chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server

Last synced: 02 Dec 2024

https://github.com/ascender1729/iris-flower-classification-2024

An exploratory data analysis and machine learning project using the Iris dataset to classify flower species with a K-Nearest Neighbors classifier. It includes data visualization, feature scaling, model training, and evaluation with 100% accuracy on the test set.

classification data-analysis iris-dataset k-nearest-neighbors machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 06 Feb 2025

https://github.com/thecoderpinar/earthquake-explorer

🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.

data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data

Last synced: 16 Dec 2024