Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/dionixius7/titanic-disaster-ml-model

This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances

data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction

Last synced: 17 Nov 2024

https://github.com/l2nce/datamining-study

Introduction to data mining

data-analysis data-mining matplotlib numpy panda

Last synced: 11 Nov 2024

https://github.com/madhursinghbhadoriya/data_analysis_fifa-players

• Using NumPy, Matplotlib, Pandas, etc processed important Information and Characteristic traits on Jupyter Notebook.

analysis data-analysis data-science graphs jupyter-notebook pandas python

Last synced: 05 Nov 2024

https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau

• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.

data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop

Last synced: 05 Nov 2024

https://github.com/changyeop-yang/study-datasciencefoundation

Big Data Science and its Analytics plays a major role in this decade. How to clean and prepare your data for analysis is still a challenge, like How to perform basic visualization of your data, How to model your data, How to curve-fit your data, And finally, how to present your findings and wow the audience

data-analysis ios kyungpook-national-university swift

Last synced: 06 Nov 2024

https://github.com/tsffarias/my-books

Exploratory analysis of my Dataset 'All_the_Books_I_read' which contains all the books I've read

books data-analysis python tableau

Last synced: 05 Nov 2024

https://github.com/virajbhutada/diamond-price-estimator

This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.

cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface

Last synced: 11 Nov 2024

https://github.com/virajbhutada/hr-analytics-excel-sql-tableau-powerbi

Explore a comprehensive HR Analytics portfolio showcasing data analysis and visualization skills. Featuring dashboards in Power BI, Excel, and Tableau, along with SQL queries for deeper insights. A holistic view of expertise in HR analytics, data visualization, and database management. Let's dive into the game of data insights!

data-analysis data-management data-visualization excel hr-analytics interactive-dashboards portfolio-project postgresql powerbi powerbi-visuals sql sql-queries tableau tableau-public

Last synced: 11 Nov 2024

https://github.com/anjalikumari021/sports_data_analysis_using_excel

Analyzed Sports data and prepared advanced dashboard using MS Excel.

data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting

Last synced: 14 Nov 2024

https://github.com/aniketmondal/dataanalysis

Contains cleaning, transformation, and exploratory analysis of various data sets using Python Pandas, NumPy, re, random, etc.

analysis data-analysis data-science pandas python

Last synced: 07 Nov 2024

https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters

Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.

data-analysis data-science data-visualization machine-learning pandas

Last synced: 05 Nov 2024

https://github.com/camara94/data_analyse_series_temporelles

Dans ce tutoriel, nous allons répondre aux questions suivantes: 1. Lire les données Microsoft à l'aide du package **Pandas Data reader** 2. Obtenez le **prix maximum** de l'action de **2017 à 2022** 3. Quelle est la **date du cours le plus élevé** de l'action ? 4. Quelle est la **date du cours le plus bas** de l'action ?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 05 Nov 2024

https://github.com/cassiofb-dev/fide-rating-analysis

The plot speaks for itself

chess data-analysis fide hans rating

Last synced: 07 Nov 2024

https://github.com/greenpau/esqrunner

Run Elasticsearh queries and create metrics based on the result of the queries in Elasticsearch database.

data-analysis elasticsearch query-builder querydsl

Last synced: 13 Oct 2024

https://github.com/steviecurran/gbt-scripts

IDL scripts for the reduction of Green Bank Telescope data

data-analysis data-compression data-visualization radio-astronomy spectroscopy

Last synced: 14 Nov 2024

https://github.com/airscholar/data_analysis_with_ai

A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python

chatgpt data-analysis gpt4 openai pandas pandasai python

Last synced: 14 Nov 2024

https://github.com/steviecurran/prediction-plot

C (pgplot) code to plot predicted versus measured values (generated from Tensorflow)

astrophysics data-analysis high-redshift machine-learning statistics tensorflow visualization

Last synced: 14 Nov 2024

https://github.com/tomijuarez/lemmatisation

Lemmatisation fully implemented in Java.

algorithms data-analysis data-science java-8 lemmatization oop

Last synced: 05 Nov 2024

https://github.com/kishlayjeet/zomato-data-exploration

In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.

data-analysis eda zomato-data-exploration

Last synced: 06 Nov 2024

https://github.com/manikantasanjay/time_series_data_analysis_on_stocks

Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.

data-analysis pandas stock time-series time-series-analysis

Last synced: 05 Nov 2024

https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn

Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation

data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis

Last synced: 05 Nov 2024

https://github.com/malucor/analise_exploratoria_dados

Programa em Python para fazer uma Análise Exploratória de Dados de Logística.

analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis ebac exploratory-data-analysis python

Last synced: 09 Nov 2024

https://github.com/malucor/livros

Programa em Python para fazer uma análise de dados sobre livros, a partir de um arquivo Excel.

analise-de-dados book books bookshelf data-analysis livro livros python

Last synced: 09 Nov 2024

https://github.com/hayatiyrtgl/data_analysis_project

Financial data analysis: preprocess, visualize, calculate technical indicators.

data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis

Last synced: 05 Nov 2024

https://github.com/bhushan148/finance-domain-bank-loan-report-tableau

I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.

bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau

Last synced: 05 Nov 2024

https://github.com/ymorsi7/caliwageanalysis

California employment and wage analysis on data from the past decade.

data-analysis data-science ipynb jupyter-notebook

Last synced: 05 Nov 2024

https://github.com/aleskandro/r-hadoop-madreduce-examples

A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - [email protected] - Advanced Programming Languages

data-analysis hadoop mapreduce r

Last synced: 07 Nov 2024

https://github.com/noturlee/iris-dataanalyis

This project aims to classify Iris flowers into three species—setosa, versicolor, and virginica—based on their sepal and petal measurements using machine learning techniques. The dataset comprises 150 samples evenly distributed among these species

data-analysis data-modeling data-science data-structures-and-algorithms data-visualization

Last synced: 04 Nov 2024

https://github.com/mosalem149/pythonutilities

A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.

data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting

Last synced: 10 Nov 2024

https://github.com/tralahm/parliament-2017-dataset

Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition

csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek

Last synced: 05 Nov 2024

https://github.com/mehrab-kalantari/olympics-data-analysis

A streamlit application to analyze the Olympics dataset from several views

data-analysis streamlit-dashboard streamlit-webapp

Last synced: 15 Nov 2024

https://github.com/jasoncobra3/whatsapp_chat_analyzer

WhatsApp Chat Analyzer is a powerful tool that provides insightful analytics from your WhatsApp conversations. Whether you're curious about your chatting habits, want to analyze group dynamics, or need to extract meaningful data from your conversations, this tool has got you covered!

data-analysis data-science data-visualization machine-learning streamlit streamlit-webapp whatsapp-chat whatsapp-chat-analyzer

Last synced: 31 Oct 2024

https://github.com/riborings/python_projects

Python projects and other programming experiences

data-analysis machine-learning project python regression-analysis

Last synced: 30 Oct 2024

https://github.com/riborings/uranouchi42microdiversity

In this repository live the bash, R and Julia scripts used to explore the microdiversity of the prokaryotic community at Uranouchi Inlet (42-sample time-series) by means of metagenomic shotgun sequencing under the supervision of the Ogata Lab.

big-data data-analysis data-visualisation diversity-analysis marine-ecology marine-ecosystem metagenomics microbiome-analysis prokaryotic-genomes

Last synced: 30 Oct 2024

https://github.com/maxwelllzh/linearizer

Linearizing parameters for linear regression

data-analysis machine-learning scikit-learn

Last synced: 04 Nov 2024

https://github.com/mayankyadav23/air-bnb-data-analysis

Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.

advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt

Last synced: 11 Nov 2024

https://github.com/paul0vinicius/ad2

Repositório da disciplina de Análise de Dados 2 (Data Analysis II)

data-analysis data-science

Last synced: 23 Oct 2024

https://github.com/jm199504/data-analysis-practice

数据分析练习(Titanic / BankCustomers)

data-analysis python

Last synced: 30 Oct 2024

https://github.com/doughtnerd/pod-old

Read and write Excel data

data data-analysis excel poi-library workbook

Last synced: 04 Nov 2024

https://github.com/lucycatherine/healthinsuranceproject

This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.

data-analysis data-science data-visualization jupyter-notebook machine-learning python

Last synced: 31 Oct 2024

https://github.com/balajimohan18/sql-projects

The repository contains Structured Query Language (SQL) Scripts. The Multiple SQL scripts for various projects which includes data cleaning, data pre-processing, data processing, data transformation and insights gaining through Query Language

data-analysis data-mining data-science eta microsoft-sql-server query-language sql sql-server sql-server-management-studio sqlqueries

Last synced: 12 Oct 2024

https://github.com/tsbarr/toronto-open-data

Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses

api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics

Last synced: 31 Oct 2024

https://github.com/namratha2301/python-dashboard-streamlit

Experimenting with Streamlit. Streamlit app provides an interactive visualization of the best-selling books, showcasing trends, top-selling books, top authors, genre distributions, and sales by decade.

css dashboard data-analysis pandas plotly python seaborn streamlit

Last synced: 04 Nov 2024

https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment

Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.

data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis

Last synced: 31 Oct 2024

https://github.com/karlyndiary/smartphone-price-analytics

A data pipeline for analyzing smartphone pricing by retrieving data from Flipkart using RapidAPI, transforming it, and visualizing insights using SQL Server and Excel.

beautifulsoup data-analysis data-pipeline data-visualization data-visualization-dashboard etl microsoft microsoft-excel microsoft-sql-server python smartphone-price-analysis

Last synced: 12 Oct 2024

https://github.com/kislerdm/github-repo-details

Application and library to fetch open source libraries details from github to perform due diligence

data-analysis golang opensource

Last synced: 11 Nov 2024

https://github.com/ahmedtaher10/covid-19-cases

The data we are using contains the data on covid-19 cases and their impact on GDP from December 31, 2019, to October 10, 2020.

data-analysis python visualization

Last synced: 04 Nov 2024

https://github.com/matteospanio/speed-analysis

A project to analyze the internet speed

bash-script data-analysis

Last synced: 09 Nov 2024

https://github.com/devlucho/modelos-predictivos

Modelos predictivos utilizando los algoritmos de Regresión Lineal, Regresión Logística y Árboles de Decisión.

data-analysis jupyter-notebook python3

Last synced: 31 Oct 2024

https://github.com/casassg/ms_thesis

Social Media Analysis for Crisis Informatics in the Cloud

casassg-thesis data-analysis google-cloud kubernetes

Last synced: 29 Oct 2024

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 Nov 2024

https://github.com/dissorial/prx21_erikz

Analysis of self-tracked data: interactive visualizations & predictive algorithms

analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization

Last synced: 30 Oct 2024

https://github.com/teamtigers/echartify

A web application built with .net core 2.2 that has come with the idea of reading the National Election's Data-set of Bangladesh in a fastest possible time and then representing the data-set with different statistical charts.

bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages

Last synced: 16 Nov 2024

https://github.com/avratanubiswas/fluorpenplugin

A matlab user interface for analysing OJIP curve datasets from FluorPen instrument. That is, serving as an additional plug in for "quick categorical analysis".

data-analysis fluorpen ojip-curve

Last synced: 03 Nov 2024

https://github.com/rainbowatcher/simple

Make data work easier, saving your working time

bigdata data-analysis etl

Last synced: 05 Nov 2024

https://github.com/colindean/allegheny_voter_reg_analysis

Allegheny County Voter Registration Analysis Tools

data-analysis data-science elections pandas polars python voting

Last synced: 29 Oct 2024

https://github.com/myles/notebooks

Some of my random Jupyter Notebooks.

data-analysis data-science jupyter-notebooks

Last synced: 29 Oct 2024

https://github.com/georgiifirsov/educational-research-work

Educational research project on 3rd year (6th semester). Topic: ARMA models in time series analysis

arma data-analysis jupyter-notebook python time-series time-series-analysis tsa

Last synced: 06 Nov 2024

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 03 Nov 2024

https://github.com/idaraabasiudoh/credit_card_fraud_detection

This repository contains a machine learning project focused on detecting credit card fraud using Decision Tree and Support Vector Machine (SVM) classifiers.

data-analysis jupyter-notebook machine-learning python3 scikit-learn snapml

Last synced: 03 Nov 2024

https://github.com/engineertolulope/us_states_living_ranking_analysis

Python script for analyzing and ranking U.S. states based on factors like cost of living, tax burden, diversity, crime rates, and climate. Uses weighted criteria to identify the best states to live in according to these metrics. Ideal for decision-making on relocation.

data-analysis data-science linear-regression machine-learning python scikit-learn

Last synced: 03 Nov 2024

https://github.com/jweinst1/xenon

A processing based language

data-analysis interpreter reactive-programming

Last synced: 08 Nov 2024

https://github.com/mikeesto/ausvotes19

:bird: A collection of 67,284 public tweets published on the night of the 2019 Australian election

australia data-analysis data-visualization elections open-data twitter

Last synced: 31 Oct 2024

https://github.com/nakshjainsonigara/vba-canteenmanagementsystem

The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.

canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word

Last synced: 05 Nov 2024

https://github.com/achique-luisdan/tops-songs-db

Base de datos de Tops Semanales de Canciones🎵 más reproducidas en Spotify🎶. Prácticas de SQL enfocadas en el Análisis de Datos (Data Analysis).

data-analysis plpgsql sql

Last synced: 11 Nov 2024

https://github.com/mchenryspagg/investigate_a_dataset

This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.

data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization

Last synced: 08 Nov 2024

https://github.com/karlyndiary/coffee-shop-sales-analysis

Analyzing coffee shop sales using Pandas for data cleaning and exploratory data analysis (EDA), and Streamlit for data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 02 Nov 2024

https://github.com/eve-ning/ppshift

Analyzes maps and scores from 2015

data-analysis data-mining osu osugame

Last synced: 27 Oct 2024

https://github.com/remram44/apex-legends-ocr-data

Get data from Apex Legends streams using OCR

apex-legends data-analysis video-games

Last synced: 25 Oct 2024

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 27 Oct 2024