Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/thecoderpinar/hms-brainactivity-analysiss

Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.

critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing

Last synced: 16 Dec 2024

https://github.com/juliusmarkwei/concrete-data

Data analysis, machine learning, model evaluation and optimization on the Concret_ Dataset

data-analysis data-science data-visualization ensemble-learning machine-learning modeling

Last synced: 01 Jan 2025

https://github.com/jshinm/web-scrapper

Web Scrapper used to extract NeuroData github repo stats

data-analysis web-scraping

Last synced: 17 Dec 2024

https://github.com/freekatz/english-reading

考研英语(10-19)数据集及相关数据分析

data-analysis dataset

Last synced: 02 Feb 2025

https://github.com/vandita2020/merra2_nasa_wind_speed_analysis

In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms

data-analysis data-science machine-learning-algorithms python

Last synced: 11 Jan 2025

https://github.com/gustavohnsv/teamwork_mqa

Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.

data-analysis group-project r team-repo

Last synced: 16 Dec 2024

https://github.com/tathithienthanh/datamining-banking-dataset

Implement some learned data mining techniques and predict if the client will subscribe to a term deposit

apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization

Last synced: 25 Jan 2025

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn

Last synced: 09 Dec 2024

https://github.com/poga/dat-ipynb-demo

use ipython notebook to analyze data in dat archive

dat data-analysis distributed jupyter-notebook

Last synced: 15 Dec 2024

https://github.com/nhsdigital/sde_example_analysis

Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.

data-analysis data-science databricks-notebooks machine-learning mlflow

Last synced: 23 Dec 2024

https://github.com/atxtechbro/flightradar24

Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.

api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping

Last synced: 25 Jan 2025

https://github.com/seyedhosseinzadeh/ws_tm

Weather web scraping and Time series model to predict temperature, humidity and barometer

data-analysis deep-learning lstm-model machine-learning prediction prediction-model weather web-scraping

Last synced: 10 Jan 2025

https://github.com/kenvilar/data-analysis-using-python

Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 18 Nov 2024

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 05 Feb 2025

https://github.com/casualcomputer/sql.mechanic

Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).

data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server

Last synced: 04 Dec 2024

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 02 Jan 2025

https://github.com/yusufcinarci/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19-data-visualization data-analysis data-science data-visualization

Last synced: 26 Dec 2024

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 26 Dec 2024

https://github.com/viper373/baidutieba

爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)

baidutieba-crawler bert data-analysis deep-learning python spider

Last synced: 05 Feb 2025

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 16 Dec 2024

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 16 Dec 2024

https://github.com/mynenik/xyplot-win32

XYPLOT Plotting and Data Analysis Program for 32-bit Windows

cpp data-analysis data-manipulation data-visualization forth mfc windows-app

Last synced: 24 Jan 2025

https://github.com/vitia-fritelle/ipynb_converter

Jupyter notebook to Python file conversor

data-analysis data-science jupyter-notebook python

Last synced: 16 Dec 2024

https://github.com/savinrazvan/degrees

A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.

actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon

Last synced: 10 Jan 2025

https://github.com/nafisalawalidris/northwind-traders-sales-analysis

Northwind Traders Sales Analysis project, which analyses sales data for a fictitious company. It utilises the Northwind Database and includes SQL queries to provide insights on employees, products, suppliers and revenue. The project aims to help the company gain valuable information for business decision-making.

business-insights data-analysis database northwind-traders sales sql

Last synced: 23 Jan 2025

https://github.com/nafisalawalidris/analyzing-nobel-prize-dataset-demographics-and-trends

This project analyses a Nobel Prize dataset using Python and data analysis libraries. It explores the distribution of winners by category and country, examines the proportion of female winners over time, investigates the age of winners when they received the prize and identifies the oldest and youngest recipients.

age-at-award country-distribution data-analysis data-manipulation dataset demographics filtering gender-balance grouping nobel-prize notable-laureates python trends visualisation winners

Last synced: 23 Jan 2025

https://github.com/chen0040/pyspark-advanced-algorithms

Samples of Advanced Algorithms and Data Analysis implemented in pyspark

advanced-algorithms data-analysis map-reduce pyspark

Last synced: 16 Dec 2024

https://github.com/luminati-io/airbnb-dataset-samples

A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.

airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping

Last synced: 23 Jan 2025

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 23 Dec 2024

https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 23 Dec 2024

https://github.com/bretsw/subreddits-over-time

Study of the r/Teachers and r/education subreddits over time

data-analysis dataset reddit

Last synced: 06 Feb 2025

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 29 Jan 2025

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 23 Dec 2024

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 23 Dec 2024

https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program

The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program

data-analysis data-science machine-learning-algorithms

Last synced: 29 Jan 2025

https://github.com/gracysapra/r-in-data-science

This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.

data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science

Last synced: 15 Dec 2024

https://github.com/thecoderpinar/samsung_stock_analysis_forecasting_and_volatility_analysis

A comprehensive analysis and forecasting project for Samsung stock data, utilizing historical data to build predictive models and analyze volatility.

data-analysis deep-learning financial-analysis forecasting machine-learning python stock-analysis volatility-forecasting

Last synced: 16 Dec 2024

https://github.com/sijuswamy/data-analytics-using-r

Course Repository for Data Analysis using R- Add-on course

data-analysis

Last synced: 31 Jan 2025

https://github.com/dogoncouch/dhcptranslate

Parses ISC DHCP server config, performs DNS resolution as needed, and outputs lease data in CSV format.

configuration csv-format data-analysis isc-dhcp isc-dhcp-server migration-tool

Last synced: 25 Jan 2025

https://github.com/0xjeremy/me-18-final

Data collection and Analysis tools for IMUs

data-analysis imu raspberry-pi

Last synced: 31 Jan 2025

https://github.com/pawlo77/kaggle-project

Repository for 'kaggle' project of Data Science Scientific Circle at Faculty of Mathematics and Information Science, Warsaw University of Technology

data-analysis data-science eda maschine-learning

Last synced: 25 Jan 2025

https://github.com/vitia-fritelle/analise_dieese

Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html

data-analysis economic-data

Last synced: 23 Dec 2024

https://github.com/akash1070/data-science-virtual-internship-by-accenture

data merging and data cleaning in python as well as data visulaisation with dashboard in Tableau.

data-analysis data-cleaning data-science python3 tableau visualization

Last synced: 29 Jan 2025

https://github.com/shriram-vibhute/data-analysis

This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.

data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn

Last synced: 15 Jan 2025

https://github.com/gher-uliege/stareso-data-processing

A set of tools to read, plot and process data from STARESO

coastal corsica data-analysis data-processing ocean-sciences oceanography

Last synced: 05 Feb 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 05 Feb 2025

https://github.com/agustinmusanti/delitosencaba-proyectofinal-dataanalytics-coderhouse

En este repositorio muestro mi proyecto final en el curso "Data Analytics" de Coderhouse.

data-analysis excel powerbi

Last synced: 23 Dec 2024

https://github.com/cworld1/novel-analysis

A simple project for analyzing Chinese novels

data-analysis novel

Last synced: 23 Jan 2025

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 05 Feb 2025

https://github.com/hariyebk/eplinsights

English Premier League 2018/2019 Data Analysis

class-composition data-analysis filesystem-library

Last synced: 25 Jan 2025

https://github.com/mxagar/airbnb_data_analysis

An analysis of the AirBnB dataset from Euskadi / the Basque Country.

airbnb data-analysis data-science eda feature-engineering modeling pandas regression

Last synced: 23 Dec 2024

https://github.com/kalebers/economic_analysis_data_science

Data Analysis Python project using economic data base to predict percentage of good and bad payers

data-analysis data-science machine-learning pandas python scipy sklearn-library

Last synced: 21 Jan 2025

https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart

The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.

dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet

Last synced: 06 Feb 2025

https://github.com/shriram-vibhute/digit_classification

This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.

classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn

Last synced: 15 Jan 2025

https://github.com/bcko/ud-da-eda-whitewinequality

Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : White Wine Quality dataset

data-analysis exploratory-data-analysis rmarkdown rstudio udacity udacity-data-analyst-nanodegree

Last synced: 25 Jan 2025

https://github.com/bcko/ud-da-stroopeffect

Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)

data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree

Last synced: 25 Jan 2025

https://github.com/sumitkumargiri/machine-learning-project

This repository contain all the best practices for managing Github repository.

data-analysis github machine-learning opensource project python

Last synced: 19 Jan 2025

https://github.com/neerajcodes888/whatsapp-chat-analyzer

A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱

analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud

Last synced: 31 Jan 2025

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 06 Feb 2025

https://github.com/jasontanx/capstone-project-machine-learning

A final semester project from my MSc Data Science course

data-analysis datascience machinelearningprojects tourism-data

Last synced: 01 Feb 2025

https://github.com/rupav/fifa17-detailed-analysis

⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽

data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart

Last synced: 17 Dec 2024

https://github.com/frankelavsky/political-polarization-challenge

I had 8 hours to build a solution to the research claim that "politics have become more divided in the past 50 years." You can navigate views of congressional voting patterns using arrows. I used d3, require, MVC pattern, and vanilla js. Pre-processed the data in node.js. Data is from DW-NOMINATE: ftp://k7moa.com/junkord/HANDSL01114A20_STAND_ALONE_30.DAT

client-side css d3 d3js data-analysis data-visualization frontend frontend-app html interactive interactive-visualizations javascript modular nodejs political-science politics requirejs research single-page-app visualization

Last synced: 19 Jan 2025

https://github.com/ernestaroozoo/memestocks.net

MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.

dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit

Last synced: 05 Feb 2025

https://github.com/ankan24/machine-learning-data-analysis

This repository contains a collection of Jupyter Notebooks that demonstrate various machine learning and data analysis techniques. The project does not provide a detailed description or specific use cases, but the notebooks cover a range of topics related to machine learning and data analysis.

data-analysis jupiter-notebook machine-learning

Last synced: 01 Feb 2025

https://github.com/mengyaohuang/data-manipulation-and-analysis

Data processing implementation with tools in Python

data-analysis nlp-machine-learning pandas-dataframe python

Last synced: 01 Feb 2025