Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/kaz-yos/distributed

Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)

data-analysis epidemiology statistics

Last synced: 12 Nov 2024

https://github.com/garciparedes/castile-and-leon-crops

Data Analysis of Castile and Leon Crops Area over the last years

castile-and-leon crops data-analysis data-science jupyter jupyter-notebook notebook spain

Last synced: 15 Nov 2024

https://github.com/hatamiarash7/ir-system

IR System for Reuters DB

data-analysis data-mining ir python

Last synced: 21 Oct 2024

https://github.com/yash22222/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3

Last synced: 09 Nov 2024

https://github.com/junpenglao/spafv

SPAFV - Surface Profile Analysis for Free Viewing eye movement experiment in 2AFC task

data-analysis statistics temporal-logic

Last synced: 25 Oct 2024

https://github.com/junpenglao/jaefa

Just Another Eye-movement Filtering Algorithm

data-analysis eye-movement-data eye-tracking

Last synced: 25 Oct 2024

https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data

Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.

data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping

Last synced: 09 Nov 2024

https://github.com/yash22222/cinesphere-crafting-personalized-movie-experiences

"CineSphere" is a groundbreaking project developing a personalized movie recommendation engine. By analyzing user preferences and viewing history, CineSphere suggests movies tailored to individual tastes, revolutionizing the movie-watching experience.

cinesphere data-analysis imdb machine-learning movie-recommendation-engine movie-recommendation-system movielens real-time

Last synced: 09 Nov 2024

https://github.com/lulloooo/python-googlemilestoneproject

Google Data Analysis Milestone Project about Chicago Bike Sharing Service

data-analysis data-visualization python tableau

Last synced: 12 Nov 2024

https://github.com/jofaval/sonar

Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987

data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci

Last synced: 21 Oct 2024

https://github.com/campagnucci/exercitando_pandas

Practical pandas exercises using open education data from São Paulo

data-analysis data-science education-data exercises pandas-tutorial

Last synced: 16 Nov 2024

https://github.com/haritha1005/data-analysis-portfolio

This repository showcases my data analytics and data science skills through projects, fostering collaboration and community engagement

data-analysis data-visualization etl excel matplotlib numpy-library pandas powerbi-report python3 r scipy sql tableau

Last synced: 15 Oct 2024

https://github.com/swat1563/recommendation-system

This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.

analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy

Last synced: 15 Oct 2024

https://github.com/yash22222/literacy-exploration-analysis

Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.

csv data-analysis data-visualization government-data india literacy literacy-analysis states

Last synced: 09 Nov 2024

https://github.com/gemaquejr/restaurant-orders

A project that applies object-oriented programming (OOP) concepts and works with Set, Hashmap, and Dict. It was created as the final assessment for section 06 of the computer science module of the Web Development course at Trybe.

data-analysis dict hashmap poo python set

Last synced: 07 Nov 2024

https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project

This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies statistical techniques such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 05 Nov 2024

https://github.com/yash22222/data-analysis-on-real-time-social-media-comments

EngageInsight analyzes user interactions in comment data. It provides insights through visualizations created using Python libraries like Pandas and Matplotlib. The project aims to uncover patterns and trends in user engagement. The visualizations provide an overview of comment lengths and the frequency of different types of replies.

data-analysis data-cleaning-and-preprocessing data-visualization matplotlib pandas pattern-recognition real-time-social-media-data seaborn trend-analysis

Last synced: 09 Nov 2024

https://github.com/adagio/ivoox_episodes

iVoox Episodes: Scraping & Analysis

beautifulsoup4 data-analysis ivoox pandas python python3 scraping

Last synced: 07 Nov 2024

https://github.com/jayita11/customer-engagement-insights-for-yelp-restaurant-business-success

This project analyzes Yelp restaurant data using SQLite, Python, and Tableau to explore user engagement, reviews, and ratings. It provides insights into restaurant success across cities, regions, and user behavior.

customer-engagement data-analysis interactive-visualizations json python ratings review sqlite3 tableau-dashboards-for-data-visualization yelp-restaurants

Last synced: 02 Nov 2024

https://github.com/pablo1785/receipt-rs

Receipt processing backend built with Shuttle.rs, Axum and Azure Form Recognizer API

api-rest axum azure backend cognitive-services computer-vision data-analysis rust shuttle-rs sqlx

Last synced: 05 Nov 2024

https://github.com/shridhar1504/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to produce a clean dataset with valid information.

classification data-analysis data-cleaning data-science data-visualization eda loan-prediction loan-status machine-learning predictive-modeling sql supervised-learning

Last synced: 05 Nov 2024

https://github.com/fer-aguirre/covid19-venezuela

Data analysis of covid-19 deaths in Venezuela

covid-19 data-analysis dataviz line-chart

Last synced: 05 Nov 2024

https://github.com/fer-aguirre/cookiecutter-data-analysis-extensive

A cookiecutter template for data analysis projects using Python.

cookiecutter data-analysis project-template python

Last synced: 05 Nov 2024

https://github.com/cego669/datathonengopevi

Proposed solution for the Datathon of the VI ENGOPE (Encontro Goiano de Probabilidade e Estatística).

data-analysis data-science datathon python r streamlit xgboost-classifier

Last synced: 19 Oct 2024

https://github.com/amlanmohanty1/genai-data-analysis-report-generator

Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.

ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python

Last synced: 19 Oct 2024

https://github.com/fer-aguirre/cookiecutter-data-analysis-lite

A starter template for data analysis projects that offers a simplified and beginner-friendly structure.

cookiecutter data-analysis project-template python

Last synced: 05 Nov 2024

https://github.com/tameronline/ai-financial-analyst

AI-driven financial analyst system utilizing LangChain and Ollama for real-time stock analysis, market trends, and financial insights.

ai data-analysis finance financial-analysis langchain machine-learning nlp ollama stock-market

Last synced: 19 Oct 2024

https://github.com/mituskillologies/aiml-dypiemr-sep24

Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.

artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3

Last synced: 31 Oct 2024

https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india

As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.

accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization

Last synced: 31 Oct 2024

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 31 Oct 2024

https://github.com/muskanmi/data_analysis_python

Data analysis of a student results dataset using Python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 05 Nov 2024

https://github.com/jagoda11/elastic-vision

This repository contains a full-stack application designed to explore data from ElasticSearch 🧐 indices and visualize it using charts and graphs. The backend is built using Node.js and the frontend is powered 🚀 by React.

backend chartjs dashboard-development data-analysis data-visualization docker elasticsearch frontend fullstack javascript material-ui monorepo mui-x node pie-chart react restful-api tables

Last synced: 08 Nov 2024

https://github.com/smahala02/calorimtery

A calorimetry lab project involving Python and Excel for computing heat transfer from experimental data.

calorimetry chemistry data-analysis excel jupyter-notebook python thermodynamics

Last synced: 31 Oct 2024

https://github.com/smahala02/magnetism-lab

This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.

data-analysis inductance jupyter-notebook magnetism python toroids

Last synced: 31 Oct 2024

https://github.com/bonelesswater/tradingbot

This project is a web application for a trading bot that displays financial data and indicators. It includes functionality for researching financial data, displaying market indicators, and more.

ai azure css d3 data-analysis django html javascript jquery materializecss python stock-market

Last synced: 25 Oct 2024

https://github.com/dcs-training/exploratory-data-analysis-and-visualisation-with-observable-plot

This two-hour workshop will teach you how to follow an exploratory data analysis pipeline with Observable Plot, a new JavaScript library based on the Grammar of Graphics, that proposes a simple yet expressive interface to create powerful graphics easily shareable on the web. Go to the Readme file

d3 data-analysis data-visualisation javascript observable-notebook

Last synced: 25 Oct 2024

https://github.com/jayqi/data-analysis-tools

Presentation on Data Analysis Tools

data-analysis presentation-slides

Last synced: 21 Oct 2024

https://github.com/fortunewalla/birdstrikes

birdstrikes database created for postgresql with simple sample queries

birdstrikes csv data-analysis data-science database dataset pgsql postgresql practice sample sql sql-query workshop

Last synced: 27 Sep 2024

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains visualization projects built with Power BI. The visualizations yield insights and strategies that help grow the business and raise profit margins, and that help reduce damage from accidents and calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 05 Nov 2024

https://github.com/shridhar1504/rafik-s-kitchen-data-analysis

This project analyzes the sales and expenses data of a famous fast-food restaurant. It focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 05 Nov 2024

https://github.com/cosmoduende/r-twitter

Explore your Twitter activity with R: Sentiment Analysis and Data Visualization. How to analyze your Twitter account (or any account), discover your habits and sentiments with the "rtweet" package and NLP.

data-analysis data-visualization lemmatization nlp nlp-library nlp-resources nltk nltk-library r-package r-programming r-studio rtweet stemming twitter twitter-api twitter-data twitter-data-analysis twitter-data-extraction twitter-sentiment-analysis udpipe

Last synced: 07 Nov 2024

https://github.com/ibrahimhabibeg/national-university-of-singapore-sms-analysis

Analysis of SMS messages collected by the National University of Singapore

analytics data-analysis data-science nlp python

Last synced: 05 Nov 2024

https://github.com/cosmoduende/r-ggcats

StrangeR things: Adding… Cats? to your plots in R. How to analyze and visualize data with the help of funny cats with the "ggcat" package.

data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio

Last synced: 07 Nov 2024

https://github.com/imranr98/brokeetl

Parse transactions from bank statement PDFs into a JSON array.

automation bank-statement banking data-analysis data-mining data-ownership etl finance json lifestyle pdf pdf-converter tracking

Last synced: 19 Nov 2024

https://github.com/abeltavares/nps_performance_analysis

Analyzing and Monitoring Net Promoter Score (NPS) Performance for Healthcare Companies using SQL and Power BI

customer-satisfaction dashboard data-analysis data-visualization healthcare net-promoter-score nps-analysis performance-monitoring power-bi sql

Last synced: 09 Nov 2024

https://github.com/cosmoduende/r-earthquakes

Analysis and visualization of seismic activity data in Mexico with R. How to analyze and visualize Mexico's seismic history with data from the SSN (Servicio Sismológico Nacional).

data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos

Last synced: 07 Nov 2024

https://github.com/katiesaund/tidy_tuesday

A weekly data project in R from the R4DS online learning community

data-analysis data-visualization datascience plot r rstats tidytuesday

Last synced: 14 Oct 2024

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 10 Nov 2024

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 10 Nov 2024

https://github.com/rahul-jha98/restauranttrends.stats-backend

Application that scrapes the Zomato Dataset and enables the user to visualise the results.

data-analysis data-extraction firebase-storage web-scraping zomato-api

Last synced: 19 Nov 2024

https://github.com/deeksha-dhawan/pizza-outlet-analysis-using-sql

This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.

coding-challenge data-analysis data-science microsoft my portfolio-project programming project projects sql sql-analysis sql-project sqlproject sqlserver

Last synced: 14 Oct 2024

https://github.com/muneeb706/exploratory-data-analysis

Exploratory Data Analysis of some problems using python (numpy & pandas)

data-analysis exploratory-data-analysis jupyter-n numpy pandas python3

Last synced: 15 Oct 2024

https://github.com/muneeb706/r-programming

R-Programming examples for data analysis.

data-analysis r-programming

Last synced: 15 Oct 2024

https://github.com/muneeb706/human_activity_recognition

This project performs data cleaning and data exploration steps for Human Activity Recognition Using Smartphones Data Set in R programming language.

data-analysis data-cleaning data-exploration r-programming

Last synced: 15 Oct 2024

https://github.com/marlysson/craw

A system to show the data collected from various sources using chartjs - ⚡️

chartsjs data-analysis data-science web-scraping

Last synced: 14 Oct 2024

https://github.com/thenorthkun/movies-dataset-analysis

Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼‍♀️🎧

data-analysis data-visualization filtering

Last synced: 14 Oct 2024

https://github.com/dmvianna/python-nix

Trivial Nix environment with pandas and postgresql

data-analysis nix

Last synced: 15 Oct 2024

https://github.com/eubrunoo/beer-consumption-predictor

An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.

data-analysis data-science data-visualization machine-learning r statistical-analysis statistics

Last synced: 28 Oct 2024

https://github.com/upes-open/open-cryptocurrency-analysis

A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.

analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization

Last synced: 08 Nov 2024

https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metrics.

adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends

Last synced: 05 Nov 2024

https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset

Engineering, culture, and blog data gathered from Zomato's blogs for educational and learning purposes, and to spread the better problem-solving methodologies adopted by modern unicorns.

data-analysis dataset regex selenium webdriver zomato-data-analysis

Last synced: 01 Nov 2024

https://github.com/prangonghose/wikipedia-blocking-policies

This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.

data-analysis data-visualization matplotlib open-source pandas python3 seaborn

Last synced: 20 Oct 2024

https://github.com/Fisseha-Estifanos/telecom

A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. The generated insights can be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/

data-analysis notebooks-jupyter python visual-studio-code visualization

Last synced: 23 Oct 2024

https://github.com/thijswillemmoens/historical_document_analysis

Trying to do some Data Science with OpenAI and LLMs.

data-analysis llama2 ollama-api openai openai-api python

Last synced: 08 Nov 2024

https://github.com/ezmiller/esd-viz

Visualization of European Social Survey (http://www.europeansocialsurvey.org/data/)

clojure data-analysis visualization

Last synced: 15 Nov 2024

https://github.com/jayita11/healthcare-management-optimization-analysis-and-visualization

This project analyzes healthcare data from 2019 to May 2024, optimizing patient care, resource allocation, and financial management. Insights include billing trends, blood bank management, doctor performance, and medication demand, supported by Excel, interactive Tableau dashboards, and SQL analysis.

data-analysis excel healthcare interactive-dashboards mysql sql tableau-dashboards

Last synced: 14 Oct 2024

https://github.com/ashwin331133/sql-healthcare-data

This repository contains SQL queries designed to analyze health care data. The queries focus on patient demographics, encounter costs, and flu shot statistics, aiming to provide insights into patient behavior and financial impacts. The datasets include information on patient encounters, flu shots, and hospital admissions.

data-analysis mysql sql

Last synced: 14 Oct 2024

https://github.com/touchesir/twitter_physicalactivity

Companion Data / Analysis for "Monitoring Physical Activity Levels using Social Media Data"

data-analysis twitter

Last synced: 15 Oct 2024

https://github.com/jidesamuell/data-analytics-projects

This is a repository I have created to showcase my skills, share projects, and track my progress in data analytics.

data-analysis excel matplotlib powrebi python sql

Last synced: 18 Oct 2024

https://github.com/mkoeppe/jiawei-computations

Computations supporting Chapters 2 and 3 of Jiawei Wang's dissertation "Subadditivity of Piecewise Linear Functions", UC Davis, Ph.D. program in Mathematics, 2020

benchmark-framework branch-and-bound cluster cutting-planes data-analysis hpc integer-programming reproducible-research sagemath

Last synced: 23 Oct 2024

https://github.com/abeltavares/hotel_performance_analysis

A Power BI project that analyzes the performance of a hotel, including revenue, expenses, customer data, hospitality metrics and financial ratios.

business-intelligence data-analysis expenses financial-analysis hospitality-industry power-bi revenue

Last synced: 09 Nov 2024

https://github.com/mdaffailhami/king_county_home_sales_analysis

This repository contains code and analysis for exploring home sales data in King County, featuring geospatial mapping to visualize trends and factors influencing housing prices, including location, size, and various property features, using Python and popular data analysis libraries.

data-analysis data-science folium-maps geospatial python

Last synced: 07 Nov 2024

https://github.com/lit26/novel-corona-virus-2019

Data Analysis for Novel Corona Virus 2019

analysis coronavirus-case data-analysis sir-model

Last synced: 15 Oct 2024

https://github.com/lit26/data_jobs_analyzing

Data analysis for data jobs

data-analysis topic-modeling

Last synced: 15 Oct 2024

https://github.com/mh0386/motorcycle_data_analysis

Data analysis applied to motorcycle dataset.

data-analysis

Last synced: 07 Nov 2024

https://github.com/pedrosfaria2/analisandopostshn

Project to analyze posts from the HackerNews community

analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3

Last synced: 09 Nov 2024

https://github.com/matteofasulo/cdc-finf

Project of fundamentals of Computer Science

data-analysis data-science data-visualization numpy pandas python python3

Last synced: 19 Nov 2024

https://github.com/erickchacon/day2day

Functions that can be useful in day-to-day data analysis. It includes functions to find paths for projects, make summaries of databases inside a folder, and so on.

data-analysis exploratory-data-analysis simulation spatial-analysis

Last synced: 17 Nov 2024