Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/davidzajac1/reptoro

A Data Visualization and Analytics Platform for the Reptile Industry

analytics data-analysis data-visualization plotly-dash python

Last synced: 11 Feb 2025

https://github.com/jimbrig/lossrunAnalyzer

R Package and Shiny App to Analyze Insurance Lossruns

actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny

Last synced: 04 Dec 2024

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and data visualization of countries' GDP, life expectancy comparison across continents, GDP per capita relative growth, population relative growth comparison, etc., using Pandas and Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 15 Nov 2024

https://github.com/orkunaktas/sofascore-webscraping

⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️

beautifulsoup data-analysis football-analytics football-data selenium webscraping

Last synced: 11 Oct 2024

https://github.com/sayakpaul/analysis-of-college-database-of-2017-passouts

Contains my analysis of a database containing information about the students of an engineering college.

data-analysis data-visualization matplotlib python-3

Last synced: 08 Feb 2025

https://github.com/ethan-wickstrom/rrrs

Welcome to RRRS, a rapid, hyper-optimized CSV random sampling tool designed with performance and efficiency at its core. Crafted meticulously in Rust, RRRS offers an unparalleled solution for extracting random data samples from CSV files swiftly and effortlessly.

analytics cli command-line command-line-tool data data-analysis data-science dataset rust rust-lang sample samples

Last synced: 19 Nov 2024

https://github.com/irfanchahyadi/ml-notes

Complete personal notes for performing data analysis, preprocessing, and training ML models.

data-analysis machine-learning plotting python

Last synced: 07 Nov 2024

https://github.com/sondosaabed/introduction-to-data-analysis-with-pandas-and-numpy

Learning the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. Working with data in Python using libraries like NumPy and pandas.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 06 Nov 2024

https://github.com/globeandmail/startr-cli

A command-line scaffolder for the startr R project template

data-analysis data-journalism data-visualization journalism r

Last synced: 26 Dec 2024

https://github.com/rikard-helgegren/leverage_analysis_tool

Analyst tool for portfolio construction. How can leveraged certificates be used to increase returns in a portfolio while keeping the risk as low as possible? Use the tool and find out.

cpp data-analysis investment kivy-framework python3

Last synced: 14 Oct 2024

https://github.com/jimmymugendi/email-sms-spam_classifier

Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.

data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library

Last synced: 16 Jan 2025

https://github.com/accurat/react-dataviz

⚛📊🚀 React components to build powerful interactive data visualizations

d3 data-analysis data-visualization react react-components

Last synced: 06 Nov 2024

https://github.com/negativenagesh/whatsapp_chat_analyzer

This is an end-to-end WhatsApp chat analysis project; here I have used my hostel WhatsApp group's chat data.

data-analysis data-science frontend modeling python streamlit

Last synced: 23 Dec 2024

https://github.com/i10mm/gpt-arxiv-fetcher

Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!

artificial data-analysis intelligence llm machine-learning

Last synced: 22 Nov 2024

https://github.com/femtotrader/dukascopyticksreader.jl

A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/

data data-analysis dataset dukascopy html julia stock-data

Last synced: 12 Oct 2024

https://github.com/martinthoma/bad-stats

Examples of how not to do statistics / visualizations

data-analysis statistics visualizations

Last synced: 03 Feb 2025

https://github.com/tzerk/gammaspec

A collection of functions to analyse gamma spectra

data-analysis gamma-ray-spectrometry r

Last synced: 08 Nov 2024

https://github.com/ogoodness/vbreaker-js

CSC 483 Project - Ciphers: Caesar, Multiplicative, Affine, Vigenere, Hill, Columnar Transposition

affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher

Last synced: 14 Nov 2024

https://github.com/coderjolly/football-player-prediction

Intense transfer speculation surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. Therefore, we predict the market value of a player using the data provided in CSV format.

data-analysis data-visualization decision-tree-regression machine-learning xgboost-regression

Last synced: 02 Feb 2025

https://github.com/mjunaidca/sql-auditor-pro-gpt

Get quick insights from your data sources in tables, charts, and graphs with AI-driven audits, and optimize your SQL databases.

ai ai-agent auditing-data cloudflared compound-ai-systems custom-gpt data-analysis fastapi-template gpt sql sqlmodel

Last synced: 03 Nov 2024

https://github.com/pythondeveloper6/store-sales-eda

Simple EDA with some insights on store sales

data-analysis eda matplotlib numpy pandas seaborn

Last synced: 19 Jan 2025

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 11 Nov 2024

https://github.com/maksimekin/umd_data_challange_2020

Ocean Clean up data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.

cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd

Last synced: 15 Dec 2024

https://github.com/shervinnd/bazar_app_store_eda

Bazar App Data analysis code to find the most downloaded category and most popular installed apps

data data-analysis data-science dataanalysis eda python

Last synced: 18 Jan 2025

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 11 Nov 2024

https://github.com/dermatologist/goscar-export

🔥 CSV to FHIR (For OSCAR EMR EForm Export)

data-analysis data-warehouse fhir fhir-r4 fhir-server hacktoberfest oscar-emr

Last synced: 27 Oct 2024

https://github.com/srinivasrm/mutual-funds-analysis-and-prediction

In this project I analyzed and predicted 1-, 3-, and 5-year returns for 1064 mutual funds in India. I scraped the data from the most visited website for mutual fund investments. I tested several regression models: linear model, SGD Regressor, Random Forest Regressor, Decision Tree Regressor, Ridge, MLP Regressor, and linear model (Lasso). I then selected the best-performing model, performed hyperparameter tuning, and deployed an interactive application that can generate the visualization and email it to the user's email address.

beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication

Last synced: 11 Oct 2024

https://github.com/astrodynamic/retailanalitycs-in-postgresql

Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.

bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views

Last synced: 12 Jan 2025

https://github.com/stimulsoft/stimulsoft.dashboards.php

Dashboards.PHP is a complete software package for designing and viewing dashboards. It includes the JS data analysis engine, dashboard designer, and viewer. Supports PHP 5, PHP 7, and PHP 8.

charts dashboard-builder dashboards data-analysis data-grid data-visualization datatable dynamic-dashboard interactive-dashboards live-data mysql-data php php-bi-tools php-dashboard php-kpi php7 php8 pivot-tables sql-datasources statistics

Last synced: 30 Jan 2025

https://github.com/itzmeanjan/indian-railway

Exploring Indian Railways time table dataset, with ❤️

data-analysis data-visualization indian-railways matplotlib python python3 railway

Last synced: 02 Feb 2025

https://github.com/lisa-ho/three-investigators

Repository for scraping and analysing fan data on a German audio drama called 'Die Drei Fragezeichen' (the three investigators).

data-analysis data-viz datawrapper python webscraping

Last synced: 10 Feb 2025

https://github.com/ynikitenko/lena

Lena is an architectural framework for data analysis

analysis-framework analysis-pipeline data-analysis data-science

Last synced: 11 Nov 2024

https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history

A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)

data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3

Last synced: 21 Jan 2025

https://github.com/abhash-rai/traffic-image-classifier

A web-based solution that uses a robust TensorFlow model for precise traffic condition classification, built with ReactJS and a FastAPI backend.

cnn cnn-classification cnn-keras cnn-model data-analysis data-science data-visualization fastapi keras keras-tensorflow python python-3 python3 react reactjs tensorflow traffic traffic-classification transfer-learning

Last synced: 16 Jan 2025

https://github.com/palewire/baseball-notebooks

Python notebooks exploring Major League Baseball data

baseball baseball-statistics data-analysis jupyter-notebook pandas python

Last synced: 18 Oct 2024

https://github.com/vedadiyan/genql

GenQL is a generic querying language fully written in Go

data-analysis data-mapping data-processing data-science data-translation json json-data sql

Last synced: 11 Nov 2024

https://github.com/josmarcristello/goprotimeocr

Python-based OCR tool using EasyOCR and OpenCV for automated text extraction from images. Customizable image preprocessing steps and options for GPU acceleration make this a versatile and efficient solution for various OCR tasks

computer-vision data-analysis easyocr gopro gopro-camera ocr opencv pytesseract python

Last synced: 22 Jan 2025

https://github.com/josmarcristello/geokmlanalyzer

The Geo Kml Analyzer is a Python-based tool designed to process and analyze elevation data from KML files. It uses Google Maps to obtain elevation data (interpolating if necessary) and spherical trigonometry equations to calculate distance.

data-analysis geolocation geospatial gis google-maps-api gps kml python

Last synced: 22 Jan 2025

https://github.com/easonlai/chat_with_csv_streamlit_with_chart

In this repository, you will find example code for creating an interactive chat experience that lets you ask questions about your CSV data, with chart visualization capabilities.

azure-openai azure-openai-api azure-openai-service chart-visualization data-analysis data-visualization gpt gpt-3 langchain langchain-python matplotlib openai python streamlit streamlit-webapp

Last synced: 10 Nov 2024

https://github.com/andreip/twitter-authorities

Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

data-analysis mongodb python twitter-topics

Last synced: 19 Dec 2024

https://github.com/grand-27-master/data-science-course

One-stop repo for learning data science along with roadmap!

data-analysis data-science machine-learning python statistics

Last synced: 06 Jan 2025

https://github.com/quantumudit/analyzing-whiskyexchange-whisky

This project focuses on scraping data on Japanese whisky from The Whisky Exchange website, performing the necessary transformations on the scraped data, and then analyzing and visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql

The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.

data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server

Last synced: 30 Jan 2025

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling r sentiment-analysis statistics text-analysis web-scraping

Last synced: 10 Nov 2024

https://github.com/ktmud/github-life

A data explorer for GitHub projects' life cycles

data-analysis github scraper time-series

Last synced: 23 Jan 2025

https://github.com/trainingbypackt/splunk-7-essentials-elearning

Build an elaborate Splunk enterprise environment that will extract powerful insights from your machine-generated big data

data-analysis eventgen indexing machine-learning splunk sub-search visualization

Last synced: 14 Nov 2024

https://github.com/garrettj403/albertaenergysources

Get grid data from Alberta Electric System Operator (AESO)

alberta canada data-analysis energy-data

Last synced: 27 Jan 2025

https://github.com/prajwalchapke055/accenture-data-analytics-and-visualization-forage

NAVIGATING NUMBERS - Apply your data analytics & visualization skills to advise a social media client on their content creation strategy as a Data Analyst at Accenture

accenture communication data-analysis data-modeling data-understanding data-visualization forage internship internship-task job-simulation presentation project-planning public-speaking storytelling strategy teamwork virtual-internship

Last synced: 22 Jan 2025

https://github.com/gcappon/py_agata

Official Python port of AGATA (Automated Glucose dATa Analysis), a toolbox to analyse glucose data.

continuous-glucose-monitoring data-analysis hacktoberfest python toolbox

Last synced: 12 Nov 2024

https://github.com/romac/adaproject

🔬 Project proposal for the Applied Data Analysis course at EPFL

data-analysis

Last synced: 24 Dec 2024

https://github.com/pottekkat/matplotlib-basics

A Jupyter notebook that walks you through the basics of Matplotlib for data science applications

data-analysis data-visualization machine-learning matplotlib pyplot python

Last synced: 07 Jan 2025

https://github.com/lebrancconvas/personal-saved-datasets

Saved Datasets for doing stuff in the area of Statistics & Data Science.

data-analysis data-science database datasets excel kaggle kaggle-dataset microsoft-excel sample-dataset

Last synced: 08 Jan 2025

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 24 Nov 2024

https://github.com/crafterkolyan/applied-statistical-data-analysis

Applied statistical data analysis course. Faculty of Computational Mathematics and Cybernetics (CMC), Moscow State University (MSU). Spring 2020

autoexec-scripts autotest data-analysis github-actions statistics university

Last synced: 11 Feb 2025

https://github.com/zmyzheng/signature-authentication-pen

Signature Authentication Pen, a cloud-based IoT project that performs identity authentication by exploiting the biometric features of users' signatures. Details:

android aws data-analysis identity-authentication iot neural-network signature-authentication-pen

Last synced: 05 Feb 2025

https://github.com/gher-uliege/seadatacloud

Tools and interfaces for working with the DIVA interpolation software tool.

data-analysis data-visualization interpolation nco netcdf ocean-sciences oceanography

Last synced: 05 Feb 2025

https://github.com/mggg/gerrytools

Tools for evaluating districting plans.

data-analysis tooling

Last synced: 06 Dec 2024

https://github.com/danielpuentee/outdpik

The fundamental toolkit for outlier search and visualization. It aims to be the fundamental high-level package for this purpose.

data-analysis matplotlib numpy python

Last synced: 17 Dec 2024

https://github.com/ouhscbbmc/data-science-practices-1

Collection of publicly available practices of data science and analysis

bbmc-investigation data-analysis data-science python r sql

Last synced: 18 Nov 2024

https://github.com/cafferychen777/microbiomestat-turtorial-professional-version

MicrobiomeStat Tutorial Repository: This is a comprehensive resource for learning how to use the MicrobiomeStat package. It provides a step-by-step guide to effectively analyze complex microbiome data.

16s 16s-rrna 16s-seq analysis data-analysis metagenomes metagenomic-analysis metagenomic-pipeline metagenomics microbiome microbiome-analysis microbiome-analysis-pipelines microbiome-data microbiome-workflow omics omics-data omics-data-analysis omics-data-integration visualization

Last synced: 23 Jan 2025

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 23 Jan 2025

https://github.com/darsan-in/rumour-monger-spotter

Rumour Monger Spotter is a prototype developed during a national-level cyber hackathon to identify false information on Twitter. Using the Google Fact Check API and a Multinomial Naive Bayes classifier, the tool analyzes tweet content to assess the likelihood of misinformation. Despite a development window of less than 24 hours, the project won a t

ai data-analysis fact-checking hackathon india naive-bayes national-competition natural-language-processing prototype real-time-analysis social-media text-classification tweet-content twitter

Last synced: 12 Dec 2024

https://github.com/mutasim77/dbt-analytics

🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.

big-query data data-analysis dbt warehouse

Last synced: 26 Jan 2025

https://github.com/ihabbendidi/diamond-analysis

Exploratory statistical analysis of a Diamond dataset

data-analysis data-visualization exploratory-data-analysis machine-learning r

Last synced: 09 Feb 2025

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 11 Nov 2024

https://github.com/super-lou/card

🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat

aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly

Last synced: 23 Dec 2024

https://github.com/erictleung/microbiome-analysis-resources

🔬 Resources and notes on studying the human microbiome

bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources

Last synced: 09 Feb 2025

https://github.com/super-lou/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 23 Dec 2024

https://github.com/lucs1590/strava-analysis

🏃📊 Using Strava to do personal analyses and to practice data science skills.

data-analysis data-science github jupyter-notebook python3 strava strava-api

Last synced: 10 Feb 2025

https://github.com/storopoli/r_scripts

A couple of handy R scripts that I use on a daily basis for scientific research

data-analysis data-science data-visualization r scientific

Last synced: 20 Nov 2024