Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/clima/climaanalysis.jl

An analysis library for ClimaDiagnostics (and, more generally, NetCDF files)

climate data-analysis julia visualization

Last synced: 01 Nov 2024

https://github.com/vishnu-t-r/sql_solved_questions

In this repository you will find SQL queries of multi-difficulty levels that can applied on to large datasets for various business requirements.

business-solutions complex-sql data-analysis sql-data-analysis sql-queries

Last synced: 10 Nov 2024

https://github.com/tusharnankani/data-analysis-with-python

A complete introduction to data analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data data-analysis data-visualization hacktoberfest jovian jovian-ml jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 07 Nov 2024

https://github.com/jhrcook/spotify-data-analysis

The analysis of my Spotify streaming data.

data-analysis data-analysis-r r rlang rstudio spotify-data

Last synced: 11 Oct 2024

https://github.com/dr-montasir/mnjs

MATH NODE JS (MNJS): A tiny math library for node.js & JavaScript on browser

data-analysis data-science javascript js jsdelivr library math nextjs npm react svelte sveltekit ts typescript yarn

Last synced: 11 Nov 2024

https://github.com/jwngr/notreda.me

Historical Notre Dame Fighting Irish football schedules, stats, and analyses

college-football data-analysis data-visualization football notre-dame sports stats

Last synced: 06 Nov 2024

https://github.com/dennis-van-gils/python-fluidprop

Easy access to thermodynamic fluid properties as a function of temperature and pressure. With a minimal command-line interface.

command-line-tool coolprop data-analysis fluids thermodynamic-properties

Last synced: 12 Nov 2024

https://github.com/cosmoduende/r-google-location-history

Explore your activity on Google with R: How to analyze and visualize your Location History. Find out how and how much you have allowed Google to track you, using a copy of your personal data.

data-analysis data-analytics data-visualisation data-visualization data-viz geolocation-data google-data-analytics google-location-api google-location-history google-location-service google-takeout location-history maps-data r- r-analytics r-language r-programming r-stats

Last synced: 07 Nov 2024

https://github.com/simeononsecurity/track-helium-mobile-wifi

A collection of scripts and tools that tracks the availability of helium mobile wifi networks in the wild from the Wigle Dataset and Helium API. Updates every 24 hours.

automation bigdata carrieroffload data data-analysis dataset dataset-generation helium openroaming passpoint python

Last synced: 08 Nov 2024

https://github.com/deshima-dev/decode

:zap: DESHIMA code for data analysis

astronomy data-analysis deshima python spectroscopy submillimeter

Last synced: 05 Nov 2024

https://github.com/abubakkar32/machine-learning-practice

This GitHub repository is a valuable resource for machine learning and Python enthusiasts. It includes a wide range of projects and tools, covering topics like Data Visualization, Data Analysis, ML, DL, Automation, NLP, Web Scraping, and more. Contributors are welcome to join and learn together in this supportive community. Happy coding!

automation big-data chatbot data-analysis data-visualization datastractures machine-learning mathplotlib nlp-machine-learning numpy pandas plotly pypdf2-lib pytest python3 seaborn selenium-webdriver skit-learn webscraping

Last synced: 28 Oct 2024

https://github.com/abmenzel/messenger-wrapped

Explore your Facebook Messenger chat history, in a manner inspired by Spotify Wrapped

data-analysis data-visualization facebook meta nextjs reactjs spotify

Last synced: 07 Aug 2024

https://github.com/cusyio/datenanalyse-in-python

Kurs zur automatisierten Aufbereitung, Zusammenfassung und Erstellung von Diagrammen tabellarischer Daten mit Python.

data-analysis pandas-python pandas-tutorial python

Last synced: 10 Nov 2024

https://github.com/stappit/blog

I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.

bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics

Last synced: 25 Oct 2024

https://github.com/acs/ghtorrent

Experimental GItHub project analysis based on GHTorrent

data-analysis data-visualization ghtorrent github visualization

Last synced: 27 Oct 2024

https://github.com/sanvishal/Exoplanet-Explore

An Interactive data visualization of Exoplanets

animation d3js data-analysis data-science exoplanet python space visualization

Last synced: 08 Nov 2024

https://github.com/coalio/Assistant

A data science library providing flexible dataframes for Lua 5.1+

data-analysis data-science data-structures dataframe lua

Last synced: 07 Nov 2024

https://github.com/cosmoduende/r-ggsoccer

StrangeR things: Visualizing Soccer Data with R… on a Soccer Pitch? How to analyze, visualize and report soccer data and strategies on a soccer pitch with the "ggsoccer" package

data-analysis data-analysis-in-r data-analytics ggsoccer package packages r-language r-package r-programming r-programming-projects r-studio soccer soccer-analytics soccer-data soccer-game soccer-matches soccer-simulation

Last synced: 07 Nov 2024

https://github.com/cosmoduende/r-spotify-history-analysis

Explore your activity on Spotify with R and "spotifyr": How to analyze and visualize your streaming history and music tastes. Find out how and how much you consume from Spotify, using a copy of your personal data and the "spotifyr" package

analisis-de-data analytics data-analysis data-analysis-r data-analytics data-visualization r-language r-programming sentiment-analysis spotify-analysis spotify-api spotify-connect spotify-data spotify-playlist spotify-streaming-history spotify-web-api spotifyr streaming-history visualizacion-de-datos visualizaciones

Last synced: 07 Nov 2024

https://github.com/drkostas/eestech-bigdata-challenge

EESTech Challenge is a brand new competition organized by EESTEC, that has the aim to create opportunities for European students to gain knowledge in the field of EECS and develop a professional network. The technological topic of 2017-2018ths competition was Big Data. This is the code I sumbitted with my team (BFS), which consisted of 3 members in total.

apache-spark data-analysis data-visualization eestec machine-learning matplotlib python

Last synced: 10 Nov 2024

https://github.com/kaustubhgupta/blogathon-analysis

Analytics Vidhya Blogathon Data Analysis: Python Data Extraction with PowerBI dashboard

data-analysis data-mining data-scraping excel pandas powerbi powerbi-report project python tqdm

Last synced: 14 Oct 2024

https://github.com/ztjhz/sc1015-project

Predict the success of an anime using data science and machine learning (regression + classification)

anime data-analysis data-science machine-learning

Last synced: 28 Oct 2024

https://github.com/ayaanhossain/sharedb

An on-disk pythonic embedded key-value store based on LMDB for compressed data storage and distributed data analysis

data-analysis data-storage distributed embedded-database key-value lmb msgpack multiprocessing parallel python python-3 python-library python-script python2 python3 store

Last synced: 15 Oct 2024

https://github.com/nico-curti/data-analysis

Data Analysis Utilities (from Statistics to Machine Learning)

data-analysis deep-learning-algorithms linear-algebra machine-learning-algorithms statistics

Last synced: 07 Nov 2024

https://github.com/bydevmar/master_masd_fpo

Ce dépôt GitHub regroupe tous les cours, TP, TD, projets, et exercices de ma formation en master en mathématiques appliquées pour la science des données. Parcourez-le pour une vue complète de mon parcours académique, offrant une perspective détaillée de mon apprentissage dans ce domaine.

acp afc algebra big-data-analytics dashboards data-analysis datascience economics english graph-theory latex linear-algebra non-linear-algebra probability prog python scientific-research software-package statistics

Last synced: 17 Nov 2024

https://github.com/danieldacosta/airbnb-analysis

Data analysis of AirBnb website history in the city of Rio de Janeiro

airbnb-analysis airbnb-website-history data-analysis

Last synced: 12 Nov 2024

https://github.com/sukanyabag/statistical-analysis-of-my-medium-articles

This repository contains an exploratory data analysis of my writer data at Medium. I use it to carry out data analysis once in every 4 months to see audience and fan growth, and topics they love! You can check out the articles here👇

data-analysis data-storytelling matplotlib pandas seaborn statistical-analysis sweetviz web-scraping

Last synced: 10 Nov 2024

https://github.com/zvtvz/zvdata

an extendable library for recording and analyzing data

analyzing-data dash data-analysis pandas plotly-dash sqlalchemy

Last synced: 12 Nov 2024

https://github.com/pjagielski/worldcup

2018 World Cup match data analysis

clojure data-analysis repl world-cup-2018

Last synced: 05 Nov 2024

https://github.com/shubham18024/census_analysis

This repository contains code and resources for a summer research project focused on statistical analysis of census data. The project aims to analyze demographic trends, population distributions, and other relevant metrics derived from census datasets.

census-data csv-files data-analysis data-visualization jupyter-notebook mitosheet python report statistics

Last synced: 08 Nov 2024

https://github.com/stefen-taime/nifi-etl-data-pipeline

This post will demonstrate the creation of a containerized data engineer environment using Docker Stacks.

apache api big-data cloud data-analysis data-engineering docker-compose etl-pipeline machine-learning nifi postgresql-database slack zookeeper

Last synced: 16 Nov 2024

https://github.com/lpsm-dev/twitter-sentimental-analysis-covid

✔️ Twitter Sentimental Analysis Covid-19 (using Textblob - Naive bayes) + Python Backend Flask + Docker + Docker Compose + MongoDB

adminmongo alpine backend corona coronavirus-analysis covid covid-19 data-analysis docker docker-compose docker-compose-wait flask flask-restful mongodb mongoku python python-api sentiment-analysis twitter

Last synced: 09 Nov 2024

https://github.com/koldlight/r4ds

R for data science course

course data-analysis data-science data-viz r

Last synced: 09 Nov 2024

https://github.com/amrrs/iq18_workshop

Data Science in Python - Workshop Data and Notebook

data-analysis ipl python

Last synced: 15 Nov 2024

https://github.com/vishnu-t-r/sql_functions_reference

This repository contains intermediate to complex sql queries which explains sql concepts. This repository can be helpful when writing queries with complex concepts and can be considered for reference. (Most queries have DDL and DML command within for practise)

complex-sql data-analysis data-mining sql sql-query

Last synced: 10 Nov 2024

https://github.com/pseudomanifold/enchiridion-tda

An enchiridion for instructing mortals in the hidden arts of topological data analysis

data-analysis graph-kernels persistent-homology topology

Last synced: 06 Nov 2024

https://github.com/aglove2189/appias

Machine learning workflow toolkit ✨🦋✨

appias data-analysis data-science machine-learning pandas python sklearn workflow

Last synced: 16 Nov 2024

https://github.com/mribeirodantas/vidente

R package to parse and preprocess the Surveillance, Epidemiology, and End Results (SEER) Program data from NIH/NCI

cancer-patients cancer-research data-analysis data-science data-structures r-package seer

Last synced: 15 Oct 2024

https://github.com/yaricom/english-article-correction

The experiment with applying NLP to correction of definite/indefinite articles in English text corpus

data-analysis glove-vectors nlp nlp-machine-learning numpy pandas scikit-learn umbc-webbase-corpus

Last synced: 05 Nov 2024

https://github.com/iondv/report

IONDV. Framework: Report module is to form the analytical reports.

analytics businessintelligence css data data-analysis data-visualization iondv iondv-module reporting

Last synced: 10 Nov 2024

https://github.com/apache/incubator-devlake-playground

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

dashboard-friendly data data-analysis data-engineering data-integration data-transfers devops domain-layer dora etl hacktoberfest integration jira open-source python user-friendly

Last synced: 07 Oct 2024

https://github.com/duhaime/douglasduhaime.com

Data analysis and visualization

data-analysis data-visualization website

Last synced: 14 Oct 2024

https://github.com/alandefreitas/scistats

High-Performance Descriptive Statistics and Hypothesis Tests in C++20

bayesian-statistics data-analysis descriptive-statistics hypothesis-testing performance-statistics statistics

Last synced: 13 Oct 2024

https://github.com/memoryfraction/LLSDA-Lightning-Location-System-Data-Analyzer

LLSDA is a public benefit project that helps lightning engineers and lightning scientists analyze lightning distribution. 一款跨平台的闪电定位(LLS)数据分析工具软件基础类库,用于对LLS数据进行时空特征分析,帮助雷电相关分析人员(科研人员、学生)提高开发效率、避免重复造轮子。

data-analysis data-visualization lightning-location-system public-benefits

Last synced: 13 Nov 2024

https://github.com/labrijisaad/data-scientist-tools-pandas

In this hands-on training notebook, we'll see all the basics of the Pandas library used for data science/data analysis and machine learning tasks.

data-analysis data-science dataframe-objects machine-learning pandas python series-objects

Last synced: 06 Nov 2024

https://github.com/aditeyabaral/lok-sabha-election-twitter-analysis

Twitter Feeds were analysed during the Lok Sabha Elections 2019 to guage the overall popularities of each party and predict the winner based solely on the tweets made by the population. This was made as a part of our Data Science course (UE18CS203) at PES University.

data-analysis data-science data-visualization elections loksabha nlp prediction probabilistic-graphical-models probability python python3 sentiment-analysis sentiment-classification sentiment-polarity sentiment-scores social-media socialmediaanalytics statistical-analysis statistical-models twitter

Last synced: 16 Nov 2024

https://github.com/hrolive/from-data-to-insights-with-google-cloud-platform

Four-course accelerated online specialization teaches course participants how to derive insights through data analysis and visualization using the Google Cloud Platform

data-analysis data-cleaning data-preparation data-visualization sql

Last synced: 09 Nov 2024

https://github.com/mahshaaban/intro_data_r

A gentle introduction to data analysis in R

data-analysis image-analysis qpcr-analysis r

Last synced: 11 Nov 2024

https://github.com/kingabzpro/annual-recycled-energy-saved-in-singapore

Learn how much Singapore is saving energy per years by recycling plastics, paper, glass, ferrous and non-ferrous metal

cleaning-data data-analysis data-science deepnote energy environment

Last synced: 17 Nov 2024

https://github.com/sayakpaul/analysis-of-college-database-of-2017-passouts

Contains my analysis of a database containing information about the students of an engineering college.

data-analysis data-visualization matplotlib python-3

Last synced: 28 Oct 2024

https://github.com/ogoodness/vbreaker-js

CSC 483 Project - Ciphers: Caeser, Multiplicitive, Affine, Vigenere, Hill, Columnar Transposition

affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher

Last synced: 14 Nov 2024

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and Data Visualization of Countries's GDP, Life Expectancy comparison across continents, GDP per Capita Relative Growth, Population Reative Growth comparison etc using Pandas, Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 15 Nov 2024

https://github.com/mjunaidca/sql-auditor-pro-gpt

Get quick insights from your Data Sources in tables, charts and graphs with with AI-driven audits and optimize your SQL databases.

ai ai-agent auditing-data cloudflared compound-ai-systems custom-gpt data-analysis fastapi-template gpt sql sqlmodel

Last synced: 03 Nov 2024

https://github.com/theengineeringworld/numpy-data-science

NumPy Data Science Essential Traing COurse. Part of Youtube Course Offered by TheEngineeringWorld.

data-analysis data-science numpy numpy-exercises numpy-library numpy-tutorial python python-3-6 python3 scipy2018

Last synced: 08 Nov 2024

https://github.com/i10mm/gpt-arxiv-fetcher

Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!

artificial data-analysis intelligence llm machine-learning

Last synced: 05 Aug 2024

https://github.com/dermatologist/goscar-export

:fire: CSV to FHIR (For OSCAR EMR EForm Export)

data-analysis data-warehouse fhir fhir-r4 fhir-server hacktoberfest oscar-emr

Last synced: 27 Oct 2024

https://github.com/data-engineering-community/data-engineering-meetup-in-a-box

A collection of guides, resources, and support for DE meetup organizers.

data data-analysis data-engineering data-mining data-structures database meetups

Last synced: 05 Nov 2024

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 11 Nov 2024

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 11 Nov 2024

https://github.com/anaagg/data-installations-windows

Are you thinking about using the Linux subsystem on Windows? This is your repo, along this repository you will find the necessary steps to install on your computer what you need to work with Python, SQL, miniconda on Windows using the Linux subsystem.

conda conda-environment data-analysis hyper jupyter jupyter-notebook jupyter-notebooks linux miniconda pycharm pycharm-ide python sql vscode windows workbench

Last synced: 09 Oct 2024

https://github.com/davidzajac1/reptoro

A Data Visualization and Analytics Platform for the Reptile Industry

analytics data-analysis data-visualization plotly-dash python

Last synced: 11 Oct 2024

https://github.com/rikard-helgegren/leverage_analysis_tool

Analyst tool for portfolio construction. How can levereged certificates be used to increase returns in a portfolio while keeping the risk as low as possible. Use the tool and find out.

cpp data-analysis investment kivy-framework python3

Last synced: 14 Oct 2024

https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python

📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.

data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne

Last synced: 25 Oct 2024

https://github.com/tzerk/gammaspec

A collection of functions to analyse gamma spectra

data-analysis gamma-ray-spectrometry r

Last synced: 08 Nov 2024

https://github.com/martinthoma/shell-history-analysis

Analyze how you use your shell

data-analysis python shell

Last synced: 21 Oct 2024

https://github.com/martinthoma/bad-stats

Examples of how not to do statistics / visualizations

data-analysis statistics visualizations

Last synced: 21 Oct 2024