Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
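
As a rough illustration of the four steps named above (inspecting, cleansing, transforming, and modeling), here is a minimal sketch using only the Python standard library. The sample records and the function names are hypothetical, invented for this example, and the "model" is just a per-group mean. It is not taken from any project listed below.

```python
from statistics import mean

# Hypothetical raw records: (city, temperature reading as text)
raw_rows = [
    ("Oslo", "4.5"),
    ("Oslo", "5.5"),
    ("Lima", "19.0"),
    ("Lima", ""),  # missing reading
    ("Lima", "21.0"),
]


def inspect(rows):
    """Inspect: count rows whose reading is missing or blank."""
    return sum(1 for _, value in rows if not value.strip())


def cleanse(rows):
    """Cleanse: drop rows whose reading is missing or blank."""
    return [(city, value) for city, value in rows if value.strip()]


def transform(rows):
    """Transform: convert readings to float and group them by city."""
    grouped = {}
    for city, value in rows:
        grouped.setdefault(city, []).append(float(value))
    return grouped


def model(grouped):
    """Model: summarize each city by its mean reading (a deliberately simple model)."""
    return {city: mean(values) for city, values in grouped.items()}


missing = inspect(raw_rows)
summary = model(transform(cleanse(raw_rows)))
print(missing, summary)
```

Real projects usually replace each step with something richer, such as a dataframe library for cleansing and a statistical or machine learning model for the final step, but the overall shape of the pipeline tends to stay the same.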

https://github.com/deshima-dev/decode

:zap: DESHIMA code for data analysis

astronomy data-analysis deshima python spectroscopy submillimeter

Last synced: 05 Nov 2024

https://github.com/abubakkar32/machine-learning-practice

This GitHub repository is a valuable resource for machine learning and Python enthusiasts. It includes a wide range of projects and tools, covering topics like Data Visualization, Data Analysis, ML, DL, Automation, NLP, Web Scraping, and more. Contributors are welcome to join and learn together in this supportive community. Happy coding!

automation big-data chatbot data-analysis data-visualization datastractures machine-learning mathplotlib nlp-machine-learning numpy pandas plotly pypdf2-lib pytest python3 seaborn selenium-webdriver skit-learn webscraping

Last synced: 28 Oct 2024

https://github.com/dr-montasir/mnjs

MATH NODE JS (MNJS): A tiny math library for Node.js and JavaScript in the browser

data-analysis data-science javascript js jsdelivr library math nextjs npm react svelte sveltekit ts typescript yarn

Last synced: 11 Nov 2024

https://github.com/abmenzel/messenger-wrapped

Explore your Facebook Messenger chat history, in a manner inspired by Spotify Wrapped

data-analysis data-visualization facebook meta nextjs reactjs spotify

Last synced: 07 Aug 2024

https://github.com/stappit/blog

I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.

bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics

Last synced: 25 Oct 2024

https://github.com/acs/ghtorrent

Experimental GitHub project analysis based on GHTorrent

data-analysis data-visualization ghtorrent github visualization

Last synced: 27 Oct 2024

https://github.com/drkostas/eestech-bigdata-challenge

EESTech Challenge is a brand new competition organized by EESTEC that aims to create opportunities for European students to gain knowledge in the field of EECS and develop a professional network. The technological topic of the 2017-2018 competition was Big Data. This is the code I submitted with my team (BFS), which consisted of 3 members in total.

apache-spark data-analysis data-visualization eestec machine-learning matplotlib python

Last synced: 10 Nov 2024

https://github.com/coalio/Assistant

A data science library providing flexible dataframes for Lua 5.1+

data-analysis data-science data-structures dataframe lua

Last synced: 07 Nov 2024

https://github.com/sanvishal/Exoplanet-Explore

An interactive data visualization of exoplanets

animation d3js data-analysis data-science exoplanet python space visualization

Last synced: 08 Nov 2024

https://github.com/cosmoduende/r-ggsoccer

StrangeR things: Visualizing Soccer Data with R… on a Soccer Pitch? How to analyze, visualize and report soccer data and strategies on a soccer pitch with the "ggsoccer" package

data-analysis data-analysis-in-r data-analytics ggsoccer package packages r-language r-package r-programming r-programming-projects r-studio soccer soccer-analytics soccer-data soccer-game soccer-matches soccer-simulation

Last synced: 07 Nov 2024

https://github.com/cosmoduende/r-spotify-history-analysis

Explore your activity on Spotify with R and "spotifyr": How to analyze and visualize your streaming history and music tastes. Find out how and how much you consume from Spotify, using a copy of your personal data and the "spotifyr" package

analisis-de-data analytics data-analysis data-analysis-r data-analytics data-visualization r-language r-programming sentiment-analysis spotify-analysis spotify-api spotify-connect spotify-data spotify-playlist spotify-streaming-history spotify-web-api spotifyr streaming-history visualizacion-de-datos visualizaciones

Last synced: 07 Nov 2024

https://github.com/nico-curti/data-analysis

Data Analysis Utilities (from Statistics to Machine Learning)

data-analysis deep-learning-algorithms linear-algebra machine-learning-algorithms statistics

Last synced: 07 Nov 2024

https://github.com/kaustubhgupta/blogathon-analysis

Analytics Vidhya Blogathon Data Analysis: Python Data Extraction with PowerBI dashboard

data-analysis data-mining data-scraping excel pandas powerbi powerbi-report project python tqdm

Last synced: 14 Oct 2024

https://github.com/ztjhz/sc1015-project

Predict the success of an anime using data science and machine learning (regression + classification)

anime data-analysis data-science machine-learning

Last synced: 28 Oct 2024

https://github.com/ayaanhossain/sharedb

An on-disk pythonic embedded key-value store based on LMDB for compressed data storage and distributed data analysis

data-analysis data-storage distributed embedded-database key-value lmb msgpack multiprocessing parallel python python-3 python-library python-script python2 python3 store

Last synced: 15 Oct 2024

https://github.com/cusyio/datenanalyse-in-python

Course on the automated preparation, summarization, and charting of tabular data with Python.

data-analysis pandas-python pandas-tutorial python

Last synced: 10 Nov 2024

https://github.com/shubham18024/census_analysis

This repository contains code and resources for a summer research project focused on statistical analysis of census data. The project aims to analyze demographic trends, population distributions, and other relevant metrics derived from census datasets.

census-data csv-files data-analysis data-visualization jupyter-notebook mitosheet python report statistics

Last synced: 08 Nov 2024

https://github.com/danieldacosta/airbnb-analysis

Data analysis of AirBnb website history in the city of Rio de Janeiro

airbnb-analysis airbnb-website-history data-analysis

Last synced: 12 Nov 2024

https://github.com/mahshaaban/intro_data_r

A gentle introduction to data analysis in R

data-analysis image-analysis qpcr-analysis r

Last synced: 11 Nov 2024

https://github.com/koldlight/r4ds

R for data science course

course data-analysis data-science data-viz r

Last synced: 09 Nov 2024

https://github.com/pjagielski/worldcup

2018 World Cup match data analysis

clojure data-analysis repl world-cup-2018

Last synced: 05 Nov 2024

https://github.com/lpsm-dev/twitter-sentimental-analysis-covid

✔️ Twitter Sentimental Analysis Covid-19 (using Textblob - Naive bayes) + Python Backend Flask + Docker + Docker Compose + MongoDB

adminmongo alpine backend corona coronavirus-analysis covid covid-19 data-analysis docker docker-compose docker-compose-wait flask flask-restful mongodb mongoku python python-api sentiment-analysis twitter

Last synced: 09 Nov 2024

https://github.com/vishnu-t-r/sql_functions_reference

This repository contains intermediate to complex SQL queries that explain SQL concepts. It can be helpful as a reference when writing queries that involve complex concepts. (Most queries include DDL and DML commands for practice.)

complex-sql data-analysis data-mining sql sql-query

Last synced: 10 Nov 2024

https://github.com/amrrs/iq18_workshop

Data Science in Python - Workshop Data and Notebook

data-analysis ipl python

Last synced: 15 Nov 2024

https://github.com/pseudomanifold/enchiridion-tda

An enchiridion for instructing mortals in the hidden arts of topological data analysis

data-analysis graph-kernels persistent-homology topology

Last synced: 06 Nov 2024

https://github.com/iondv/report

IONDV. Framework: the Report module generates analytical reports.

analytics businessintelligence css data data-analysis data-visualization iondv iondv-module reporting

Last synced: 10 Nov 2024

https://github.com/mribeirodantas/vidente

R package to parse and preprocess the Surveillance, Epidemiology, and End Results (SEER) Program data from NIH/NCI

cancer-patients cancer-research data-analysis data-science data-structures r-package seer

Last synced: 15 Oct 2024

https://github.com/memoryfraction/LLSDA-Lightning-Location-System-Data-Analyzer

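LLSDA is a public benefit project that helps lightning engineers and lightning scientists analyze lightning distribution. It is a cross-platform base class library for lightning location system (LLS) data analysis tools, used to analyze the spatiotemporal characteristics of LLS data; it helps lightning analysts (researchers and students) develop more efficiently and avoid reinventing the wheel.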

data-analysis data-visualization lightning-location-system public-benefits

Last synced: 13 Nov 2024

https://github.com/yaricom/english-article-correction

An experiment in applying NLP to correct definite/indefinite articles in an English text corpus

data-analysis glove-vectors nlp nlp-machine-learning numpy pandas scikit-learn umbc-webbase-corpus

Last synced: 05 Nov 2024

https://github.com/labrijisaad/data-scientist-tools-pandas

In this hands-on training notebook, we'll see all the basics of the Pandas library used for data science/data analysis and machine learning tasks.

data-analysis data-science dataframe-objects machine-learning pandas python series-objects

Last synced: 06 Nov 2024

https://github.com/apache/incubator-devlake-playground

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

dashboard-friendly data data-analysis data-engineering data-integration data-transfers devops domain-layer dora etl hacktoberfest integration jira open-source python user-friendly

Last synced: 07 Oct 2024

https://github.com/hrolive/from-data-to-insights-with-google-cloud-platform

A four-course accelerated online specialization that teaches participants how to derive insights through data analysis and visualization using the Google Cloud Platform

data-analysis data-cleaning data-preparation data-visualization sql

Last synced: 09 Nov 2024

https://github.com/alandefreitas/scistats

High-Performance Descriptive Statistics and Hypothesis Tests in C++20

bayesian-statistics data-analysis descriptive-statistics hypothesis-testing performance-statistics statistics

Last synced: 13 Oct 2024

https://github.com/duhaime/douglasduhaime.com

Data analysis and visualization

data-analysis data-visualization website

Last synced: 14 Oct 2024

https://github.com/zvtvz/zvdata

an extendable library for recording and analyzing data

analyzing-data dash data-analysis pandas plotly-dash sqlalchemy

Last synced: 12 Nov 2024

https://github.com/sukanyabag/statistical-analysis-of-my-medium-articles

This repository contains an exploratory data analysis of my writer data at Medium. I use it to carry out data analysis once every 4 months to see audience and fan growth, and the topics they love! You can check out the articles here 👇

data-analysis data-storytelling matplotlib pandas seaborn statistical-analysis sweetviz web-scraping

Last synced: 10 Nov 2024

https://github.com/sayakpaul/analysis-of-college-database-of-2017-passouts

Contains my analysis of a database containing information about the students of an engineering college.

data-analysis data-visualization matplotlib python-3

Last synced: 28 Oct 2024

https://github.com/mjunaidca/sql-auditor-pro-gpt

Get quick insights from your data sources in tables, charts, and graphs with AI-driven audits, and optimize your SQL databases.

ai ai-agent auditing-data cloudflared compound-ai-systems custom-gpt data-analysis fastapi-template gpt sql sqlmodel

Last synced: 03 Nov 2024

https://github.com/vuthanhhai2302/apply-machine-learning-on-data-analytics

My project of applied machine learning on data analytics, using pandas, numpy and scikit-learn to analyze data

data-analysis numpy pandas scikit-learn

Last synced: 11 Nov 2024

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and data visualization of countries' GDP, life expectancy comparison across continents, GDP per capita relative growth, population relative growth comparison, etc., using Pandas and Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 15 Nov 2024

https://github.com/theengineeringworld/numpy-data-science

NumPy Data Science Essential Training Course. Part of the YouTube course offered by TheEngineeringWorld.

data-analysis data-science numpy numpy-exercises numpy-library numpy-tutorial python python-3-6 python3 scipy2018

Last synced: 08 Nov 2024

https://github.com/i10mm/gpt-arxiv-fetcher

Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!

artificial data-analysis intelligence llm machine-learning

Last synced: 05 Aug 2024

https://github.com/franpog859/top-of-the-world

🌍🔝 Proof that your country is the top of the world using GeoTIFF images and a little bit of geometry. Data mining project

data data-analysis data-mining elevation geometry geotiff image-processing matplotlib nvector rasterio

Last synced: 12 Nov 2024

https://github.com/data-engineering-community/data-engineering-meetup-in-a-box

A collection of guides, resources, and support for DE meetup organizers.

data data-analysis data-engineering data-mining data-structures database meetups

Last synced: 05 Nov 2024

https://github.com/ahammadmejbah/different-types-of-data-splitting-methods

In order to prevent overfitting and guarantee that our model can generalize to new data, data splitting is essential in machine learning.

data data-analysis data-engineering data-mining data-science data-visualization

Last synced: 11 Nov 2024

https://github.com/anaagg/data-installations-windows

Are you thinking about using the Linux subsystem on Windows? This is your repo. In this repository you will find the steps needed to install what you need to work with Python, SQL, and miniconda on Windows using the Linux subsystem.

conda conda-environment data-analysis hyper jupyter jupyter-notebook jupyter-notebooks linux miniconda pycharm pycharm-ide python sql vscode windows workbench

Last synced: 09 Oct 2024

https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python

📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.

data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne

Last synced: 25 Oct 2024

https://github.com/rikard-helgegren/leverage_analysis_tool

Analyst tool for portfolio construction. How can leveraged certificates be used to increase returns in a portfolio while keeping the risk as low as possible? Use the tool and find out.

cpp data-analysis investment kivy-framework python3

Last synced: 14 Oct 2024

https://github.com/dermatologist/goscar-export

:fire: CSV to FHIR (For OSCAR EMR EForm Export)

data-analysis data-warehouse fhir fhir-r4 fhir-server hacktoberfest oscar-emr

Last synced: 27 Oct 2024

https://github.com/martinthoma/shell-history-analysis

Analyze how you use your shell

data-analysis python shell

Last synced: 21 Oct 2024

https://github.com/davidzajac1/reptoro

A Data Visualization and Analytics Platform for the Reptile Industry

analytics data-analysis data-visualization plotly-dash python

Last synced: 11 Oct 2024

https://github.com/martinthoma/bad-stats

Examples of how not to do statistics / visualizations

data-analysis statistics visualizations

Last synced: 21 Oct 2024

https://github.com/tzerk/gammaspec

A collection of functions to analyse gamma spectra

data-analysis gamma-ray-spectrometry r

Last synced: 08 Nov 2024

https://github.com/orkunaktas/sofascore-webscraping

⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️

beautifulsoup data-analysis football-analytics football-data selenium webscraping

Last synced: 11 Oct 2024

https://github.com/alexeyev/hse-spb-bigdata-python-fall2016

Materials for a course on programming and data analysis tools, taught at the St. Petersburg campus of HSE University (NRU HSE) in fall 2016

course-materials data-analysis numpy pandas python scikit-learn sklearn

Last synced: 11 Nov 2024

https://github.com/irfanchahyadi/ml-notes

Complete personal notes for performing data analysis, preprocessing, and training ML models.

data-analysis machine-learning plotting python

Last synced: 07 Nov 2024

https://github.com/waveform80/structa

A small utility for analyzing data structures (e.g. JSON files)

csv data-analysis data-visualization datajournalism datawrangling json yaml

Last synced: 11 Oct 2024

https://github.com/sondosaabed/introduction-to-data-analysis-with-pandas-and-numpy

Learning the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. Working with data in Python using libraries like NumPy and pandas.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 06 Nov 2024

https://github.com/ynikitenko/lena

Lena is an architectural framework for data analysis

analysis-framework analysis-pipeline data-analysis data-science

Last synced: 11 Nov 2024

https://github.com/femtotrader/dukascopyticksreader.jl

A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/

data data-analysis dataset dukascopy html julia stock-data

Last synced: 12 Oct 2024

https://github.com/palewire/baseball-notebooks

Python notebooks exploring Major League Baseball data

baseball baseball-statistics data-analysis jupyter-notebook pandas python

Last synced: 18 Oct 2024

https://github.com/accurat/react-dataviz

⚛📊🚀 React components to build powerful interactive data visualizations

d3 data-analysis data-visualization react react-components

Last synced: 06 Nov 2024

https://github.com/ogoodness/vbreaker-js

CSC 483 Project - Ciphers: Caesar, Multiplicative, Affine, Vigenere, Hill, Columnar Transposition

affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher

Last synced: 14 Nov 2024

https://github.com/alyssonmach/9-data-science-apps-with-python

[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.

data-analysis data-science data-visualization machine-learning streamlit-webapp

Last synced: 06 Nov 2024

https://github.com/srinivasrm/mutual-funds-analysis-and-prediction

In this project I analyzed and predicted 1-, 3-, and 5-year returns for 1064 mutual funds in India. I scraped the data from the most visited website for mutual fund investments. I tested several regression models: linear regression, SGD Regressor, Random Forest Regressor, Decision Tree Regressor, Ridge, MLP Regressor, and Lasso. I then selected the best-performing model, tuned its hyperparameters, and deployed an interactive application that generates the visualization and emails it to the user's email address.

beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication

Last synced: 11 Oct 2024

https://github.com/dcs-training/datavisualisationwithr

Data Visualisation with R Workshop (delivered by the Centre in December 2020). This workshop focuses on visualising your data. Go to the readme file

data-analysis data-visualisation data-wrangling r

Last synced: 10 Nov 2024

https://github.com/ilyasmoutawwakil/fcc-data-analysis-with-python

My 5 Data Analysis projects that I've built as part of my freeCodeCamp assignment.

certification course-project data-analysis fcc fcc-assignment fcc-certification fcc-data python

Last synced: 15 Oct 2024

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 11 Nov 2024

https://github.com/maksimekin/umd_data_challange_2020

Ocean cleanup data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.

cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd

Last synced: 28 Oct 2024