Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-11 00:07:19 UTC
https://github.com/goplus/www
Source code of https://xgo.dev
data-analysis data-science golang goplus language playground script stem-education xgo
Last synced: 15 Dec 2025
https://github.com/mainakrepositor/data-analysis
Different types of data analytics projects: EDA, PDA, DDA, TSA, and much more.
data-analysis data-science deeplearning machine-learning-algorithms neural-networks time-series-analysis tsa
Last synced: 06 Mar 2026
https://github.com/mdeff/python_tour_of_data_science
A Python Tour of Data Science
data-analysis data-science education machine-learning python
Last synced: 12 Jul 2025
https://github.com/cdeweyx/medium-stats-analysis
Exploring data and analyzing metrics for user-specific Medium Stats
data-analysis data-mining data-visualization python
Last synced: 25 Apr 2025
https://github.com/nimblelearn/datapackage-m
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
csv-files data-acquisition data-analysis data-analytics data-package data-transformation data-visualisation data-visualization datapackage excel frictionlessdata json-table-schema open-data power-bi power-query powerbi tabular-data tabular-data-package
Last synced: 30 Jul 2025
https://github.com/180protocol/180protocol
Confidential compute for sensitive data sharing and commercial collaboration
blockchain confidential-computing data data-analysis data-science decentralized-storage distributed dlt enclave filecoin intel-sgx ipfs java kotlin privacy-enhancing-technologies rewards-engine
Last synced: 14 Apr 2025
https://github.com/debiai/debiai
Bias detection and contextual evaluation tool for your AI projects
ai bias contextual-evaluation data-agnostic data-analysis data-exploration machine-learning model-evaluation plotlyjs python visualization vuejs
Last synced: 31 Jul 2025
https://github.com/rshkarin/quanfima
Quanfima (Quantitative Analysis of Fibrous Materials)
data-analysis material-science morphological-analysis volumetric-data
Last synced: 07 May 2025
https://github.com/alessandrocorradini/harvard-data-science-professional
Repository for the Data Science Professional Program from Harvard University on edX
data-analysis data-science datascience edx harvardx machine-learning machinelearning mooc moocs r r-language
Last synced: 13 Jul 2025
https://github.com/ncbi/tree-tool
Incremental building of phylogenetic distance trees
bioinformatics bioinformatics-tool data-analysis distance-measures evolution phylogenetic-trees
Last synced: 31 Jan 2026
https://github.com/lastancientone/data-science
Utilizing Kaggle Data and Real-World Data for Data Science and Prediction in Python, R, Excel, Power BI, and Tableau.
algorithms data-analysis data-science data-visualization datascience deep-learning dimensionality-reduction excel exploratory-data-analysis exploratory-data-visualizations feature-engineering inferential-statistics kaggle kaggle-competiton machine-learning model-tuning powerbi prediction python3 r
Last synced: 09 Apr 2025
https://github.com/ocramz/heidi
heidi : tidy data in Haskell
algebraic-data-types data-analysis data-mining data-science dataframe dataframe-library dataframes generic-programming generics tidy-data
Last synced: 14 Apr 2025
https://github.com/hyperspy/hyperspyui
A user interface for the hyperspy package. https://hyperspy.org/hyperspyUI
data-analysis data-visualization eds eels electron-energy-loss-spectroscoy gui hyperspy life-sciences materials-science multi-dimensional physical-sciences spectroscopy x-ray-spectroscopy
Last synced: 09 Apr 2025
https://github.com/dawievlill/datascience-871
Data science module for economists written mostly in Julia and R
data-analysis data-science machine-learning
Last synced: 27 Feb 2025
https://github.com/open-cogsci/datamatrix
An intuitive, Pythonic way to work with tabular data
analysis data-analysis data-structures python scientific-computing
Last synced: 03 Jun 2026
https://github.com/hugohadfield/bayesfilter
Pure Python/Numpy Bayesian Filtering and Smoothing
data-analysis ekf filtering smoothing ukf
Last synced: 25 Oct 2025
https://github.com/dotbithq/das-account-indexer
Mapping relationship between multi-chain addresses and accounts
data-analysis docker golang nervos server
Last synced: 09 Oct 2025
https://github.com/tsffarias/data-analysis-queries
This repository was carefully created to provide an extensive collection of SQL queries intended to make the work of data analysts easier across many areas of a company, including marketing, logistics, sales, finance, human resources, operations, legal, support, and more.
business-intelligence comercial data-analysis data-insights esg finance-management fraud-prevention human-resources juridico kpis logistics marketing marketing-analytics operacao pricing sql suporte
Last synced: 05 Apr 2025
https://github.com/serkor1/slmetrics
A high-performance R :package: for supervised and unsupervised machine learning evaluation metrics written in 'C++'.
armadillo armadillo-library artificial-intelligence cpp cran cran-r data-analysis data-science eigen3 machine-learning performance-metrics r r-package r-stats rcpp rcpparmadillo rcppeigen statistics supervised-learning
Last synced: 18 Feb 2026
https://github.com/great-northern-diver/loon.ggplot
ggplot to loon
data-analysis ggplot ggplot-features graphics interactive-plots loon visualizations
Last synced: 23 Feb 2026
https://github.com/isisneutronmuon/mdanse
MDANSE: Molecular Dynamics Analysis for Neutron Scattering Experiments
data-analysis molecular-dynamics neutron-scattering python qt-gui science
Last synced: 22 Aug 2025
https://github.com/ActivityWatch/aw-research
Tools to analyse and experiment with ActivityWatch data
activitywatch data-analysis python quantified-self
Last synced: 01 May 2025
https://github.com/activitywatch/aw-research
Tools to analyse and experiment with ActivityWatch data
activitywatch data-analysis python quantified-self
Last synced: 14 Apr 2025
https://github.com/jrbourbeau/pyunfold
Iterative unfolding for Python
data-analysis deconvolution inverse-problems python regularization statistics unfolding
Last synced: 01 May 2025
https://github.com/lumispy/lumispy
Luminescence data analysis with HyperSpy.
data-analysis hyperspy hyperspy-extension multi-dimensional python raman raman-spectroscopy spectroscopy
Last synced: 07 Apr 2025
https://github.com/mkcor/advanced-pandas
Pandas is a powerful tool for data exploration and analysis (including timeseries).
data-analysis data-science labeled-data notebooks python3 teaching-materials
Last synced: 12 Oct 2025
https://github.com/cjroth/chronist
Long-term analysis of emotion, age, and sentiment using Lifeslice and text records.
data data-analysis data-science data-visualization dataset dataviz emotion emotion-analytics es6 javascript matplotlib pandas photoanalysis python sentiment sentiment-analysis
Last synced: 28 Apr 2025
https://github.com/cgivre/data-exploration-with-apache-drill
Data Exploration with Apache Drill
apache-drill data-analysis data-mining data-science
Last synced: 12 Apr 2025
https://github.com/mark-hoffmann/icd
Tools for working with icd codes and comorbidities
Last synced: 02 Apr 2026
https://github.com/anselmoo/spectrafit
📊📈🔬 SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regular expression of distribution functions.
console-application curve-fitting data-analysis data-analysis-python data-science data-visualization fitting juypter-notebook python science science-research scientific-plotting spectral-analysis spectroscopy
Last synced: 25 Nov 2025
https://github.com/fatbobman/objects2xlsx
A powerful, type-safe Swift library for converting Swift objects to Excel (.xlsx) files. Objects2XLSX provides a modern, declarative API for creating professional Excel spreadsheets with full styling support, multiple worksheets, and real-time progress tracking.
business data-analysis dataset excel export-excel reporting spredsheet swift xlsx xlsxwriter
Last synced: 18 Jul 2025
https://github.com/mrankitgupta/python-roadmap
I am sharing Python lessons from scratch to intermediate level, with practice sets, which I studied during my 66DaysofData journey into Data Analytics.
66daysofdata analytics ankitgupta data-analysis data-analysis-python data-analytics data-mining data-science data-structures data-visualization jupyter matplotlib mrankitgupta numpy pandas programming python python-library python3
Last synced: 14 Jul 2025
https://github.com/computationalcore/introduction-to-python
A very useful collection of Jupyter Notebooks, which aims to introduce the Python programming language.
data-analysis data-science fundamental google-colab jupyter-notebook jupyter-notebooks numpy pandas python python-language python-programming python3
Last synced: 24 Apr 2025
https://github.com/theengineeringworld/python-data-science
Python Data Science has all the data sets and Jupyter notebook files for the YouTube course at http://youtube.com/theengineeringworld under the name "Python Data Science Course".
data data-analysis data-mining data-science data-visualization jupyter-notebook jupyter-notebooks machine-learning python python27
Last synced: 17 Nov 2025
https://github.com/marcogdepinto/python-for-data-analysis-and-machine-learning
This repo contains the projects made for the course of Jose Portilla on Udemy.
analysis data-analysis deep-neural-networks exercise ipynb jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn
Last synced: 28 Oct 2025
https://github.com/codingforentrepreneurs/try-pandas
In this series, we're going to learn the fundamentals of the popular Python data science tool called Pandas.
data-analysis data-science deepnote jupyter nba-api nba-stats notebook pandas python python-pandas
Last synced: 18 Jan 2026
https://github.com/jpenuchot/ctbench
Compiler-assisted variable size benchmarking for the study of C++ metaprogram compile times.
benchmark clang compilation data-analysis data-visualization gcc metaprogramming
Last synced: 26 Oct 2025
https://github.com/sciruby/daru-io
daru-io is a plugin gem for the existing daru gem, which aims to add support for importing DataFrames from, and exporting DataFrames to, multiple formats.
daru data-analysis exporter importer parser ruby ruby-gem
Last synced: 12 Mar 2026
https://github.com/probcomp/cgpm
Library of composable generative population models which serve as the modeling and inference backend of BayesDB.
bayesian-inference data-analysis machine-learning probabilistic-programming tabular-data
Last synced: 19 Oct 2025
https://github.com/data-centric-ai-community/nist-crc-2023
NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!
ctgan data-analysis data-science deeplearning deidentification gans generative-adversarial-network machine-learning privacy-enhancing-technologies python synthetic-data synthetic-dataset-generation
Last synced: 23 Apr 2026
https://github.com/dh-center-tuebingen/spacialist
A Virtual Research Environment for the Spatial Humanities
angular-apps customizable data-analysis digital-humanities e-science geographical-information-system research-tool responsive spatial-data web-gis
Last synced: 09 Mar 2026
https://github.com/staircase-dev/piso
Pandas Interval Set Operations: providing methods for set operations, analytics, lookups and joins on pandas' Interval, IntervalArray and IntervalIndex
data-analysis data-science data-structures interval interval-arithmetic interval-set pandas set set-operations set-theory
Last synced: 20 Aug 2025
https://github.com/sondosaabed/paltaqdeer
🇵🇸 PalTaqdeer is an AI-driven student success forecaster. It was developed for the Google Launchpad hackathon with data analysis techniques, a linear regression model, and Flask for the web 🇵🇸
data-analysis hackathon hackathon-project linear-regression matplotlib outliers-detection pandas python student-grades
Last synced: 02 Mar 2026
https://github.com/renkun-ken/r-data-practice
R language data manipulation exercises
data-analysis data-manipulation practice r
Last synced: 29 Oct 2025
https://github.com/ptyadana/data-analysis-for-digital-music-store
Helping a digital music store optimize its business practices using PostgreSQL
chinook chinook-database data-analysis datavisualization pgadmin4 postgresql sql tableau
Last synced: 12 Apr 2025
https://github.com/cataseven/statistics-graph-chart-card
A highly customizable, smooth, and advanced graph card. Shows historical sensor data with dynamic trend colors, statistics (min, max, avg), and more. A great alternative to the default history graph and sensor cards.
analysis analytics bar-chart chart data data-analysis data-science data-visualization graph graphics histogram historical-data history home-assistant statistical-analysis statistics
Last synced: 12 Apr 2026
https://github.com/csbiology/fsharpgephistreamer
F# functions for streaming any kind of graph/network data to the network visualization tool gephi
data-analysis exploratory-data-analysis fsharp gephi graph-visualization streaming-graph-data visualization
Last synced: 30 Jul 2025
https://github.com/k1rsn7/kaggle-solutions
:basecamp: A collection of Kaggle solutions.
computer-vision cv data-analysis data-science deeplearning deeplearning-ai english english-language jupyter jupyter-notebook jupyter-notebooks kaggle kaggle-challenge russian russian-language
Last synced: 28 Feb 2025
https://github.com/afraniomelo/kydlib
Routines for exploratory data analysis.
autocorrelation correlation-coefficient data-analysis data-exploration data-science data-visualization eda exploratory-data-analysis gaussian machine-learning noise nonlinear plotting python scatter-plot statistics time-series time-series-analysis visualization
Last synced: 09 Apr 2025
https://github.com/luizbizzio/grafana-wallpaper
🖥️ A detailed guide on how to set up Grafana and display its dashboards as your desktop wallpaper. This project allows you to transform your data visualizations into an interactive real-time monitoring background, making data always visible.
app automation data-analysis data-visualization exporter grafana grafana-dashboard graph graphs guide homeautomation iot lively-wallpaper metrics monitoring prometheus real-time tutorial wallpaper windows
Last synced: 23 Feb 2026
https://github.com/jatinagrawal0/youtube-comment-sentimental-analysis
YouTube Sentiment Analysis is a web application that analyzes the sentiment of YouTube comments, providing insights into comment sentiment using VADER sentiment analysis and interactive visualizations.
data-analysis data-visualization natural-language-processing plotly python sentiment-analysis streamli streamlit-cloud vader-lexicon youtube-api-v3 youtube-comment-scraper youtube-comments-downloader
Last synced: 14 Apr 2025
https://github.com/mkrd/pathdict
Easily query and modify Python dicts!
data data-analysis data-mining data-science data-structure data-structures datascience dataset dict dictionaries dictionary json modify object python python-list query query-builder science
Last synced: 04 Jul 2025
https://github.com/hoangsonww/north-carolina-household-analysis
🏠 This repository contains data analysis scripts for the 2022 American Community Survey (ACS) focusing on individuals aged 25 and over in North Carolina, based on 75,340 observations. This repository offers valuable insights into demographic and economic patterns across North Carolina's urban areas.
confidence-interval confidence-score data data-analysis data-analytics data-science data-visualization ggplot2 hypothesis-testing hypothesis-tests north-carolina r r-language r-programming stata
Last synced: 11 Apr 2025
https://github.com/mrankitgupta/kaggle-pandas-solved-exercises
I'm sharing my complete solution notebook for the Kaggle Pandas course exercises, which I solved while taking the course.
66daysofdata ankitgupta ankittalks data-analysis data-science data-structures data-visualization datascience kaggle kaggle-notebook kaggle-notebooks mrankitgupta pandas pandas-dataframe pandas-library pandas-python pandas-tutorial python python-library python3
Last synced: 22 Apr 2025
https://github.com/asadiahmad/edit-distance-spark
Calculating Edit Distance with PySpark
data-analysis edit-distance nlp pyspark spark
Last synced: 28 Apr 2026
https://github.com/djangoaddicts/django-pygwalker
Easily add PyGWalker visualizations to your Django applications
data-analysis django pygwalker tableau tableau-alternative visualization
Last synced: 08 Apr 2026
https://github.com/pravj/ospi
Open Source Presence Infographic of Indian Startups
data-analysis data-visualization india open-source startup
Last synced: 13 Apr 2025
https://github.com/ereh11/datelemur-sql-interview-questions
My solutions for #Datalemur SQL Interview Questions
data-analysis datalemur exploratory-data-analysis postgresql sql-query
Last synced: 30 May 2026
https://github.com/ahmedosamamath/statistics-basics
A comprehensive guide to applying statistical techniques in machine learning, including data preprocessing, model development, evaluation metrics, and real-world applications. This repository provides beginner-to-advanced insights into the statistical foundations of machine learning.
artificial-intelligence data-analysis data-science machine-learning statistics
Last synced: 12 Apr 2025
https://github.com/phisanti/mcpr
MCPR enables AI agents to participate in interactive R sessions for professional analysis workflows.
data-analysis mcp mcp-server r
Last synced: 17 May 2026
https://github.com/rileynwong/spotify-analysis
Data analysis on my monthly playlists
audio-features data-analysis data-scraping lyrics machine-learning natural-language-processing nlp nlp-machine-learning sentiment-analysis spotify-analysis supervised-learning supervised-machine-learning text text-analysis
Last synced: 12 Apr 2025
https://github.com/nnthanh101/sentiment-analysis
Voice of the Customer (VoC) to enhance customer experience with serverless architecture and sentiment analysis, using Amazon Kinesis, Amazon Athena, Amazon QuickSight, Amazon Comprehend, and ChatGPT-LLMs for sentiment analysis.
aws-athena aws-comprehend aws-kinesis aws-quicksight cdk data-analysis data-visualization sentiment-analysis voice-of-the-customer
Last synced: 28 Feb 2026
https://github.com/integerman/gitstractor
A library for visualizing the commits, authors, and files of any git repository
code-analysis data-analysis data-visualization dotnet git powerbi repository-management static-code-analysis utilities visualization
Last synced: 14 Jan 2026
https://github.com/PiotrZakrzewski/merge-chance
Source code of https://merge-chance.info
analysis data data-analysis open-source
Last synced: 26 Mar 2025
https://github.com/aromanro/machinelearning
From linear regression towards neural networks...
adagrad adam-optimizer backpropagation data-analysis data-science generalized-linear-models gradient-descent linear-regression logistic-regression machine-learning machine-learning-algorithms multilayer-perceptron-network nadam nesterov-accelerated-sgd nesterov-momentum neural-network rmsprop
Last synced: 16 Mar 2025
https://github.com/unipept/unipept
🌐 Unipept frontend for metaproteomics data analysis
data-analysis data-visualization metaproteomics unipept uniprot
Last synced: 21 Jan 2026
https://github.com/arm61/uravu
A straightforward Bayesian data fitting library
bayesian-inference bayesian-statistics data-analysis fitting markov-chain-monte-carlo nested-sampling
Last synced: 21 Mar 2025
https://github.com/amkrajewski/nimcso
nim Composition Space Optimization is a high-performance tool leveraging metaprogramming to implement several methods for selecting components (data dimensions) in compositional datasets, so as to optimize the data availability and density for applications such as machine learning.
data-analysis data-optimization data-science materials-informatics metaprogramming nim nim-lang
Last synced: 09 Apr 2025
https://github.com/goplus/pandas
Flexible and powerful data analysis / manipulation library for Go+, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
data-analysis data-science data-tech go golang gop goplus pandas scientific-computing
Last synced: 30 Apr 2025
https://github.com/piquette/edgr
A set of tools for dealing with SEC EDGAR corporate filings
api cli-app data-analysis data-mining edgar-database edgar-scraper finance financial-analysis sec-edgar sec-filings
Last synced: 15 May 2025
https://github.com/jasdumas/ttbbeer
An R Dataset Package for US Beer Statistics From TTB :beer:
beer-statistics data-analysis r
Last synced: 05 Mar 2025
https://github.com/asavinov/lambdo
Feature engineering and machine learning: together at last!
data-analysis data-mining data-science feature-engineering forecasting forecasting-models machine-learning time-series
Last synced: 27 Mar 2025
https://github.com/skyzh/meteor
🚆 Fine-grained analysis and visualization of Hangzhou Metro for efficient traveling in metro system. Project report, slide and presentation video included.
cmake data-analysis hangzhou metro qt sqlite visualize
Last synced: 23 Mar 2025
https://github.com/giacbrd/smartpipeline
A framework for rapid development of robust data pipelines following a simple design pattern
data-analysis data-analytics data-mining data-pipelines data-processing data-science dataops design-patterns etl machine-learning mlops pipeline pipeline-framework pipelines reproducibility task-queue workflow
Last synced: 21 Mar 2025
https://github.com/itzmeanjan/chanalyze
A simple WhatsApp Chat Analyzer (for both Private & Group chats), made with :heart:
chat-analysis data-analysis datascience dataviz matplotlib python3 visualization whatsapp whatsapp-chat whatsapp-chat-analyzer
Last synced: 06 Oct 2025
https://github.com/hoangsonww/global-covid19-analysis
🌍 This repository hosts an in-depth analysis of COVID-19's impact across five key countries from Jan 2020 to Dec 2021. Through advanced data analysis and visualization, we aim to provide insights into how the pandemic evolved differently across these nations, shedding light on the effectiveness of various health measures and vaccination campaigns.
covid covid-19 covid19-tracker data data-analysis data-analytics data-science data-visualization ggplot2 julia julia-language python r r-language r-markdown r-programming sas sas-programming stata vaccination
Last synced: 10 Apr 2025
https://github.com/martinthoma/edapy
Exploratory Data Analysis with Python
csv data-analysis data-analytics data-science eda exploratory-data-analysis pandas pdf python python-3 python-3-5
Last synced: 09 Apr 2025
https://github.com/cmudig/texture
Visualize your text data with structured attributes
data-analysis llm text visualization
Last synced: 07 May 2025
https://github.com/gagolews/genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
cluster cluster-analysis clustering data-analysis data-mining data-science datascience genie hierarchical-clustering-algorithm machine-learning machine-learning-algorithms outliers r
Last synced: 14 Jul 2025
https://github.com/skyzh/Meteor
🚆 Fine-grained analysis and visualization of Hangzhou Metro for efficient traveling in metro system. Project report, slide and presentation video included.
cmake data-analysis hangzhou metro qt sqlite visualize
Last synced: 12 Apr 2025
https://github.com/saranshbansal/data-science-with-python
Data science with Python: This repository mostly contains DataCamp data-science courses/exercises that I have completed.
data-analysis data-science datacamp-exercises numpy python
Last synced: 07 Oct 2025
https://github.com/rfordatascience/r4dswebsite
Public repository for the R4DS community website.
blogdown data-analysis data-analytics data-science data-visualization r r4ds tidyverse
Last synced: 11 Apr 2025
https://github.com/sarthakjariwala/python_gui_apps
GLabViz - Interactive Analysis and Visualization Application for Scientific Data written in Python using Qt and pyqtgraph
data-analysis data-visualization databrowser fluorescence fluorescence-decays-analysis fluorescence-microscopy-analysis fluorescence-microscopy-imaging image-analysis lifetime-analysis pyqtgraph python python-gui-apps qt qt-gui spectra-analysis spectral-analysis spectroscopy spectrum-analyzer uv-vis
Last synced: 26 Feb 2026
https://github.com/danvk/march-madness-data
NCAA brackets in JSON form
data-analysis ncaa-basketball sports
Last synced: 03 Mar 2025
https://github.com/rameerez/footprinted
👣 Ruby gem to track geolocated user activity in Rails
analytics data-analysis events gem geolocation ip ip-geolocation monitoring rails ruby ruby-on-rails user user-management user-tracking
Last synced: 09 Feb 2026
https://github.com/mattools/matstats
Statistical Data Analysis Toolbox for Matlab. Provides a Table class similar to R's dataframe, as well as exploratory data analysis tools.
data-analysis data-table matlab matlab-toolbox statistics
Last synced: 21 Jun 2025
https://github.com/gher-uliege/divapythontools
Interface to run Diva software tool (spatial interpolation).
data-analysis diva finite-elements interpolation-methods leaflet-map matplotlib ocean-sciences oceanography python variational-method
Last synced: 16 Aug 2025
https://github.com/learning-zone/d3js-chart-basics
D3.js Chart Basics ( v7.6.x )
d3-interview-questions d3-visualization d3js data-analysis data-visualization dataset force-layout scale svg transition
Last synced: 11 Sep 2025
https://github.com/mr-easy/badminton-stroke-classification
Classifying badminton strokes based on accelerometer and gyroscope data from sensors attached to the player's wrist. An end-to-end Machine Learning project, from data collection and preprocessing to final model evaluation.
badminton-stroke-classification data-analysis data-analytics data-science deep-learning machine-learning model-evaluation notebook project time-series-analysis tutorial
Last synced: 31 Aug 2025
https://github.com/PySloth/pysloth
A Python Package for Probabilistic Prediction
data-analysis data-science machine-learning python statistics
Last synced: 11 May 2025
https://github.com/ancatmara/data-science-nlp
NLP Section of the Data Science course, NRU HSE
classification clustering data-analysis data-science dimensionality-reduction embeddings fnn language-models morphological-analysis natural-language-processing nlp python regex russian-nlp syntactic-parsing topic-modelling tutorials
Last synced: 11 Jul 2025
https://github.com/nasa/ziggy
Ziggy, a portable, scalable infrastructure for science data processing pipelines, is the child of the Transiting Exoplanet Survey Satellite (TESS) pipeline and the grandchild of the Kepler Pipeline.
algorithm analysis arc data data-analysis data-reduction java k2 kepler linux macos nasa open-source pipeline science tess ziggy
Last synced: 09 May 2026
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 19 Apr 2025
https://github.com/shdev/phpflashtext
Extract keywords from sentences or replace keywords in sentences. @ https://github.com/vi3k6i5/flashtext
data-analysis data-extraction flashtext keyword-extraction nlp php search-in-text string-manipulation string-matching word2vec
Last synced: 12 Jan 2026
https://github.com/cengel/R-data-wrangling
Materials for my R data workshop. https://cengel.github.io/R-data-wrangling/
data-analysis data-workshop datascience material r rstats social-sciences teaching tidyverse workshop
Last synced: 06 May 2025
https://github.com/collaborative-ai/colda
Collaborative Data Analysis for All
assisted-learning collaborative-machine-learning data-analysis deep-learning distributed-machine-learning
Last synced: 05 May 2025