Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/jmwoloso/pychattr

Python Channel Attribution (pychattr) - A Python implementation of the excellent R ChannelAttribution library

channel-attribution data-analysis data-science machine-learning python python-channel-attribution rpy2 wrapper

Last synced: 13 May 2024

https://github.com/openbiox/awosome-bioinformatics

A curated list of resources for learning bioinformatics.

bioinformatics data-analysis next-generation-sequencing

Last synced: 12 May 2024

https://github.com/404notf0und/FXY

Security-Scenes-Feature-Engineering-Toolkit, Continuous Integration.一款安全数据特征化工具

data-analysis data-mining feature-engineering machine-learning security security-scenes

Last synced: 12 May 2024

https://github.com/ayush1997/visualize_ML

Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.

data-analysis machine-learning matplotlib python statisics visualization

Last synced: 12 May 2024

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 11 May 2024

https://github.com/anthonydb/practical-sql

Code and Data for the First Edition of "Practical SQL" by Anthony DeBarros, published by No Starch Press (2018).

data-analysis postgresql sql

Last synced: 11 May 2024

https://github.com/gchq/CyberChef

The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis

compression data-analysis data-manipulation encoding encryption hashing parsing

Last synced: 10 May 2024

https://github.com/1994nikunj/nlp-toolkit-desktop-app

The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.

data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator

Last synced: 10 May 2024

https://github.com/airbytehq/airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

bigquery change-data-capture data data-analysis data-collection data-engineering data-integration data-pipeline elt etl java mssql mysql pipeline postgresql python redshift s3 self-hosted snowflake

Last synced: 09 May 2024

https://github.com/SciTools/iris

A powerful, format-agnostic, and community-driven Python package for analysing and visualising Earth science data

data-analysis earth-science grib iris meteorology netcdf oceanography python spaceweather visualisation

Last synced: 09 May 2024

https://github.com/NCAS-CMS/cf-python

A CF-compliant Earth Science data analysis library

cf cfdm cfunits data-analysis earth-science metadata netcdf pp python um

Last synced: 09 May 2024

https://github.com/ZijieZhaoMMHW/m_mhw1.0

A MATLAB toolbox to detect and analyze marine heatwaves (MHWs).

climate-science data-analysis heatwaves marine-heatwaves matlab

Last synced: 09 May 2024

https://github.com/VUKOZ-OEL/3d-forest

Visualization, processing and analysis of Lidar point clouds, mainly focused on forest environment. New version of 3D Forest. Process files with terabytes of data. Edit new point attributes. Simple addition of new features by plugins.

3d classification cpp cross-platform data-analysis desktop-application editor forest gui interactive-visualization las laser-scanning lidar opengl plugins point-cloud qt scientific-computing segmentation tree

Last synced: 08 May 2024

https://github.com/jepegit/cellpy

extract and tweak data from electrochemical tests of cells

battery chemistry data-analysis electrochemistry opensource physics

Last synced: 07 May 2024

https://github.com/rshkarin/quanfima

Quanfima (Quantitative Analysis of Fibrous Materials)

data-analysis material-science morphological-analysis volumetric-data

Last synced: 07 May 2024

https://github.com/MouseLand/suite2p

cell detection in calcium imaging recordings

data-analysis imaging neuroscience

Last synced: 07 May 2024

https://github.com/scikit-learn/scikit-learn

scikit-learn: machine learning in Python

data-analysis data-science machine-learning python statistics

Last synced: 05 May 2024

https://github.com/pandas-dev/pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

alignment data-analysis data-science flexible pandas python

Last synced: 05 May 2024

https://github.com/qinwf/awesome-R

A curated list of awesome R packages, frameworks and software.

awesome awesome-list data-analysis data-science list r rstats

Last synced: 05 May 2024

https://github.com/gonum/gonum

Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more

data-analysis go golang graph matrix scientific-computing statistics

Last synced: 05 May 2024

https://github.com/hbuschme/TextGridTools

Read, write, and manipulate Praat TextGrid files with Python

annotation data-analysis elan linguistics praat python textgrid

Last synced: 02 May 2024

https://github.com/dfm/corner.py

Make some beautiful corner plots

data-analysis data-visualization plotting python

Last synced: 02 May 2024

https://github.com/PatMartin/Dex

Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.

d3 d3js data-analysis data-mining data-science data-visualization datavis datavisualization dataviz groovy java javafx visualization

Last synced: 02 May 2024

https://github.com/thospfuller/rcoboldi

R COBOL DI (Data Integration) Package : Import COBOL CopyBook data files directly into R as properly structured data frames.

analytics cobol cobrix copybook data-analysis data-frames data-mining data-science dataanalysis dataframe docker ebcdic java jrecord microfocus r rjava rocker rproject statistics

Last synced: 02 May 2024

https://github.com/paezha/spatial-analysis-r

Open Educational Resource for teaching spatial data analysis and statistics with R

data-analysis open-educational-resource r r-package r-spatial rstats spatial-data-analysis spatial-statistics statistics

Last synced: 01 May 2024

https://github.com/gagolews/genie

Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)

cluster cluster-analysis clustering data-analysis data-mining data-science datascience genie hierarchical-clustering-algorithm machine-learning machine-learning-algorithms outliers r

Last synced: 01 May 2024

https://github.com/SciRuby/daru-view

daru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.

charts daru daru-view data-analysis data-visualization graphs iruby-notebook nanoc plot-library rails ruby sinatra

Last synced: 01 May 2024

https://github.com/yoshoku/rumale

Rumale is a machine learning library in Ruby

artificial-intelligence data-analysis data-science machine-learning ml ruby rubyml

Last synced: 01 May 2024

https://github.com/nysportsfan/Gun-Violence-in-the-US

This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.

data-analysis data-science data-visualization machine-learning python3 statistics

Last synced: 29 Apr 2024

https://github.com/gradio-app/gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

data-analysis data-science data-visualization deep-learning deploy gradio gradio-interface hacktoberfest interface machine-learning models python python-notebook ui ui-components

Last synced: 28 Apr 2024

https://github.com/DAGWorks-Inc/hamilton

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

dag data-analysis data-engineering data-science dataframe etl etl-framework etl-pipeline feature-engineering featurization hacktoberfest lineage llmops machine-learning mlops numpy orchestration pandas python software-engineering

Last synced: 28 Apr 2024

https://github.com/h2oai/datatable

A Python package for manipulating 2-dimensional tabular data structures

data-analysis data-structure ftrl performance python

Last synced: 28 Apr 2024

https://github.com/ResidentMario/missingno

Missing data visualization module for Python.

data-analysis data-visualization missing-data pandas python

Last synced: 28 Apr 2024

https://github.com/scikit-learn-contrib/imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

data-analysis data-science machine-learning python statistics

Last synced: 28 Apr 2024

https://github.com/aksnzhy/xlearn

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

data-analysis data-science factorization-machines ffm fm machine-learning statistics

Last synced: 28 Apr 2024

https://github.com/akfamily/akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

academic akshare asset-pricing bond currency data data-analysis data-science datasets economic-data economics finance finance-api financial-data fundamental futures option quant stock

Last synced: 28 Apr 2024

https://github.com/has2k1/plotnine

A Grammar of Graphics for Python

data-analysis grammar graphics plotting python

Last synced: 28 Apr 2024

https://github.com/alejandrodumas/kodiak

Enhance your feature engineering workflow with Kodiak

data-analysis pandas

Last synced: 27 Apr 2024

https://github.com/AurelienAubry/Spotlight

Spotlight is a Spotify dashboard that allows user to visualize his listening habits.

backend bootstrap chartjs data data-analysis data-science data-visualization flask frontend javascript js pandas python python3 react react-bootstrap spotify

Last synced: 27 Apr 2024

https://github.com/matthewgrosman/messenger-analytics

Project that ingests Facebook Messenger conversations and generates analytics.

analytics data-analysis excel facebook facebook-messenger java mongodb

Last synced: 27 Apr 2024

https://github.com/DistrictDataLabs/cultivar

Multidimensional data explorer and visualization tool.

data-analysis data-exploration data-management visualization

Last synced: 27 Apr 2024

https://github.com/charliezcr/Kpop-Data-Analysis

Data analysis about K-pop industry, artists, and companies. Visualized business performances of public K-pop companies and analyzed artist management and international marketing strategies

data-analysis data-visualization kpop pandas python

Last synced: 26 Apr 2024

https://github.com/d4t4x/data-selfie

Data Selfie - a browser extension to track yourself on Facebook and analyze your data.

chrome-extension data-analysis data-dashboard firefox-addon privacy

Last synced: 26 Apr 2024

https://github.com/mito-ds/mito

The mitosheet package, trymito.io, and other public Mito code.

data data-analysis data-science data-visualization jupyter pandas python streamlit-component

Last synced: 25 Apr 2024

https://github.com/moosetechnology/Moose

MOOSE - Platform for software and data analysis.

data-analysis moose pharo smalltalk software-analysis

Last synced: 23 Apr 2024

https://github.com/visualpython/visualpython

GUI-based Python code generator for data science, extension to Jupyter Lab, Jupyter Notebook and Google Colab.

bigdata chrome-extension code-generator data-analysis jupyter-lab-extension jupyter-notebook-extension jupyterlab-extension pandas python visual-coding

Last synced: 22 Apr 2024

https://github.com/deanmarchiori/analysis-flow

Data Analysis Workflows & Reproducibility Learning Resources

data-analysis reproducibility reproducible-data-science reproducible-science tooling workflow

Last synced: 22 Apr 2024

https://github.com/nshiab/simple-data-analysis

Easy-to-use and high-performance JavaScript library for data analysis.

data data-analysis data-science duckdb javascript nodejs typescript

Last synced: 20 Apr 2024

https://github.com/javascriptdata/danfojs

Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.

danfojs data-analysis data-analytics data-manipulation data-science dataframe javascript pandas plotting-charts stream-data stream-processing table tensorflow tensors

Last synced: 20 Apr 2024

https://github.com/njanakiev/folderstats

Python module that collects detailed statistics from a folder structure

data-analysis filesystem pandas python statistics

Last synced: 19 Apr 2024

https://github.com/root-project/root

The official repository for ROOT: analyzing, storing and visualizing big data, scientifically

c-plus-plus cling data-analysis geometry graphics hacktoberfest interpreter machine-learning mathematics parallel physics python root root-cern statistics visualization

Last synced: 18 Apr 2024

https://github.com/GoogleCloudPlatform/data-science-on-gcp

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

cloud-computing data-analysis data-engineering data-pipeline data-processing data-science data-visualization machine-learning

Last synced: 17 Apr 2024

https://github.com/SuperCowPowers/zat

Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark

bro data-analysis kafka networking pandas python scikit-learn security spark zeek zeek-analysis

Last synced: 17 Apr 2024