Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/okld/streamlit-pandas-profiling
Pandas profiling component for Streamlit.
data-science demo pandas pandas-profiling python streamlit streamlit-component streamlit-pandas-profiling
Last synced: 31 Jul 2024
https://github.com/gdsbook/book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
data-analysis-python data-science geographic-data geographical-information-system spatial-analysis spatial-data-analysis spatial-statistics statistics
Last synced: 31 Jul 2024
https://github.com/upgini/upgini
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
automated-feature-engineering automl automl-pipeline chatgpt data-enrichment data-science feature-engineering feature-extraction feature-selection features kaggle kaggle-solution large-language-models llm machine-learning open-data open-datasets public-data python-library scikit-learn
Last synced: 01 Aug 2024
https://github.com/weecology/retriever
Quickly download, clean up, and install public datasets into a database management system
data data-retrieval data-science dataset datasets hacktobefest python
Last synced: 01 Aug 2024
https://github.com/mikekeith52/scalecast
The practitioner's forecasting library
auto-ml data-science deep-learning easy-to-use forecasting keras lstm machine-learning mase msis pandas python recurrent-neural-networks scikit-learn scikit-learn-python smape time-series vecm
Last synced: 01 Aug 2024
https://github.com/Technion-Kishony-lab/quibbler
Your data - interactive!
data-analysis data-science data-visualization declarative graphics gui interactive jupyter matplotlib python widgets
Last synced: 31 Jul 2024
https://github.com/tirthajyoti/pydbgen
Random dataframe and database table generator
data-generation data-science database fake-data generator pandas-dataframe python random-generation sqlite sqlite3 synthetic-data synthetic-dataset-generation
Last synced: 31 Jul 2024
https://github.com/ml-tooling/ml-hub
🧰 Multi-user development platform for machine learning teams. Simple to setup within minutes.
data-science docker jupyter jupyterhub machine-learning python
Last synced: 01 Aug 2024
https://github.com/alibaba/feathub
FeatHub - A stream-batch unified feature store for real-time machine learning
apache-flink data data-engineering data-quality data-science feature-engineering feature-store machine-learning mlops streaming
Last synced: 01 Aug 2024
https://github.com/tommyod/Efficient-Apriori
An efficient Python implementation of the Apriori algorithm.
apriori-algorithm association-rules data-mining data-science machinelearning
Last synced: 31 Jul 2024
https://github.com/iterative/terraform-provider-iterative
☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes
aws azure cloud cloud-computing cloud-infrastructure cloud-orchestration cloud-storage cml data-science developer-tools gcp gpu hacktoberfest k8s machine-learning mlops terraform terraform-provider terraform-provider-iterative tpi
Last synced: 01 Aug 2024
https://github.com/merantix-momentum/squirrel-core
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:
ai cloud-computing collaboration computer-vision cv data-ingestion data-mesh data-science dataops datasets deep-learning distributed jax machine-learning ml natural-language-processing nlp python pytorch tensorflow
Last synced: 01 Aug 2024
https://github.com/zero-one-group/geni
A Clojure dataframe library that runs on Spark
big-data clojure clojure-library clojure-repl data-engineering data-science dataframe distributed-computing high-performance-computing machine-learning parallel-computing spark
Last synced: 31 Jul 2024
https://github.com/Mybridge/python-articles
Monthly Series - Top 10 Python Articles
data-science data-visualization django flask python python3
Last synced: 31 Jul 2024
https://github.com/Giorgi/DuckDB.NET
Bindings and ADO.NET Provider for DuckDB
ado-net data-science duckdb duckdb-database hacktoberfest hacktoberfest2023
Last synced: 31 Jul 2024
https://github.com/jupyter-naas/naas
Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)
ai binder data data-science data-transformation engine etl integration jupyter jupyterlab notebooks open-source pipeline
Last synced: 01 Aug 2024
https://github.com/datalayer/jupyter-ui
⚛️ React.js components 💯% compatible with 🪐 Jupyter. https://jupyter-ui-storybook.datalayer.tech
data data-product data-science data-visualisation datalayer ipywidgets jupyter jupyterlab lumino notebook reactjs ui
Last synced: 01 Aug 2024
https://github.com/enlite-ai/maze
Maze Applied Reinforcement Learning Framework
applied-machine-learning automation data-science decision-making deep-learning distributed documentation framework machine-learning monitoring optimization python reinforcement-learning simulation
Last synced: 01 Aug 2024
https://github.com/PYFTS/pyFTS
An open source library for Fuzzy Time Series in Python
data-science econometrics forecasting forecasting-models fts fuzzy-rules fuzzy-sets fuzzy-sets-syst fuzzy-systems fuzzy-time-series interval probabilistic-forecasting python series series-models time-series time-series-analysis time-series-forecasting
Last synced: 01 Aug 2024
https://pyfts.github.io/pyFTS/
An open source library for Fuzzy Time Series in Python
data-science econometrics forecasting forecasting-models fts fuzzy-rules fuzzy-sets fuzzy-sets-syst fuzzy-systems fuzzy-time-series interval probabilistic-forecasting python series series-models time-series time-series-analysis time-series-forecasting
Last synced: 01 Aug 2024
https://github.com/dataqa/nlp-labelling
Labelling platform for text using weak supervision.
annotation-tool data-labeling data-science learning-with-limited-labeled-data learning-with-noisy-labels natural-language-processing ner nlp nlp-machine-learning pseudo-labeling search-engine text-annotation-tool text-classification text-mining weak-supervision
Last synced: 31 Jul 2024
https://github.com/WenjieZ/TSCV
Time Series Cross-Validation -- an extension for scikit-learn
backtesting cross-validation data-science hyperparameter-optimization machine-learning model-selection time-series tuning-parameters
Last synced: 01 Aug 2024
https://github.com/asad70/reddit-sentiment-analysis
This program goes thru reddit, finds the most mentioned tickers and uses Vader SentimentIntensityAnalyzer to calculate the ticker compound value.
algotrading data-science data-science-projects data-visualization mentioned-tickers reddit reddit-sentiment-analysis sentiment sentiment-analysis stocks ticker-compound trading vader vader-sentiment-analysis vader-sentimentintensityanalyzer wallstreetbets
Last synced: 01 Aug 2024
https://github.com/PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
artificial-intelligence autograd data-science deep-learning deep-neural-networks distributed-systems distributed-training embeddings gpu high-dimensional machine-learning python state-of-the-art
Last synced: 31 Jul 2024
https://github.com/samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
active-learning annotation-tool data-labeling data-science gpt-4 machine-learning named-entity-recognition natural-language-processing ner nlp sequence-to-sequence text-annotation text-annotation-tool
Last synced: 31 Jul 2024
https://github.com/graphia-app/graphia
A visualisation tool for the creation and analysis of graphs
analysis data data-analysis data-science data-visualization graphs interpretation networks visualisation visualization
Last synced: 01 Aug 2024
https://github.com/modal-labs/modal-client
Python client library for Modal
cloud data-science distributed machine-learning modal python serverless
Last synced: 31 Jul 2024
https://github.com/bgruening/docker-galaxy-stable
:whale::bar_chart::books: Docker Images tracking the stable Galaxy releases.
data-science docker-image galaxy galaxyproject science
Last synced: 01 Aug 2024
https://github.com/Minyus/pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
data-engineering data-science deep-learning experimentation machine-learning pipeline
Last synced: 31 Jul 2024
https://github.com/koalaverse/homlr
Supplementary material for Hands-On Machine Learning with R, an applied book covering the fundamentals of machine learning with R.
data-science machine-learning r supervised-learning unsupervised-learning
Last synced: 31 Jul 2024
https://github.com/anki-code/xonsh-cheatsheet
Cheat sheet for xonsh shell with copy-pastable examples. The best doc for the new users.
awesome awesome-cheatsheet cheat-sheet cheat-sheets cheatsheet cheatsheets console data-science devops devops-scripts hacking shell terminal xonsh xontrib
Last synced: 31 Jul 2024
https://github.com/jgoerner/data-science-stack-cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
airflow apistar cookiecutter data-science docker docker-image jupyter minio postgres python superset
Last synced: 31 Jul 2024
https://github.com/A3Data/hermione
ML made simple
data-science hermione machine-learning python
Last synced: 31 Jul 2024
https://github.com/benedekrozemberczki/DANMF
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
autoencoder cikm clustering community-detection coordinate-descent danmf data-science deep-learning deepwalk dimensionality-reduction embedding gemsec machine-learning mnmf nmf node-embedding node2vec sklearn unsupervised-learning word2vec
Last synced: 31 Jul 2024
https://github.com/nteract/bookstore
📚 Notebook storage and publishing workflows for the masses
data-science notebook nteract scheduling storage versioned-buckets
Last synced: 01 Aug 2024
https://github.com/storieswithsiva/Data-Science-Resources
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
artificial-intelligence artificial-neural-networks data data-analysis data-analytics data-mining data-science data-science-resource data-science-resources data-scientist data-scientists data-visualization data-world datascience dataset learning learning-kit machine-learning python repository
Last synced: 01 Aug 2024
https://github.com/agilescientific/striplog
Lithology and stratigraphic logs for wells or outcrop.
data-mining data-science geology petrophysics sedimentology swung-stack
Last synced: 30 Jul 2024
https://lge-arc-advancedai.github.io/auptimizer/
An automatic ML model optimization tool.
automated-machine-learning automl data-engineering data-science deep-learning hpo hyperparameter-optimization hyperparameter-tuning machine-learning neural-networks
Last synced: 01 Aug 2024
https://github.com/h2oai/nitro
Create apps 10x quicker, without Javascript/HTML/CSS.
app apps data-analysis data-science developer-tools devtools graphics h2o-nitro low-code python ui ui-components user-interface web-application webapp widget-library widgets
Last synced: 01 Aug 2024
https://github.com/yogeshhk/TeachingDataScience
Course notes for Data Science related topics, prepared in LaTeX
course-materials data-science deep-learning jupyter-notebooks latex machine-learning natural-language-processing open-source python
Last synced: 31 Jul 2024
https://github.com/ivnvxd/pyquest
Python everything Cheatsheet and a Journey to the land of Python programming
algorithms architecture cheatsheet concurrency data-science data-structures data-types database fundamentals jupyter-notebook learn oop python standard-library tutorial web-development
Last synced: 01 Aug 2024
https://github.com/analysiscenter/batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
data-science machine-learning pipeline pipeline-framework python python3 workflow workflow-engine
Last synced: 01 Aug 2024
https://github.com/alan-turing-institute/skpro
A unified framework for tabular probabilistic regression and probability distributions in python
ai data-science framework machine-learning prediction probabilistic-models probability-distributions python regression sklearn
Last synced: 29 Jul 2024
https://github.com/Toloka/crowd-kit
Control the quality of your labeled data with the Python tools you already know.
aggregations annotation crowd crowdsourcing data-mining data-science labeling python quality-control toloka truth-inference
Last synced: 31 Jul 2024
https://github.com/ActivitySim/activitysim
An Open Platform for Activity-Based Travel Modeling
activitysim bsd-3-clause data-science microsimulation python travel-modeling
Last synced: 31 Jul 2024
https://github.com/nshiab/simple-data-analysis
Easy-to-use and high-performance JavaScript library for data analysis.
data data-analysis data-science duckdb javascript nodejs typescript
Last synced: 31 Jul 2024
https://github.com/robmarkcole/HASS-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 01 Aug 2024
https://github.com/kdr-aus/ogma
Scripting language focused on processing tabular data.
data-science language rust scripting-language table-data
Last synced: 31 Jul 2024
https://github.com/kevin-hanselman/dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
data-engineering data-pipelines data-science dataset dvcs machine-learning mlops
Last synced: 31 Jul 2024
https://github.com/Automunge/AutoMunge
Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbations.
Last synced: 31 Jul 2024
https://learnbyexample.github.io/py_resources/
Collection of Python learning resources
curated-list data-science learning machine-learning python resources scientific-computing
Last synced: 01 Aug 2024
https://github.com/jgoerner/beyond-jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
airflow apache apistar data-science docker docker-compose jupyter jupyter-notebook minio postgres superset
Last synced: 31 Jul 2024
https://github.com/davendw49/k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
ai4science data-science geoai geoscience kg large-language-models llm
Last synced: 01 Aug 2024
https://github.com/jazzdotdev/jazz
The Scripting Engine that Combines Speed, Safety, and Simplicity
actix android chromeos crypto cryptography data-science database development-environment embeddable jazz jinja2 linux lua markdown rust scraping scripting web witness
Last synced: 01 Aug 2024
https://github.com/DongjunLee/quantified-self
Self-knowledge through numbers
chatbot data-science fitbit github kino machine-learning personal-assistant quantified-self rescuetime self-tracking slack-bot todoist toggl trello
Last synced: 01 Aug 2024
https://github.com/jupyterhub/repo2docker-action
A GitHub action to build data science environment images with repo2docker and push them to registries.
actions binder data-science datascience docker jupyter jupyter-notebook repo2docker repo2docker-action
Last synced: 01 Aug 2024
https://github.com/dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
cli command-line data-analysis data-science forensic-analysis forensics json library log-analysis log-parser module parser parsing parsing-library python-library python-module python-modules security syslog
Last synced: 31 Jul 2024
https://github.com/hamelsmu/Seq2Seq_Tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 31 Jul 2024
https://github.com/DataHaskell/dh-core
Functional data science
data-analysis data-mining data-science dataframes datahaskell datasets machine-learning numerical-methods
Last synced: 31 Jul 2024
https://github.com/minerva-ml/steppy
Lightweight, Python library for fast and reproducible experimentation :microscope:
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 31 Jul 2024
https://github.com/genular/pandora
PANDORA - Predictive Analytics aNd Data Oriented Research Applications :computer:
bioinformatics biomarkers clinical-data clustering data-integration data-mining data-science data-visualization drug-discovery genomic-data-analysis machine-learning microbiome pandora predictive-analytics systems-biology transcriptomics tsne umap unsupervised-machine-learning
Last synced: 01 Aug 2024
https://github.com/gtkcyber/griffon-vm
Griffon Data Science Virtual Machine
apache-drill apache-spark big-data data-science database elasticsearch hadoop jupyter-notebook mysql node-js python r ruby scala virtual-machine
Last synced: 30 Jul 2024
https://github.com/morganjwilliams/pyrolite
A set of tools for getting the most from your geochemical data.
chemistry data-science geochemical-data geochemistry geoscience pyrolite ternary-diagrams
Last synced: 30 Jul 2024
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 01 Aug 2024
https://github.com/RamiKrispin/Introduction-to-Docker
(WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications
data-engineering data-science docker dockerfile
Last synced: 30 Jul 2024
https://github.com/ModelChimp/modelchimp
Experiment tracking for machine and deep learning projects
ai artificial-intelligence data-science deep-learning experiment machine-learning ml model-management platform tool
Last synced: 31 Jul 2024
https://github.com/EnvironmentOntology/envo
A community-driven ontology for the representation of environments
data-management data-science earth-science ecoinformatics ecology environment esip obofoundry ontology planetary-science semantics sustainable-development-goals
Last synced: 01 Aug 2024
https://github.com/machine-learning-apps/ml-template-azure
Template for getting started with automated ML Ops on Azure Machine Learning
aml azure azure-machine-learning data-science machine-learning machine-learning-lifecycle mlops
Last synced: 01 Aug 2024
https://github.com/jacobgil/confidenceinterval
The long missing library for python confidence intervals
data-science machine-learning metrics statistics
Last synced: 01 Aug 2024
https://github.com/romanmichaelpaolucci/AI_Stock_Trading
Design pattern for critical stages in the development process of an AI Stock Trading Bot
artificial-intelligence data-science machine-learning neural-network python trading trading-algorithms trading-bot trading-strategies
Last synced: 01 Aug 2024
https://github.com/mszell/introdatasci
Course materials for: Introduction to Data Science and Programming
course-materials crash-course data-science network-analysis pandas-python programming programming-courses python teaching-materials
Last synced: 01 Aug 2024
https://github.com/napjon/krisk
Statistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.
dashboard data-science data-visualization echarts interactive-charts jupyter-notebook python
Last synced: 01 Aug 2024
https://github.com/alexandervnikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets)
augmentations data-augmentation data-science datasets deep-learning generative-model keras machine-learning python synthetic-data synthetic-time-series tensorflow2 time-series vae
Last synced: 01 Aug 2024
https://github.com/streamnative/pulsar-spark
Spark Connector to read and write with Pulsar
apache-pulsar apache-spark batch-processing data-processing data-science flink spark spark-sql stream-processing structured-streaming
Last synced: 01 Aug 2024
https://github.com/lawmurray/Birch
A probabilistic programming language that combines automatic differentiation, automatic marginalization, and automatic conditioning within Monte Carlo methods.
autodiff bayesian bayesian-inference bayesian-methods bayesian-statistics data-science machine-learning machine-learning-algorithms machine-learning-projects monte-carlo-methods monte-carlo-sampling probabilistic-programming-languages statistics
Last synced: 31 Jul 2024
https://github.com/nicohlr/ipychart
The power of Chart.js with Python
charting-library chartjs charts data data-analysis data-science data-visualization ipywidgets javascript-es6 jupyter jupyter-notebook notebook python
Last synced: 01 Aug 2024
https://github.com/benthecoder/ml-blogs-that-are-worth-reading
Blogs on Machine Learning and Deep learning
ai artificial-intelligence data-science deep-learning machine-learning ml
Last synced: 01 Aug 2024
https://github.com/dssg/MLforPublicPolicy
Class resources for CAPP 30254 (Machine Learning for Public Policy)
data-science machine-learning public-policy
Last synced: 31 Jul 2024
https://github.com/georgian-io/pyoats
Quick and Easy Time Series Outlier Detection
anomaly anomaly-detection data-science deep-learning machine-learning time-series timeseries
Last synced: 31 Jul 2024
https://github.com/target/data-validator
A tool to validate data, built around Apache Spark.
data-science data-validation hacktoberfest
Last synced: 01 Aug 2024
https://github.com/xiyanghu/OSDT
Optimal Sparse Decision Trees
accelerate acceleration-model algorithm algorithm-optimization data-mining data-science interpretable-ml machine-learning ml-system mlsys neurips python python3
Last synced: 31 Jul 2024
https://github.com/PetoLau/TSrepr
TSrepr: R package for time series representations
data-analysis data-mining data-mining-algorithms data-science r r-package representation time-series time-series-analysis time-series-classification time-series-clustering time-series-data-mining time-series-representations
Last synced: 01 Aug 2024
https://github.com/IlyaGusev/tgcontest
Telegram Data Clustering contest solution by Mindful Squirrel
classification clustering cpp data-science document-similarity fasttext machine-learning nlp
Last synced: 01 Aug 2024
https://github.com/scottshambaugh/monaco
Quantify uncertainty and sensitivities in your computer models with an industry-grade Monte Carlo library.
data-science monaco monte-carlo python scientific-computing sensitivity-analysis simulation statistics uncertainty-analysis uncertainty-quantification
Last synced: 01 Aug 2024
https://github.com/ideos/gloe
A general-purpose library designed to guide developers in expressing their code as a flow.
clean-code data-science flow functional-programming machine-learning python typing
Last synced: 31 Jul 2024
https://github.com/darribas/gds_course
Geographic Data Science, the course
course data-science educational gds-course geographic-data-science gis
Last synced: 31 Jul 2024
https://github.com/uc-r/uc-r.github.io
Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
classroom data-science data-wrangling machine-learning r tutorial tutorial-code visualization
Last synced: 31 Jul 2024
https://github.com/Dumbris/trunklucator
Python module for data scientists for quick creating annotation projects.
active-learning annotation annotation-tool data-science machine-learning nlp
Last synced: 01 Aug 2024
https://github.com/nuclio/nuclio-jupyter
Nuclio Function Automation for Python and Jupyter
data-science jupyter kubernetes nuclio python
Last synced: 01 Aug 2024
https://github.com/beneath-hq/beneath
Beneath is a serverless real-time data platform ⚡️
analytics beneath data-engineering data-pipelines data-science data-warehouse dataops developer-tools etl go kubernetes mlops python sql streaming
Last synced: 01 Aug 2024
https://github.com/andrea-ballatore/open-geo-data-education
Open Geospatial Datasets for GIS Education: This is a repository of open geospatial datasets to be used in an educational context. I created these files over years of teaching Geographic Data Science and GIS. All original datasets are freely available online with open data licenses (see the dataset attribution for details). All the datasets in this repository have been selected, cleaned, harmonised, and repackaged for GIS exercises in a higher-education context. This is a pretty time-intensive process that other educators can hopefully avoid by using these versions.
data-science geojson geospatial-data geospatial-datasets gis gis-data gis-education tsv
Last synced: 31 Jul 2024
https://github.com/Invictify/Jupter-Notebook-REST-API
Run your jupyter notebooks as a REST API endpoint. This isn't a jupyter server but rather just a way to run your notebooks as a REST API Endpoint.
data-science data-science-pipelines docker dockerfile fastapi jupyter python rest-api
Last synced: 31 Jul 2024
https://github.com/manumerous/vpselector
Visual Pandas Selector: Visualize and interactively select time-series data
data-science data-visualization pandas python selector
Last synced: 31 Jul 2024
https://github.com/nbarrowman/vtree
An R package for calculating and drawing variable trees
data-science data-visualization exploratory-data-analysis r statistics
Last synced: 30 Jul 2024
https://github.com/piquette/qtrn
A cli tool to streamline financial markets data analysis :wrench:
cli data data-science finance go golang options quotes scraper stock stock-analysis stock-market
Last synced: 01 Aug 2024
https://github.com/data-centric-ai/dcbench
A benchmark of data-centric tasks from across the machine learning lifecycle.
Last synced: 31 Jul 2024
https://github.com/shenxiangzhuang/PythonDataAnalysis
The data and code that used in my book.
data-science python3 webcrawler
Last synced: 31 Jul 2024
https://github.com/hsbc/tslumen
A library for Time Series EDA (exploratory data analysis)
analysis data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations pandas profiling python time-series time-series-analysis time-series-eda time-series-profiling timeseries timeseries-analysis timeseries-eda
Last synced: 01 Aug 2024
https://github.com/gitonthescene/csv-reconcile
A reconciliation service for OpenRefine serving data from a given CSV file.
Last synced: 01 Aug 2024