Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tejzpr/ordered-concurrently
Ordered-concurrently a library for concurrent processing with ordered output in Go. Process work concurrently and returns output in a channel in the order of input. It is useful in concurrently processing items in a queue, and get output in the order provided by the queue.
concurrent concurrent-data-structure data-pipeline data-science golang golang-library ordered parallel parallel-computing
Last synced: 30 Jul 2024
https://github.com/sametcopur/ruleopt
Optimization-Based Rule Learning for Classification
data-science explainable-ai linear-programming machine-learning machine-learning-library python
Last synced: 01 Aug 2024
https://github.com/dMLTquant/openbb_sdk_exporation
Explore OpenBB SDK without having to install anything on your local machine. You just need a GitHub and a GitPod account.
algorithmic-trading data-science financial-data jupyter notebook openbb python
Last synced: 01 Aug 2024
https://github.com/wri-dssg-omdena/policy-data-analyzer
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
active-learning bert data-science document-classification environmental huggingface incentives landscape-restoration lda machine-learning nlp policy sbert scraping scrapy sentence-transformers spyder text-classification topic transformers
Last synced: 31 Jul 2024
https://github.com/alagoa/youtube-or-pornhub
Service identification on ciphered traffic.
capture data-science machinelearning ml pcap python3 spotify traffic tshark youtube
Last synced: 01 Aug 2024
https://github.com/datamininggroup/pfa
Portable Format for Analytics
data-science deployment pfa-standard predictive-analytics
Last synced: 30 Jul 2024
https://github.com/0x0be/scrapeadvisor
A user-friendly python-based GUI which provides sentiment analysis of users' reviews toward a specific TripAdvisor facility
data-mining data-science python3 r scraping sentiment-analysis sentiment-classification text-mining tripadvisor tripadvisor-scraper web-scraping
Last synced: 01 Aug 2024
https://github.com/rjbergerud/open-source-for-common-good
A list I'm keeping of active open source projects that serve a social or environmental goal.
citizen-science civic-tech community data-science humanity non-profit social social-impact sustainability
Last synced: 01 Aug 2024
https://github.com/denadai2/google_street_view_deep_neural
Deep Neural Network model to predict security perception from Google Street View images. Model based on AlexNet CNNs
computational-social-science computer-vision data-science deep-learning urban-planning urban-science
Last synced: 31 Jul 2024
https://github.com/humburg/reportmd
Create multi-page HTML reports in R
data-science r rmarkdown rstudio
Last synced: 31 Jul 2024
https://github.com/minerva-ml/steppy-toolkit
Curated set of transformers that make your work with steppy faster and more effective :telescope:
data-science deep-learning keras keras-models machine-learning nlp open-source pipeline pipeline-framework python python3 pytorch pytorch-models reproducibility reproducible-research steppy steppy-toolkit steps tensorflow tensorflow-models
Last synced: 31 Jul 2024
https://github.com/asavinov/lambdo
Feature engineering and machine learning: together at last!
data-analysis data-mining data-science feature-engineering forecasting forecasting-models machine-learning time-series
Last synced: 31 Jul 2024
https://github.com/kennethleungty/Anomaly-Detection-Pipeline-Kedro
Anomaly Detection Pipeline with Isolation Forest model and Kedro framework
anomaly anomaly-detection credit-card credit-card-fraud data-science data-science-pipeline financial financial-data fraud fraud-detection kedro machine-learning machine-learning-pipeline ml mlops pipelines quantumblack
Last synced: 31 Jul 2024
https://github.com/gagolews/genie
Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
cluster cluster-analysis clustering data-analysis data-mining data-science datascience genie hierarchical-clustering-algorithm machine-learning machine-learning-algorithms outliers r
Last synced: 31 Jul 2024
https://github.com/OGFris/GoStats
GoStats is a go library for math statistics mostly used in ML domains, it covers most of the statistical measures functions.
data-science go golang gostats machine-learning math mathematics mit-license statistical-measures statistics stats
Last synced: 30 Jul 2024
https://github.com/aengl/cocoon-demo
Cocoon – a flow-based workflow automation, data mining and visual analytics tool.
brushing cocoon data-mining data-science data-visualization dataflow flow-based-modeling flow-based-programming interactive-visualisations node-js reactjs visual-analytics workflow-automation
Last synced: 01 Aug 2024
https://github.com/Grasia/WikiChron
Data visualization tool for wikis evolution
analyzer data-analysis data-science data-visualization datascience dump evolution graphs history history-dump mediawiki-wikis plot research-tool time-series visualization web-service wiki wikia wikimedia wikis
Last synced: 01 Aug 2024
https://github.com/somdeep/Statball
Statball - Football soccer stats analyser from top 5 european leagues with data obtained by web scraping from Fbref and Statsbomb
csharp data-science data-scraping data-viz dotnet dotnet-core fbref football football-analytics football-data scouting-data scraping soccer soccer-analytics soccer-data statsbomb tableau visualizations
Last synced: 01 Aug 2024
https://github.com/codelibs/fione
Fione is Enterprise AI Platform
ai automl data-science machine-learning
Last synced: 31 Jul 2024
https://github.com/kennethleungty/Text-to-Audio-with-Bark
Exploring Bark, the Open-Source Text-to-Audio Generative Model
ai artificial-intelligence bark data-science deep-learning gen-ai generative-ai machine-learning prompt-engineering speech text-prompt text-to-audio text-to-music text-to-sound text-to-speech
Last synced: 31 Jul 2024
https://github.com/qpwedev/blockchain-network-visualizer
Blockchain Network Visualizer for TON.
blockchain data-science network ton toncoin
Last synced: 30 Jul 2024
https://github.com/benedekrozemberczki/NestedSubtreeHash
A distributed implementation of "Nested Subtree Hash Kernels for Large-Scale Graph Classification Over Streams" (ICDM 2012).
data-mining data-science deepwalk distributed-machine-learning feature-extraction gensim graph-classification graph-kernel graph-mining hashing large-scale-learning machine-learning multi-scale node2vec representation-learning streaming-data streaming-processing word2vec
Last synced: 31 Jul 2024
https://github.com/rueedlinger/ml-resources
A curated list of statistics, data visualization and machine learning resources which in find useful, have read or want to read.
curated-list data-science data-visualization deep-learning machine-learning statistics
Last synced: 01 Aug 2024
https://cufctl.github.io/mlbd/
Repository for the machine learning / big data creative inquiry
data-science high-performance-computing machine-learning python tensorflow
Last synced: 31 Jul 2024
https://github.com/jurjoroa/hcc-website
Repository of the Hertie Coding Club website. Built with Quarto
css data-science html html-css-javascript html5 javascript latex lua netlify quarto quarto-project quarto-pub quarto-template quartopub webr website
Last synced: 31 Jul 2024
https://github.com/psyplot/psyplot-gui
Graphical User Interface for the psyplot package
data-science gui interactive ipython psyplot qtconsole sphinx
Last synced: 31 Jul 2024
https://github.com/stappit/blog
I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.
bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics
Last synced: 30 Jul 2024
https://github.com/codelibs/docker-fione
Docker for Fione
ai automl data-science machine-learning
Last synced: 31 Jul 2024
https://github.com/sharatsawhney/character_segmentation
A detailed Research project on Character-Segmentation using Neural Networks!
data-science deep-learning deep-neural-networks keras keras-layer keras-models keras-neural-networks matplotlib neural-network numpy opencv-python
Last synced: 01 Aug 2024
https://github.com/WaylonWalker/kedro-auto-catalog
Kedro catalog create with default configuration
data data-science kedro kedro-catalog kedro-hook kedro-plugin
Last synced: 31 Jul 2024
https://github.com/fmv1992/data_utilities
Data utilities library focused on machine learning and data analysis.
Last synced: 30 Jul 2024
https://github.com/ZackAkil/friendlier-data-labelling
Code resources for generating a google form for labelling data.
data-science google google-apps-script google-forms google-sheets machine-learning
Last synced: 01 Aug 2024
https://github.com/techbastic/roadmaps
A curated list of resources to start your developer journey.
blockchain community data-science devops full-stack hacktoberfest open-source roadmaps
Last synced: 31 Jul 2024
https://github.com/UniversalDataTool/courseware
Create instructions for labeling datasets using the Universal Data Tool
annotators courseware data-science dataset hacktoberfest label
Last synced: 01 Aug 2024
https://github.com/Rahulkumarr2080/Comcast-Telecom-Consumer-Complaints
Comcast is an American global telecommunication company. The firm has been providing terrible customer service. They continue to fall short despite repeated promises to improve. Only last month (October 2016) the authority fined them $2.3 million, after receiving over 1000 consumer complaints. The existing database will serve as a repository of public customer complaints file.
comcast-telcom-complaints data-science data-scientists data-visualization datascience datascience-with-python jupyter-notebook matplotlib numpy pandas python python-for-data-science rahul-kumar rahul-kumar-thakur
Last synced: 31 Jul 2024
https://github.com/Suji04/Chat_Entropy_Analysis
A simple python script to find and compare WhatsApp chat entropy
data-science entropy python3 shannon-entropy whatsapp
Last synced: 29 Jul 2024
https://github.com/yuval-a/deriveODM
DeriveODM is a reactive ODM - Object Document Mapper - framework, a "wrapper" around MongoDB, that removes all the hassle of data-persistence by handling it transparently in the background, in a DRY manner.
collection data data-mapper data-science database db document dry mapper mongo mongodb mongoose node nodejs object odm persistence persistent react reactive
Last synced: 01 Aug 2024
https://github.com/janskwr/Processing-of-structured-data
Processing of structured data - the third homework assignment/project!
big-data data-processing data-science data-table r stackexchange stringi xml
Last synced: 29 Jul 2024
https://github.com/jmbhughes/goes_solar_retriever
Tool to retrieve GOES-R Solar Data
data data-retrieval data-science goes-16 goes-satellite goes16 goes17 solar solar-physics
Last synced: 01 Aug 2024
https://github.com/erinaldi/bmn2-lattice
Data analysis of lattice Monte Carlo simulations of quantum matrix models.
data data-science data-visualisation lattice
Last synced: 31 Jul 2024
https://github.com/firefly-cpp/snail-dataset
computer-vision data-science object-detection snails
Last synced: 01 Aug 2024
https://github.com/github/CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
bert cnn data data-science datasets deep-learning machine-learning machine-learning-on-source-code ml natural-language-processing neural-networks nlp nlp-machine-learning open-data programming-language-theory python representation-learning rnn self-attention tensorflow
Last synced: 30 Jul 2024
https://github.com/TheoLvs/data-science-VR
Data Science experiments in Virtual Reality
data-science machine-learning virtual-reality
Last synced: 29 Jul 2024