Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-dotnet-datascience

A curated list of awesome .Net packages, frameworks and resources for datascience
https://github.com/neurallayer/awesome-dotnet-datascience

Last synced: about 23 hours ago
JSON representation

  • Interactive Development

    • .NET Interactive - .NET Interactive takes the power of .NET and embeds it into your interactive experiences. Share code, explore data, write, and learn across your apps in ways you couldn't before. Right now there is support for Jupyter, nteract, and Visual Studio Code.
    • IfSharp - F# for Jupyter Notebooks with intellisense and integrated NuGet support.
    • Binder - Binder is an online application that allows you to point it to a github repo and start a Jupyter notebook that is in there. Binder supports C#, F# and even Powershell notebooks. And if you just want to try some notebooks right now, click [here](https://mybinder.org/v2/gh/dotnet/interactive/master?urlpath=lab).
  • Dataframes

    • Deedle - Data frame and (time) series library for exploratory data manipulation with C# and F# support
    • Pandas.NET - Pandas ported to C#. It is a data analysis tool that can process multi-dimensional arrays using DataFrames.
    • Spreads - Series and Panels for Real-time and Exploratory Analysis of Data Streams. Spreads library is optimized for performance and memory usage. It is several times faster than other open source projects.
  • Visualization

    • Plot.NET - .NET wrapper of plotly.js for ICSharpCore.
    • XPlot - XPlot is data visualization package for F# and .NET. Uses Plotly and Google Charts as "backends" to render fancy visualization.
  • Big Data

    • Cinchoo ETL - Cinchoo ETL is a code-based ETL framework for extracting data from multiple sources, transforming, and loading into your very own data warehouse in .NET environment.
    • FsShelter - FsShelter is a library for defining and running Apache Storm topologies in F# using statically typed streams.
    • ETLBox - Read and write data from flatfiles, databases (e.g. Sql Server, SQLite, MySql or Postgres) or webservices. Harmonize, filter, aggregate, validate and clean the data in-memory with highly customizable transformations and write them into any destination.
  • Data loading

    • HDF.PInvoke - HDF.PInvoke is a collection of PInvoke signatures for the HDF5 C-API. It is not a high-level .NET interface for HDF5.
    • HDF5-CSharp - Set of tools that help in reading and writing hdf5 files for .net environments.
    • parquet-dotnet - Fully managed .NET library to read and write Apache Parquet files.
    • Lightning.NET - .NET library for LMDB key-value store. LMDB is a Btree-based database management library. The entire database is exposed in a memory map, and all data fetches return data directly from the mapped memory. As such, the library is extremely high performance and memory-efficient. So ideal for storing large amount of data during machine learning.
    • Json.NET - Serialize and deserialize any .NET object with Json.NET's JSON serializer.
    • F# Data - F# type providers for accessing XML, JSON, CSV and HTML files (based on sample documents) and for accessing WorldBank data.
    • F# Data - F# type providers for accessing XML, JSON, CSV and HTML files (based on sample documents) and for accessing WorldBank data.
  • Language bindings

    • R Provider - Type provider that exposes R packages and functions in a type-safe way to F# callers
    • Python.NET - Python.NET is a package that gives Python programmers integration with the .NET 4.0+ Common Language Runtime (CLR). Python.NET provides a powerful application scripting tool for .NET developers. Using this package you can script .NET applications or build entire applications in Python, using .NET services and components written in any language that targets the CLR (C#, VB.NET, F#, C++/CLI).
    • IronPython - IronPython is an open-source implementation of the Python programming language which is tightly integrated with the .NET Framework. IronPython can use the .NET Framework and Python libraries, and other .NET languages can use Python code just as easily.
  • Math and Statistics

    • NumSharp - High Performance Computation for N-D Tensors in .NET, similar API to NumPy wrtitten in pure C#.
    • Numpy.NET - Numpy.NET is an almost complete .NET binding for NumPy.
    • Math.NET Numerics - Math.NET Numerics aims to provide methods and algorithms for numerical computations in science, engineering and every day use. Covered topics include special functions, linear algebra, probability models, random numbers, interpolation, integration, regression, optimization problems and more.
    • Math.NET Symbolics - Math.NET Symbolics is a basic open source computer algebra library for .Net and Mono written in F#.
    • Math.NET Filtering - Filtering aims to provide a toolkit for digital signal processing, offering an infrastructure for digital filter design, applying those filters to data streams using data converters, as well as digital signal generators.
    • Math.NET Spatial - Math.NET Spatial is aiming to become a geometry library for .Net and Mono.
    • ILNumerics - set of tools for engineers and scientists based on modern software frameworks that help to develop, deploy and maintain technical applications. Consist of development tooling (integrated in Visual Studio), visualization engine and a compute engine.
    • ALGLIB - ALGLIB is a cross-platform numerical analysis and data processing library. It supports several programming languages (C++, C#, Delphi) and several operating systems (Windows and POSIX, including Linux).
  • Computer Vision

    • SharpCV - A Computer Vision library that combines OpenCV and NDArray together in .NET Standard. Great for processing images and video before feeding into a machine learning model.
    • Accord.NET Extensions - Advanced image processing and computer vision algorithms made as fluent extensions.
    • OpenCVDotNet - A wrapper for the OpenCV project to be used with .NET applications.
    • Emgu CV - Cross platform wrapper of OpenCV which can be compiled in Mono to be run on Windows, Linus, Mac OS X, iOS, and Android.
  • Natural Language Processing

    • Stanford.NLP for .NET - A full port of Stanford NLP packages to .NET and also available precompiled as a NuGet package.
    • Catalyst - trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
    • Stanford.NLP for .NET - A full port of Stanford NLP packages to .NET and also available precompiled as a NuGet package.
  • Machine Learning and Differential Programming

    • ML.NET - Cross-platform open-source machine learning framework which makes machine learning accessible to .NET developers. This framework wraps several other libraries and is actively being developed. So a great place to start.
    • SciSharp STACK - A rich machine learning ecosystem for .NET created by porting the most popular libraries to C#. Since the APIs of the ported libraries are so similar to the originals you can easily re-use all existing resources, documentation and community solutions to common problems in C# or F# without much effort. Severel of their packages are listed separately on this page.
    • Infer.NET - A framework for running Bayesian inference in graphical models. It can also be used for probabilistic programming.
    • Synapses - An in-memory neural network library written in F#.
    • Torch.NET - .NET bindings for PyTorch. Machine Learning with C# / F# with Multi-GPU/CPU support
    • TorchSharp - TorchSharp is a .NET library that provides access to the library that powers PyTorch. It is a work in progress, but already provides a .NET API that can be used to perform (1) various operations on ATen Tensors; (2) scoring of TorchScript models; (3) Training of simple neural networks. Current focus is to bind the entire API surfaced by libtorch.
    • Tensorflow.Net - .NET Standard bindings for Google's TensorFlow for developing, training and deploying Machine Learning models in C#.
    • Keras.NET - Keras.NET is a high-level neural networks API, written in C# with Python Binding and capable of running on top of TensorFlow, CNTK, or Theano.
    • NeuralNetwork.NET - A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN.
    • XGBoost.Net - .Net wrappers for the awesome XGBoost library
    • DiffSharp - DiffSharp allows for exact and efficient calculation of derivatives, by systematically invoking the chain rule of calculus at the elementary operator level during program execution.
    • autodiff - A .NET library that provides fast, accurate and automatic differentiation (computes derivative / gradient) of mathematical functions.
    • Accord.NET - Machine learning framework combined with audio and image processing libraries (computer vision, computer audition, signal processing and statistics). Merged with [AForge.NET](http://www.aforgenet.com/framework/).
    • numl - This library is designed to assist in the use of common Machine Learning Algorithms in conjunction with the .NET platform. It is designed to include the most popular supervised and unsupervised learning algorithms while minimizing the friction involved with creating the predictive models.
    • FsLab - FsLab is a curated collection of open source F# packages for data-science. Together with your editor or Jupyter notebook these packages allow you to rapidly develop scalable, high-performance analytics and visualizations using succinct, type-safe, production-ready code.
    • MxNet.Sharp - MxNet.Sharp is a CSharp binding coving all the Imperative, Symbolic and Gluon API’s with an easy to use interface. The Gluon library in Apache MXNet provides a clear, concise, and simple API for deep learning. It makes it easy to prototype, build, and train deep learning models without sacrificing training speed.
  • Reinforcement Learning

    • Gym.NET - A complete port of OpenAI Gym to C#. OpenAI Gym is a toolkit for developing and comparing reinforcement learning algorithms. This is the gym open-source library, which gives you access to a standardized set of environments. Work in progress.
    • Deep QLearning demo - This demo is a C# port of [ConvNetJS RLDemo](https://cs.stanford.edu/people/karpathy/convnetjs/demo/rldemo.html) by Andrej Karpathy.
  • Genetic Programming

    • GeneticSharp - Multi-platform genetic algorithm library for .NET Core and .NET Framework. The library has several implementations of GA operators, like: selection, crossover, mutation, reinsertion and termination.
    • SharpNEAT - SharpNEAT provides an implementation of an Evolutionary Algorithm (EA) with the specific goal of evolving neural networks. The EA uses the evolutionary mechanisms of mutation, recombination and selection to search for neural networks with behaviour that satisfies some formally defined problem.
  • Serving

    • SciSharp Cube - Easy way to get all the latest features of SciSharp Machine Learning tools in docker container.
  • Domain specific

    • BotSharp - BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio processing technologies, and aims to promote the development and application of intelligent robot assistants in information systems.
    • QuantAD - Automatic Differentiation tool targeted at Quantitative Finance.