Apache-Kafka-Guide
Apache Kafka Guide
https://github.com/mikeroyal/Apache-Kafka-Guide
Last synced: 4 days ago
JSON representation
-
Reinforcement Learning Tools, Libraries, and Frameworks
- Jupyter Notebook - source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Jupyter is used widely in industries that do data cleaning and transformation, numerical simulation, statistical modeling, data visualization, data science, and machine learning.
- XGBoost
- LIBSVM - SVC, nu-SVC), regression (epsilon-SVR, nu-SVR) and distribution estimation (one-class SVM). It supports multi-class classification.
- Microsoft Project Bonsai - code AI platform that speeds AI-powered automation development and part of the Autonomous Systems suite from Microsoft. Bonsai is used to build AI components that can provide operator guidance or make independent decisions to optimize process variables, improve production efficiency, and reduce downtime.
- Predictive Maintenance Toolbox™ - based and model-based techniques, including statistical, spectral, and time-series analysis.
- Navigation Toolbox™ - based path planners, as well as metrics for validating and comparing paths. You can create 2D and 3D map representations, generate maps using SLAM algorithms, and interactively visualize and debug map generation with the SLAM map builder app.
- OpenAI
- ReinforcementLearning.jl
- AWS RoboMaker - managed, scalable infrastructure for simulation that customers use for multi-robot simulation and CI/CD integration with regression testing in simulation.
- Cluster Manager for Apache Kafka(CMAK)
- CARLA - source simulator for autonomous driving research. CARLA has been developed from the ground up to support development, training, and validation of autonomous driving systems. In addition to open-source code and protocols, CARLA provides open digital assets (urban layouts, buildings, vehicles) that were created for this purpose and can be used freely.
- ROS/ROS2 bridge for CARLA(package) - way communication between ROS and CARLA. The information from the CARLA server is translated to ROS topics. In the same way, the messages sent between nodes in ROS get translated to commands to be applied in CARLA.
- Azure Databricks - based big data analytics service designed for data science and data engineering. Azure Databricks, sets up your Apache Spark environment in minutes, autoscale, and collaborate on shared projects in an interactive workspace. Azure Databricks supports Python, Scala, R, Java, and SQL, as well as data science frameworks and libraries including TensorFlow, PyTorch, and scikit-learn.
-
R Learning Resources
- R
- An Introduction to R
- Google's R Style Guide
- R developer's guide to Azure
- Running R on AWS
- RStudio Server Pro for AWS
- Learn R by Codecademy
- Learn R Programming with Online Courses and Lessons by edX
- R Language Courses by Coursera
- Learn R For Data Science by Udacity
- Running R at Scale on Google Compute Engine
-
R Tools, Libraries, and Frameworks
- CatBoost
- Visual Studio Code
- Code Server
- VSCode-R - project.org/), including features such as extended syntax highlighting, R language service based on code analysis, interacting with R terminals, viewing data, plots, workspace variables, help pages, managing packages, and working with [R Markdown](https://rmarkdown.rstudio.com/) documents.
- Rmarkdown
- Plotly
- Metaflow - life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning.
- LightGBM
- MLR
- Plumber
- Drake - focused pipeline toolkit for reproducibility and high-performance computing.
- DiagrammeR
- Knitr - purpose literate programming engine in R, with lightweight API's designed to give users full control of the output without heavy coding work.
- Broom
- R Debugger
- Language Server Protocol (LSP)
- R Host
- Rplugin
- ML workspace - in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively built ML solutions on your own machines. ML workspace is the ultimate tool for developers preloaded with a variety of popular data science libraries (Tensorflow, PyTorch, Keras, and MXnet) and dev tools (Jupyter, VS Code, and Tensorboard) perfectly configured, optimized, and integrated.
- Shiny
-
Scala Learning Resources
- Scala - oriented and functional programming in one concise, high-level language. Scala's static types help avoid bugs in complex applications, and its JVM and JavaScript runtimes let you build high-performance systems with easy access to huge ecosystems of libraries.
- Scala Style Guide
- Creating a Scala Maven application for Apache Spark in HDInsight using IntelliJ
- Using Scala to Program AWS Glue ETL Scripts
- Using Flink Scala shell with Amazon EMR clusters
- AWS EMR and Spark 2 using Scala from Udemy
- Scala Courses from Coursera
- Using the Google Cloud Storage connector with Apache Spark
- Write and run Spark Scala jobs on Cloud Dataproc for Google Cloud
- Scala Courses and Certifications from edX
- Top Scala Courses from Udemy
- Databricks Scala Style Guide
- Data Science using Scala and Spark on Azure
-
Scala Tools and Libraries
- Azure Databricks - based big data analytics service designed for data science and data engineering. Azure Databricks, sets up your Apache Spark environment in minutes, autoscale, and collaborate on shared projects in an interactive workspace. Azure Databricks supports Python, Scala, R, Java, and SQL, as well as data science frameworks and libraries including TensorFlow, PyTorch, and scikit-learn.
- Scala Native - of-time compiler and lightweight managed runtime designed specifically for Scala.
- Gitbucket
- Gatling - Sent-Events and JMS.
- Scala.js
- Polynote
- Scalatra - performance, async web framework, inspired by [Sinatra](https://www.sinatrarb.com/).
- Play Framework
- AWScala
- Dotty
-
SQL/NoSQL Learning Resources
- SQL
- Transact-SQL(T-SQL) - SQL commands.
- Introduction to Transact-SQL
- SQL Tutorial by W3Schools
- Learn SQL Skills Online from Coursera
- SQL Courses Online from Udemy
- SQL Online Training Courses from LinkedIn Learning
- Learn SQL For Free from Codecademy
- GitLab's SQL Style Guide
- OracleDB SQL Style Guide Basics
- Databases on AWS
- Best Practices and Recommendations for SQL Server Clustering in AWS EC2.
- Connecting from Google Kubernetes Engine to a Cloud SQL instance.
- MySQL Certifications
- What is NoSQL?
- Tableau CRM: BI Software and Tools
- SQL vs. NoSQL Databases: What's the Difference?
-
SQL/NoSQL Tools and Databases
- MSSQL for Visual Studio Code
- SQL Server Migration Assistant
- SQL Server Business Intelligence(BI)
- Tableau - releases/press-release-details/2019/Salesforce-Completes-Acquisition-of-Tableau/default.aspx).
- DataGrip - sensitive code completion, helping you to write SQL code faster. Completion is aware of the tables structure, foreign keys, and even database objects created in code you're editing.
- MySQL - native applications using the world's most popular open source database.
- PostgreSQL - relational database system with over 30 years of active development that has earned it a strong reputation for reliability, feature robustness, and performance.
- Amazon DynamoDB - value and document database that delivers single-digit millisecond performance at any scale. It is a fully managed, multiregion, multimaster, durable database with built-in security, backup and restore, and in-memory caching for internet-scale applications.
- Apache HBase™ - source, NoSQL, distributed big data store. It enables random, strictly consistent, real-time access to petabytes of data. HBase is very effective for handling large, sparse datasets. HBase serves as a direct input and output to the Apache MapReduce framework for Hadoop, and works with Apache Phoenix to enable SQL-like queries over HBase tables.
- ElasticSearch - capable full-text search engine with an HTTP web interface and schema-free JSON documents. Elasticsearch is developed in Java.
- Trino - us/azure/architecture/data-guide/relational-data/etl), allow them all to use standard SQL statement, and work with numerous data sources and targets all in the same system.
- Extract, transform, and load (ETL)
- Redis(REmote DIctionary Server) - memory data structure store, used as a database, cache, and message broker. It provides data structures such as strings, hashes, lists, sets, sorted sets with range queries, bitmaps, hyperloglogs, geospatial indexes, and streams.
- FoundationDB - value store and employs ACID transactions for all operations. It is especially well-suited for read/write workloads but also has excellent performance for write-intensive workloads. FoundationDB was acquired by [Apple in 2015](https://techcrunch.com/2015/03/24/apple-acquires-durable-database-company-foundationdb/).
- MongoDB - like documents.
- OracleDB - critical data with the highest availability, reliability, and security.
- MariaDB - critical applications.
- SQLite - language library that implements a small, fast, self-contained, high-reliability, full-featured, SQL database engine.SQLite is the most used database engine in the world. SQLite is built into all mobile phones and most computers and comes bundled inside countless other applications that people use every day.
- SQLite Database Browser
- InfluxDB - us/azure/architecture/data-guide/relational-data/etl) or monitoring and alerting purposes, user dashboards, Internet of Things sensor data, and visualizing and exploring the data and more. It also has support for processing data from [Graphite](http://graphiteapp.org/).
- CouchbaseDB - model NoSQL document-oriented database](https://en.wikipedia.org/wiki/Multi-model_database). It creates a key-value store with managed cache for sub-millisecond data operations, with purpose-built indexers for efficient queries and a powerful query engine for executing SQL queries.
- dbWatch - premise, hybrid/cloud database environments.
- Cosmos DB Profiler - time visual debugger allowing a development team to gain valuable insight and perspective into their usage of Cosmos DB database. It identifies over a dozen suspicious behaviors from your application’s interaction with Cosmos DB.
- Toad - in expertise. This SQL management tool resolve issues, manage change and promote the highest levels of code quality for both relational and non-relational databases.
- Sequel Pro
- Netdata - fidelity infrastructure monitoring and troubleshooting, real-time monitoring Agent collects thousands of metrics from systems, hardware, containers, and applications with zero configuration. It runs permanently on all your physical/virtual servers, containers, cloud deployments, and edge/IoT devices, and is perfectly safe to install on your systems mid-incident without any preparation.
- Azure Data Studio
- Hadoop Distributed File System (HDFS) - yarn/hadoop-yarn-site/YARN.html).
- Azure Synapse Analytics
- Azure SQL Managed Instance - premises applications to the cloud with very few application and database changes. Managed instance has split compute and storage components.
- Logstash
- Kibana
- Atlas - memory dimensional [time series database](https://en.wikipedia.org/wiki/Time_series_database).
- Azure SQL Database - powered and automated features that optimize performance and durability for you. Serverless compute and Hyperscale storage options automatically scale resources on demand, so you can focus on building new applications without worrying about storage size or resource management.
- Adminer
-
Uncategorized
-
Uncategorized
-
Categories
Reinforcement Learning Learning Resources
44
ML Frameworks, Libraries, and Tools
37
SQL/NoSQL Tools and Databases
35
Java Tools, Libraries, and Frameworks
26
Python Frameworks and Tools
26
NLP Tools, Libraries, and Frameworks
22
C# Tools, Libraries and Frameworks
22
Computer Vision Tools, Libraries, and Frameworks
20
Bioinformatics Tools, Libraries, and Frameworks
20
Go Tools and Frameworks
20
NLP Learning Resources
20
R Tools, Libraries, and Frameworks
20
SQL/NoSQL Learning Resources
17
Computer Vision Learning Resources
17
MATLAB Learning Resources
16
MATLAB Tools, Libraries, Frameworks
16
Reinforcement Learning Tools, Libraries, and Frameworks
15
Learning Resources for ML
15
Bioinformatics Learning Resources
13
Scala Learning Resources
13
Python Learning Resources
12
CUDA Tools Libraries, and Frameworks
12
R Learning Resources
11
Java Learning Resources
11
Scala Tools and Libraries
10
Go Learning Resources
9
C# Learning Resources
8
CUDA Learning Resources
6
Uncategorized
4
Deep Learning Tools, Libraries, and Frameworks
3
Deep Learning Learning Resources
2
License
1
Sub Categories
Keywords
python
16
golang
16
go
12
java
9
machine-learning
9
deep-learning
8
cuda
8
gpu
6
nlp
6
docker
5
kubernetes
5
natural-language-processing
5
devops
4
nvidia
4
csharp
4
pytorch
4
neural-network
4
cpp
3
typescript
3
named-entity-recognition
3
dotnet
3
tensorflow
3
http
3
matlab
3
android
3
data-visualization
3
visual-studio
3
neural-networks
3
data-science
3
ai
3
artificial-intelligence
3
cross-platform
2
c-sharp
2
gpu-computing
2
cxx20
2
cxx17
2
cxx14
2
compiler
2
performance
2
scala
2
web-framework
2
cxx11
2
rest
2
kotlin
2
dotnetcore
2
cli
2
nodejs
2
serverless
2
lambda
2
machine-learning-algorithms
2