Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sap-samples/datasphere-fedml
The publication is a collection of sample code to show how data from SAP and non-SAP systems can be made available for training in ANY hyperscaler machine learning service via several layers of abstraction from data connection to training using our FedML Python libraries.
https://github.com/sap-samples/datasphere-fedml
3944 4200 btp-use-case-factory data-federation data-to-value hyperscalers machine-learning sample sample-code sap-data-warehouse-cloud sap-datasphere
Last synced: about 1 month ago
JSON representation
The publication is a collection of sample code to show how data from SAP and non-SAP systems can be made available for training in ANY hyperscaler machine learning service via several layers of abstraction from data connection to training using our FedML Python libraries.
- Host: GitHub
- URL: https://github.com/sap-samples/datasphere-fedml
- Owner: SAP-samples
- License: apache-2.0
- Created: 2021-10-25T20:33:21.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-04-05T21:57:01.000Z (9 months ago)
- Last Synced: 2024-04-06T19:47:50.426Z (9 months ago)
- Topics: 3944, 4200, btp-use-case-factory, data-federation, data-to-value, hyperscalers, machine-learning, sample, sample-code, sap-data-warehouse-cloud, sap-datasphere
- Language: Jupyter Notebook
- Homepage:
- Size: 20.9 MB
- Stars: 11
- Watchers: 13
- Forks: 11
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[![REUSE status](https://api.reuse.software/badge/github.com/SAP-samples/data-warehouse-cloud-fedml)](https://api.reuse.software/info/github.com/SAP-samples/data-warehouse-cloud-fedml)
# FedML
## Description
The SAP Federated ML Python libraries (FedML) applies the Data Federation architecture of SAP Datasphere for intelligently sourcing SAP as well as non-SAP data for Machine Learning experiments done at any Machine Learning platform thereby removing the need for replicating or moving data.
By abstracting data connection, data loading (for all ML platforms), model training (with flexibility and support for user-provided training scripts), model deployment, and inferencing (for hyperscaler machine learning platforms), the FedML library offers end-to-end integration with just a few lines of code.## What's New
1. The new version of FedML (available as fedml-dsp in PyPi, V1.0.0) :
- Is machine learning platform-independent. It can be used in all machine learning platforms
- Supports NVIDIA RAPIDS™, CUDA cuDF and cuPy and hence can be used for training models in GPU environments.
- Supports sourcing data from SAP Datasphere models directly into PySpark and cuPy (for GPU) dataframes.
- Supports SAP AI Core Deployment - Models that are trained in any ML Platform (and containerized independently) can now be deployed in SAP GenAI Hub's AI Core with couple lines of code.
- Supports writing inferenced results back to SAP Datasphere.
### Solution Architecture
![ARD](/FedMLNew.jpg)
2.FedML (Original, V2.0) for hyperscaler platforms [AWS, GCP, Azure and Databricks] :
- Is pip installable from PyPi for its respective hyperscaler platforms.
- Supports model training and deployment to hyperscaler environment.
- Supports deployment to SAP Business Technology Platform Kyma environment.
- Supports inferencing with hyperscaler deployed as well as Kyma deployed models.
- Supports writing inferenced results back to SAP Datasphere.
## Requirements
- SAP Datasphere tenant instance, with connectivity established to the remote data sources, and views exposed, that can be consumed by FedML.
- Access to corresponding Machine learning Platforms with appropriate configurations. See [Configuration](#configuration) section.
## Download and Installation
Try out examples from the **samples-notebooks** directory of corresponding library folders
## Configuration
- For FedML (platform-independent) library specific pre-requisites, configuration and documentation, [please refer here](Datasphere/fedml-dsp.md)
- For AWS FedML library specific pre-requisites, configuration and documentation, [please refer here](AWS/fedml_aws.md)
- For GCP FedML library specific pre-requisites, configuration and documentation, [please refer here](GCP/fedml_gcp.md)
- For Azure FedML library specific pre-requisites, configuration and documentation, [please refer here](Azure/readme.md)
- For Databricks FedML library specific pre-requisites, configuration and documentation, [please refer here](Databricks/README.md)
## Limitations
None
## How to obtain support
This project is provided "as-is" with no expectation for major changes or support.
[Create an issue](/issues) in this repository if you find a bug or have questions about the content.
For additional support, [ask a question](https://answers.sap.com/questions/ask.html) in SAP Community.
## Licensing
Copyright (c) 2021 SAP SE or an SAP affiliate company. All rights reserved. This project is licensed under the Apache Software License, version 2.0 except as noted otherwise in the [LICENSE](LICENSES/Apache-2.0.txt) file.