https://github.com/uxlfoundation/onemath

oneAPI Math Library (oneMath)
https://github.com/uxlfoundation/onemath

api blas cpu cuda dpcpp gpu hpc intel math-libraries oneapi onemkl parallel-computing parallel-programming performance rng

Last synced: 7 months ago
JSON representation

oneAPI Math Library (oneMath)

Host: GitHub
URL: https://github.com/uxlfoundation/onemath
Owner: uxlfoundation
License: apache-2.0
Created: 2020-03-05T23:55:03.000Z (almost 6 years ago)
Default Branch: develop
Last Pushed: 2025-05-12T11:14:31.000Z (7 months ago)
Last Synced: 2025-05-12T12:28:19.775Z (7 months ago)
Topics: api, blas, cpu, cuda, dpcpp, gpu, hpc, intel, math-libraries, oneapi, onemkl, parallel-computing, parallel-programming, performance, rng
Language: C++
Homepage:
Size: 11.6 MB
Stars: 674
Watchers: 48
Forks: 169
Open Issues: 70
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: .github/CODEOWNERS
- Security: SECURITY.md

Awesome Lists containing this project

README

# oneAPI Math Library (oneMath)

oneMath is an open-source implementation of the [oneMath specification](https://oneapi-spec.uxlfoundation.org/specifications/oneapi/latest/elements/onemath/source/). It can work with multiple devices using multiple libraries (backends) underneath. The oneMath project was previously referred to as oneMKL Interfaces.

oneMath is part of the [UXL Foundation](http://www.uxlfoundation.org).

User Application
oneMath Layer
Third-Party Library
Hardware Backend

oneMath
oneMath selector
Intel(R) oneAPI Math Kernel Library (oneMKL)
x86 CPU, Intel GPU

NVIDIA cuBLAS
NVIDIA GPU

NVIDIA cuSOLVER
NVIDIA GPU

NVIDIA cuRAND
NVIDIA GPU

NVIDIA cuFFT
NVIDIA GPU

NVIDIA cuSPARSE
NVIDIA GPU

NETLIB LAPACK
x86 and aarch64 CPU

Arm Performance Libraries
aarch64 CPU

AMD rocBLAS
AMD GPU

AMD rocSOLVER
AMD GPU

AMD rocRAND
AMD GPU

AMD rocFFT
AMD GPU

AMD rocSPARSE
AMD GPU

generic SYCL BLAS
x86 CPU, Intel GPU, NVIDIA GPU, AMD GPU, Other SYCL devices (unsupported)

portFFT
x86 CPU, Intel GPU, NVIDIA GPU, AMD GPU, Other SYCL devices (unsupported)

## Table of Contents

- [Support and Requirements](#support-and-requirements)
- [Documentation](#documentation)
- [FAQs](#faqs)
- [Legal Information](#legal-information)

---

## Support and Requirements

### Supported Usage Models:

#### Host API

There are two oneMath selector layer implementations:

- **Run-time dispatching**: The application is linked with the oneMath library and the required backend is loaded at run-time based on device vendor (all libraries should be dynamic).

Example of app.cpp with run-time dispatching:

```cpp
#include "oneapi/math.hpp"

...
cpu_dev = sycl::device(sycl::cpu_selector());
gpu_dev = sycl::device(sycl::gpu_selector());

sycl::queue cpu_queue(cpu_dev);
sycl::queue gpu_queue(gpu_dev);

oneapi::math::blas::column_major::gemm(cpu_queue, transA, transB, m, ...);
oneapi::math::blas::column_major::gemm(gpu_queue, transA, transB, m, ...);
```
How to build an application with run-time dispatching:

if OS is Linux, use icpx compiler. If OS is Windows, use icx compiler.
Linux example:
```cmd
$> icpx -fsycl –I$ONEMATH/include app.cpp
$> icpx -fsycl app.o –L$ONEMATH/lib –lonemath
```

- **Compile-time dispatching**: The application uses a templated backend selector API where the template parameters specify the required backends and third-party libraries and the application is linked with the required oneMath backend wrapper libraries (libraries can be static or dynamic).

Example of app.cpp with compile-time dispatching:

```cpp
#include "oneapi/math.hpp"

...
cpu_dev = sycl::device(sycl::cpu_selector());
gpu_dev = sycl::device(sycl::gpu_selector());

sycl::queue cpu_queue(cpu_dev);
sycl::queue gpu_queue(gpu_dev);

oneapi::math::backend_selector cpu_selector(cpu_queue);

oneapi::math::blas::column_major::gemm(cpu_selector, transA, transB, m, ...);
oneapi::math::blas::column_major::gemm(oneapi::math::backend_selector {gpu_queue}, transA, transB, m, ...);
```
How to build an application with compile-time dispatching:

```cmd
$> clang++ -fsycl –I$ONEMATH/include app.cpp
$> clang++ -fsycl app.o –L$ONEMATH/lib –lonemath_blas_mklcpu –lonemath_blas_cublas
```

*Refer to [Selecting a Compiler](https://uxlfoundation.github.io/oneMath/selecting_a_compiler.html) for the choice between `icpx/icx` and `clang++` compilers.*

#### Device API

Header-based and backend-independent Device API can be called within ```sycl kernel``` or work from Host code ([device-rng-usage-model-example](https://spec.oneapi.io/versions/latest/elements/oneMath/source/domains/rng/device_api/device-rng-usage-model.html#id2)). Currently, the following domains support the Device API:

- **RNG**. To use RNG Device API functionality it's required to include ```oneapi/math/rng/device.hpp``` header file.

### Supported Configurations:

Supported domains include: BLAS, LAPACK, RNG, DFT, SPARSE_BLAS

Supported compilers include:
- [Intel(R) oneAPI DPC++ Compiler](https://software.intel.com/en-us/oneapi/dpc-compiler): Intel proprietary compiler that supports CPUs and Intel GPUs. Intel(R) oneAPI DPC++ Compiler will be referred to as "Intel DPC++" in the "Supported Compiler" column of the tables below.
- [oneAPI DPC++ Compiler](https://github.com/intel/llvm): Open source compiler that supports CPUs and Intel, NVIDIA, and AMD GPUs. oneAPI DPC++ Compiler will be referred to as "Open DPC++" in the "Supported Compiler" column of the tables below.
- [AdaptiveCpp Compiler](https://github.com/AdaptiveCpp/AdaptiveCpp) (formerly known as hipSYCL): Open source compiler that supports CPUs and Intel, NVIDIA, and AMD GPUs.**Note**: The source code and some documents in this project still use the previous name hipSYCL during this transition period.

#### Linux*

Domain
Backend
Library
Supported Compiler
Supported Link Type

BLAS
x86 CPU
Intel(R) oneMKL
Intel DPC++AdaptiveCpp
Dynamic, Static

NETLIB LAPACK
Intel DPC++Open DPC++AdaptiveCpp
Dynamic, Static

generic SYCL BLAS
Intel DPC++Open DPC++
Dynamic, Static

aarch64 CPU
Arm Performance Libraries
Open DPC++AdaptiveCpp
Dynamic, Static

NETLIB LAPACK
Open DPC++AdaptiveCpp
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

generic SYCL BLAS
Intel DPC++Open DPC++
Dynamic, Static

NVIDIA GPU
NVIDIA cuBLAS
Open DPC++AdaptiveCpp
Dynamic, Static

generic SYCL BLAS
Open DPC++
Dynamic, Static

AMD GPU
AMD rocBLAS
Open DPC++AdaptiveCpp
Dynamic, Static

generic SYCL BLAS
Open DPC++
Dynamic, Static

Other SYCL devices (unsupported)
generic SYCL BLAS
Intel DPC++Open DPC++
Dynamic, Static

LAPACK
x86 CPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

aarch64 CPU
Arm Performance Libraries
Open DPC++AdaptiveCpp
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

NVIDIA GPU
NVIDIA cuSOLVER
Open DPC++
Dynamic, Static

AMD GPU
AMD rocSOLVER
Open DPC++
Dynamic, Static

RNG
x86 CPU
Intel(R) oneMKL
Intel DPC++AdaptiveCpp
Dynamic, Static

aarch64 CPU
Arm Performance Libraries
Open DPC++AdaptiveCpp
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

NVIDIA GPU
NVIDIA cuRAND
Open DPC++AdaptiveCpp
Dynamic, Static

AMD GPU
AMD rocRAND
Open DPC++AdaptiveCpp
Dynamic, Static

DFT
x86 CPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

portFFT (limited API support)
Intel DPC++
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

portFFT (limited API support)
Intel DPC++
Dynamic, Static

NVIDIA GPU
NVIDIA cuFFT
Open DPC++
Dynamic, Static

portFFT (limited API support)
Open DPC++
Dynamic, Static

AMD GPU
AMD rocFFT
Open DPC++
Dynamic, Static

portFFT (limited API support)
Open DPC++
Dynamic, Static

Other SYCL devices (unsupported)
portFFT
Open DPC++Open DPC++
Dynamic, Static

SPARSE_BLAS
x86 CPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

NVIDIA GPU
NVIDIA cuSPARSE
Open DPC++
Dynamic, Static

AMD GPU
AMD rocSPARSE
Open DPC++
Dynamic, Static

#### Windows*

Domain
Backend
Library
Supported Compiler
Supported Link Type

BLAS
x86 CPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

NETLIB LAPACK
Intel DPC++Open DPC++
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

LAPACK
x86 CPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

RNG
x86 CPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

Intel GPU
Intel(R) oneMKL
Intel DPC++
Dynamic, Static

---

### Hardware Platform Support

- CPU
- Intel Atom(R) Processors
- Intel(R) Core(TM) Processor Family
- Intel(R) Xeon(R) Processor Family
- Arm Neoverse Processor Family (tested on N1, V1, V2)
- Accelerators
- Intel(R) Arc(TM) A-Series Graphics
- Intel(R) Data Center GPU Max Series
- NVIDIA(R) A100 (Linux* only)
- AMD(R) GPUs see [here](https://github.com/RadeonOpenCompute/ROCm#hardware-and-software-support) tested on AMD Vega 20 (gfx906)
- Other SYCL devices can be used, but are not supported

---
### Supported Operating Systems

#### Linux*

#### Windows*

Backend | Supported Operating System
:--- | :---
x86 CPU | Microsoft Windows* Server 2022
Intel GPU | Microsoft Windows* 11
---

### Software Requirements

**What should I download?**

#### General:

Functional Testing
Build Only
Documentation

CMake (version 3.13 or newer)

Linux* : GNU* GCC 5.1 or higher
Windows* : MSVS* 2017 or MSVS* 2019 (version 16.5 or newer)

Ninja (optional)

GNU* FORTRAN Compiler
-
Sphinx

NETLIB LAPACK
-
-

#### Hardware and OS Specific:

Operating System
Device
Package

Linux*/Windows*
x86 CPU
Intel(R) oneAPI DPC++ Compiler
or
oneAPI DPC++ Compiler

Intel(R) oneAPI Math Kernel Library

Intel GPU
Intel(R) oneAPI DPC++ Compiler

Intel GPU driver

Intel(R) oneAPI Math Kernel Library

Linux* only
NVIDIA GPU
oneAPI DPC++ Compiler
or
AdaptiveCpp with CUDA backend and dependencies

AMD GPU
oneAPI DPC++ Compiler
or
AdaptiveCpp with ROCm backend and dependencies

#### Product and Version Information:

Product | Supported Version | License
:--- | :--- | :---
[CMake](https://cmake.org/download/) | 3.13 or higher | [The OSI-approved BSD 3-clause License](https://gitlab.kitware.com/cmake/cmake/raw/master/Copyright.txt)
[Ninja](https://ninja-build.org/) | 1.10.0 | [Apache License v2.0](https://github.com/ninja-build/ninja/blob/master/COPYING)
[GNU* FORTRAN Compiler](https://gcc.gnu.org/wiki/GFortran) | 7.4.0 or higher | [GNU General Public License, version 3](https://gcc.gnu.org/onlinedocs/gcc-7.5.0/gfortran/Copying.html)
[Intel(R) oneAPI DPC++ Compiler](https://software.intel.com/en-us/oneapi/dpc-compiler) | Latest | [End User License Agreement for the Intel(R) Software Development Products](https://software.intel.com/en-us/license/eula-for-intel-software-development-products)
[AdaptiveCpp](https://github.com/AdaptiveCpp/AdaptiveCpp) | Later than [2cfa530](https://github.com/AdaptiveCpp/AdaptiveCpp/commit/2cfa5303fd88b8f84e539b5bb6ed41e49c6d6118) | [BSD-2-Clause License ](https://github.com/AdaptiveCpp/AdaptiveCpp/blob/develop/LICENSE)
[oneAPI DPC++ Compiler binary for x86 CPU](https://github.com/intel/llvm/releases) | Daily builds | [Apache License v2](https://github.com/intel/llvm/blob/sycl/sycl/LICENSE.TXT)
[oneAPI DPC++ Compiler source for NVIDIA and AMD GPUs](https://github.com/intel/llvm) | Daily source releases | [Apache License v2](https://github.com/intel/llvm/blob/sycl/sycl/LICENSE.TXT)
[Intel(R) oneAPI Math Kernel Library](https://software.intel.com/en-us/oneapi/onemkl) | Latest | [Intel Simplified Software License](https://software.intel.com/en-us/license/intel-simplified-software-license)
[NVIDIA CUDA SDK](https://developer.nvidia.com/hpc-sdk) | 12.0 | [End User License Agreement](https://docs.nvidia.com/cuda/eula/index.html)
[AMD rocBLAS](https://github.com/ROCm/rocblas) | 4.5 | [AMD License](https://github.com/ROCm/rocBLAS/blob/develop/LICENSE.md)
[AMD rocRAND](https://github.com/ROCm/rocRAND) | 5.1.0 | [AMD License](https://github.com/ROCm/rocRAND/blob/develop/LICENSE.txt)
[AMD rocSOLVER](https://github.com/ROCm/rocSOLVER) | 5.0.0 | [AMD License](https://github.com/ROCm/rocSOLVER/blob/develop/LICENSE.md)
[AMD rocFFT](https://github.com/ROCm/rocFFT) | rocm-5.4.3 | [AMD License](https://github.com/ROCm/rocFFT/blob/rocm-5.4.3/LICENSE.md)
[AMD rocSPARSE](https://github.com/ROCm/rocSPARSE) | 3.1.2 | [AMD License](https://github.com/ROCm/rocSPARSE/blob/develop/LICENSE.md)
[NETLIB LAPACK](https://www.netlib.org/) | [5d4180c](https://github.com/Reference-LAPACK/lapack/commit/5d4180cf8288ae6ad9a771d18793d15bd0c5643c) | [BSD like license](http://www.netlib.org/lapack/LICENSE.txt)
[Generic SYCL BLAS](https://github.com/uxlfoundation/generic-sycl-components/tree/main/onemath/sycl/blas) | 0.1 | [Apache License v2.0](https://github.com/uxlfoundation/generic-sycl-components/blob/main/LICENSE)
[portFFT](https://github.com/codeplaysoftware/portFFT) | 0.1 | [Apache License v2.0](https://github.com/codeplaysoftware/portFFT/blob/main/LICENSE)
[Arm Performance Libraries](https://developer.arm.com/downloads/-/arm-performance-libraries) | 22.0.1 or higher | [EULA](https://developer.arm.com/downloads/-/arm-performance-libraries/eula)

---

## Documentation
- [Contents](https://uxlfoundation.github.io/oneMath/)
- [About](https://uxlfoundation.github.io/oneMath/introduction.html)
- Get Started
- [Selecting a Compiler](https://uxlfoundation.github.io/oneMath/selecting_a_compiler.html)
- [Building the Project with DPC++](https://uxlfoundation.github.io/oneMath/building_the_project_with_dpcpp.html)
- [Building the Project with AdaptiveCpp](https://uxlfoundation.github.io/oneMath/building_the_project_with_adaptivecpp.html)
- Developer Reference
- [oneMath Defined Datatypes](https://uxlfoundation.github.io/oneMath/onemath-datatypes.html)
- [Dense Linear Algebra](https://uxlfoundation.github.io/oneMath/domains/dense_linear_algebra.html)
- [Integrating a Third-Party Library](https://uxlfoundation.github.io/oneMath/create_new_backend.html)

---

## Governance

The oneMath project is governed by the UXL Foundation and you can get involved in this project in multiple ways. It is possible to join the [Math Special Interest Group (SIG)](https://github.com/uxlfoundation/foundation/tree/main/math) meetings where the group discusses and demonstrates work using this project. Members can also join the Open Source and Specification Working Group meetings.

You can also join the mailing lists for the [UXL Foundation](https://lists.uxlfoundation.org/g/main/subgroups) to be informed of when meetings are happening and receive the latest information and discussions.

---

## Contributing

You can contribute to this project and also contribute to [the specification for this project](https://oneapi-spec.uxlfoundation.org/specifications/oneapi/latest/elements/onemath/source/). Please read the [CONTRIBUTING](CONTRIBUTING.md) page for more information. You can also contact oneMath developers and maintainers via [UXL Foundation Slack](https://slack-invite.uxlfoundation.org/) using [#onemath](https://uxlfoundation.slack.com/archives/onemath) channel.

---

## License

Distributed under the Apache license 2.0. See [LICENSE](LICENSE) for more information.

---

## FAQs

### oneMath

**Q: What is the difference between the following items?**
- The [oneAPI Specification for oneMath](https://oneapi-spec.uxlfoundation.org/specifications/oneapi/latest/elements/onemath/source/)
- The [oneAPI Math Library (oneMath)](https://github.com/uxlfoundation/oneMath) project
- The [Intel(R) oneAPI Math Kernel Library (oneMKL)](https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/onemkl.html) Product

**A:**
- The [oneAPI Specification for oneMath](https://oneapi-spec.uxlfoundation.org/specifications/oneapi/latest/elements/onemath/source/) defines the SYCL interfaces for performance math library functions. The oneMath specification can evolve faster and more frequently than implementations of the specification.

- The [oneAPI Math Library (oneMath)](https://github.com/uxlfoundation/oneMath) project is an open source implementation of the specification. The project goal is to demonstrate how the SYCL interfaces documented in the oneMath specification can be implemented for any math library and work for any target hardware. While the implementation provided here may not yet be the full implementation of the specification, the goal is to build it out over time. We encourage the community to contribute to this project and help to extend support to multiple hardware targets and other math libraries.

- The [Intel(R) oneAPI Math Kernel Library (oneMKL)](https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/onemkl.html) project is an Intel product provided as part of the Intel(R) oneAPI Base Toolkit. It is used for the Intel backends of oneMath. Its C++ API is very similar to the oneMath specification. It is highly optimized for Intel CPU and Intel GPU hardware.

**Q: I'm trying to use oneMath in my project using `FetchContent`**, but I keep running into `ONEMATH::SYCL::SYCL target was not found` problem when I try to build the project. What should I do?

**A:**
Make sure you set the compiler when you configure your project.
E.g. `cmake -Bbuild . -DCMAKE_CXX_COMPILER=icpx`.

**Q: I'm trying to use oneMath in my project using `find_package(oneMath)`.** I set oneMath/oneTBB and Compiler environment first, then I built and installed oneMath, and finally I tried to build my project using installed oneMath (e.g. like this `cmake -Bbuild -GNinja -DCMAKE_CXX_COMPILER=icpx -DoneMath_ROOT= .`) and I noticed that cmake includes installed oneMath headers as a system include which ends up as a lower priority than the installed Intel(R) oneAPI Math Kernel Library package includes which I set before for building oneMath. As a result, I get conflicts between Intel(R) oneAPI Math Kernel Library and installed oneMath headers. What should I do?

**A:**
Having installed oneMath headers as `-I` instead on system includes (as `-isystem`) helps to resolve this problem. We use `INTERFACE_INCLUDE_DIRECTORIES` to add paths to installed oneMath headers (check `oneMathTargets.cmake` in `lib/cmake` to find it). It's a known limitation that `INTERFACE_INCLUDE_DIRECTORIES` puts headers paths as system headers. To avoid that:
- Option 1: Use CMake >=3.25. In this case oneMath will be built with `EXPORT_NO_SYSTEM` property set to `true` and you won't see the issue.
- Option 2: If you use CMake < 3.25, set `PROPERTIES NO_SYSTEM_FROM_IMPORTED true` for your target. E.g: `set_target_properties(test PROPERTIES NO_SYSTEM_FROM_IMPORTED true)`.

---

#### [Legal information](legal_information.md)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/uxlfoundation/onemath

Awesome Lists containing this project

README