https://github.com/rocm/rocsolver

Next generation LAPACK implementation for ROCm platform
https://github.com/rocm/rocsolver

lapack linear-algebra rocm

Last synced: 6 months ago
JSON representation

Next generation LAPACK implementation for ROCm platform

Host: GitHub
URL: https://github.com/rocm/rocsolver
Owner: ROCm
License: other
Created: 2018-05-22T22:37:56.000Z (over 7 years ago)
Default Branch: develop
Last Pushed: 2024-10-23T17:50:43.000Z (12 months ago)
Last Synced: 2024-10-24T02:02:30.974Z (12 months ago)
Topics: lapack, linear-algebra, rocm
Language: C++
Homepage: https://rocm.docs.amd.com/projects/rocSOLVER/en/latest/
Size: 9.19 MB
Stars: 91
Watchers: 28
Forks: 50
Open Issues: 20
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.md
- Codeowners: .github/CODEOWNERS

Awesome Lists containing this project

README

          # rocSOLVER

rocSOLVER is a work-in-progress implementation of a subset of [LAPACK][1]

functionality on the [ROCm platform][2].

## Documentation

> [!NOTE]

> The published rocSOLVER documentation is available at [rocSOLVER](https://rocm.docs.amd.com/projects/rocSOLVER/en/latest/index.html) in an organized, easy-to-read format, with search and a table of contents. The documentation source files reside in the rocSOLVER/docs folder of this repository. As with all ROCm projects, the documentation is open source. For more information, see [Contribute to ROCm documentation](https://rocm.docs.amd.com/en/latest/contribute/contributing.html).

### How to build documentation

Please follow the instructions below to build the documentation.

```

cd docs

pip3 install -r sphinx/requirements.txt

python3 -m sphinx -T -E -b html -d _build/doctrees -D language=en . _build/html

```

## Building rocSOLVER

To download the rocSOLVER source code, clone this repository with the command:

    git clone https://github.com/ROCmSoftwarePlatform/rocSOLVER.git

rocSOLVER requires rocBLAS as a companion GPU BLAS implementation. For

more information about rocBLAS and how to install it, see the

[rocBLAS documentation][4].

After a standard installation of rocBLAS, the following commands will build

rocSOLVER and install to `/opt/rocm`:

    cd rocSOLVER

    ./install.sh -i

Once installed, rocSOLVER can be used just like any other library with a C API.

The header file will need to be included in the user code, and both the rocBLAS

and rocSOLVER shared libraries will become link-time and run-time dependencies

for the user application.

If you are a developer contributing to rocSOLVER, you may wish to run

`./scripts/install-hooks` to install the git hooks for autoformatting.

You may also want to take a look at the [contributing guidelines][7]

## Using rocSOLVER

The following code snippet shows how to compute the QR factorization of a

general m-by-n real matrix in double precision using rocSOLVER. A longer

version of this example is provided by `example_basic.cpp` in the

[samples directory][5]. For a description of the `rocsolver_dgeqrf`

function, see the [rocSOLVER API documentation][6].

```cpp

/////////////////////////////

// example.cpp source code //

/////////////////////////////

#include  // for std::min

#include   // for size_t

#include 

#include  // for hip functions

#include  // for all the rocsolver C interfaces and type declarations

int main() {

  rocblas_int M;

  rocblas_int N;

  rocblas_int lda;

  // here is where you would initialize M, N and lda with desired values

  rocblas_handle handle;

  rocblas_create_handle(&handle);

  size_t size_A = size_t(lda) * N;          // the size of the array for the matrix

  size_t size_piv = size_t(std::min(M, N)); // the size of array for the Householder scalars

  std::vector hA(size_A);      // creates array for matrix in CPU

  std::vector hIpiv(size_piv); // creates array for householder scalars in CPU

  double *dA, *dIpiv;

  hipMalloc(&dA, sizeof(double)*size_A);      // allocates memory for matrix in GPU

  hipMalloc(&dIpiv, sizeof(double)*size_piv); // allocates memory for scalars in GPU

  // here is where you would initialize matrix A (array hA) with input data

  // note: matrices must be stored in column major format,

  //       i.e. entry (i,j) should be accessed by hA[i + j*lda]

  // copy data to GPU

  hipMemcpy(dA, hA.data(), sizeof(double)*size_A, hipMemcpyHostToDevice);

  // compute the QR factorization on the GPU

  rocsolver_dgeqrf(handle, M, N, dA, lda, dIpiv);

  // copy the results back to CPU

  hipMemcpy(hA.data(), dA, sizeof(double)*size_A, hipMemcpyDeviceToHost);

  hipMemcpy(hIpiv.data(), dIpiv, sizeof(double)*size_piv, hipMemcpyDeviceToHost);

  // the results are now in hA and hIpiv, so you can use them here

  hipFree(dA);                        // de-allocate GPU memory

  hipFree(dIpiv);

  rocblas_destroy_handle(handle);     // destroy handle

}

```

The exact command used to compile the example above may vary depending on the

system environment, but here is a typical example:

    /opt/rocm/bin/hipcc -I/opt/rocm/include -c example.cpp

    /opt/rocm/bin/hipcc -o example -L/opt/rocm/lib -lrocsolver -lrocblas example.o

[1]: https://www.netlib.org/lapack/

[2]: https://rocm.docs.amd.com/

[3]: https://rocm.docs.amd.com/projects/rocSOLVER/

[4]: https://rocm.docs.amd.com/projects/rocBLAS/

[5]: clients/samples/

[6]: https://rocm.docs.amd.com/projects/rocSOLVER/en/latest/api/lapack.html#rocsolver-type-geqrf

[7]: CONTRIBUTING.md

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/rocm/rocsolver

Awesome Lists containing this project

README