https://github.com/648trindade/adaptive

Adaptive Scheduler for Parallel Loops in C++/C
https://github.com/648trindade/adaptive

cpp parallel-computing parallel-programming parallelism scheduler

Last synced: over 1 year ago
JSON representation

Adaptive Scheduler for Parallel Loops in C++/C

Host: GitHub
URL: https://github.com/648trindade/adaptive
Owner: 648trindade
License: mit
Created: 2020-04-20T01:39:37.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2020-05-12T00:27:13.000Z (about 6 years ago)
Last Synced: 2025-01-10T01:28:13.138Z (over 1 year ago)
Topics: cpp, parallel-computing, parallel-programming, parallelism, scheduler
Language: C++
Size: 220 KB
Stars: 2
Watchers: 2
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE.txt

Awesome Lists containing this project

README

          # Adaptive

[![Build Status](https://travis-ci.com/648trindade/adaptive.svg?branch=master)](https://travis-ci.com/648trindade/adaptive)

Adaptive is a parallel loop scheduler designed with an adaptive algorithm. Its original purpose is to better balancing parallel irregular workloads and better balancing regular (and irregular) workloads on Asymmetric Multicore Processors (AMP).

The hybrid adaptive scheduling algorithm uses _work stealing_ in order to balance loads on threads, and uses a THE protocol approach to minimize parallel overhead on concurrent scheduling operations. A deep explanation of how scheduler actually works can be found on the following master thesis (written in portuguese, english paper coming soon):

> Trindade, Rafael G. and Lima, João V. F.. **Escalonador Adaptativo de Laços Paralelos para Processadores Multinúcleo Assimétricos**. Master Thesis. Universidade Federal de Santa Maria. 2020. Santa Maria, RS, Brazil. Available at: http://www.inf.ufsm.br/~rtrindade/docs/dissertacao-rtrindade.pdf.

>

> English Title: **An Adaptive Scheduler of Parallel Loops for Asymmetric Multicore Processors**

The scheduler is based on the following related works:

> M. Durand, F. Broquedis, T. Gautier, and B. Raffin, **An efficient openmp loop scheduler for irregular applications on large-scale NUMA machines**, in OpenMP in the Era of Low Power Devices and Accelerators, A. P. Rendell, B. M. Chapman, and M. S. Muller, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013, pp. 141–155. [Online]. Available: https://doi.org/10.1007/978-3-642-40698-0_11

>

> S. D. K. Mor, **Analysis of synchronizations in greedy-scheduled executions and applications to efficient generation of pseudorandom numbers in parallel**, Thesis, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil, nov 2015. [Online]. Available: http://hdl.handle.net/10183/130529

>

> D. Traore, J.-L. Roch, N. Maillard, T. Gautier, and J. Bernard, **Deque-free work-optimal parallel STL algorithms**, in Euro-Par 2008 -- Parallel Processing, E. Luque, T. Margalef, and D. Benítez, Eds.Berlin, Heidelberg: Springer Berlin Heidelberg, 2008, pp. 887–897. [Online]. Available: https://doi.org/10.1007/978-3-540-85451-7_95

## API

The Adaptive's API is based on Thread Building Blocks (TBB) API. We currently support two types of parallel loops:

* Common parallel loops:

```c++

void adapt::parallel_loop(

    T start, T end, 

    void body(T, T)

);

```

* Reduction parallel loops

```c++

V adapt::parallel_reduce(

    T start, T end, 

    V initial, 

    V body(T, T, V), 

    V reductor(V, V)

);

```

The API accepts functions and lambda functions as parameters for `body` and `reductor` arguments.

## Examples

Parallelizing a vector filling algorithm

```c++

std::vector vec(256, 0);

adapt::parallel_for(

    0, 256,

    [&vec](const int start, const int end) {

        for (int i = start; i < end; i++)

            vec[i] = some_function();

    }

);

```

Parallelizing a vector sum algorithm (using `std::plus` utility binary function as reductor)

```c++

std::vector vec(256, 0.0);

// ... fill vec someway

double result = adapt::parallel_reduce(

    0, 256, 0.0,

    // body

    [&vec](const int start, const int end, double initial) {

        double _result = initial;

        for (int i = start; i < end; i++)

            _result += vec[i];

        return _result;

    },

    // reductor

    std::plus();

);

```

Parallelizing a find minimal value and index algorithm (using a custom reductor -- C++ lambda function)

```c++

std::vector vec(256, 0.0);

// ... fill vec someway

std::pair _initial = std::make_pair(0, vec[0]);

std::pair result = adapt::parallel_reduce(

    1, 256, _initial,

    // body

    [&vec](const int start, const int end, std::pair initial) {

        std::pair _result = initial;

        for (int i = start; i < end; i++) {

            if (vec[i] < _result.second) {

                _result.first = i;

                _result.second = vec[i];

            }

        }

        return _result;

    },

    // reductor

    [](const std::pair left, const std::pair right) {

        std::pair _result = left;

        if (right.second < _result.second) {

            _result.first = right.first;

            _result.second = right.second;

        }

        return _result;

    }

);

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/648trindade/adaptive

Awesome Lists containing this project

README