https://github.com/leavers/fluentmap

A drop-in Python map replacement featuring parallel and batch processing
https://github.com/leavers/fluentmap

batch chunk functional-programming mapreduce parallel-processing python

Last synced: 3 months ago
JSON representation

A drop-in Python map replacement featuring parallel and batch processing

Host: GitHub
URL: https://github.com/leavers/fluentmap
Owner: leavers
License: mit
Created: 2024-03-28T20:16:08.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-12-07T15:58:48.000Z (about 1 year ago)
Last Synced: 2025-09-25T07:44:34.115Z (3 months ago)
Topics: batch, chunk, functional-programming, mapreduce, parallel-processing, python
Language: Python
Homepage:
Size: 41 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Fluentmap

[![Testing](https://github.com/leavers/fluentmap/actions/workflows/test-suite.yml/badge.svg)](https://github.com/leavers/fluentmap/actions/workflows/test-suite.yml)

[![Package version](https://img.shields.io/pypi/v/fluentmap.svg)](https://pypi.org/project/fluentmap/)

[![Python](https://img.shields.io/pypi/pyversions/fluentmap.svg)](https://pypi.org/project/fluentmap/)

Fluentmap provides a drop-in Python map replacement featuring parallel and batch

processing.

## Features

- Use `executor` to run tasks in parallel.

- Use `batch_size`/`chunk_size` to send parameters in batches/chunks.

- Use `num_prepare` to prepare data in advance for better performance.

- Call `on_return` hook to process the return value of each task.

## Installation

Fluentmap is available on [PyPI](https://pypi.org/project/fluentmap/):

```shell

pip install fluentmap

```

## Usage

### Drop-in replacement

You can start to use fluentmap just like built-in `map`:

```python

from typing import Any, List

from fluentmap import map

items: List[str] = [...]

def heavy_task(item: str) -> Any:

    """Suppose this function represents a computationally expensive task."""

def postprocessing(result: Any):

    """Suppose this function represents a postprocessing task."""

for result in map(heavy_task, items):

    postprocessing(result)

```

### Parallel processing

As `heavy_task` is a computationally expensive task, you can use `executor` to

run it in parallel.

```python

from concurrent.futures import ProcessPoolExecutor

from fluentmap import map

# ......

with ProcessPoolExecutor() as executor:

    # each heavy_task invocation runs in a separate process

    for result in map(heavy_task, items, executor=executor):

        postprocessing(result)

```

### Batch/chunk processing

You can use `batch_size`/`chunk_size` to send arguments in batches/chunks.

The difference between them is that `batch_size` packs multiple arguments into a batch

before sending them to the function, therefore the function needs to be modified to

handle a list of arguments.

On the other hand, `chunk_size` packs multiple arguments into a chunks before passing

them to executor workers, while workers still process each argument sequentially.

```python

from concurrent.futures import ProcessPoolExecutor

from typing import Any, List

from fluentmap import map

# ......

def heavy_task_in_batch(item: List[str]) -> Any:

    """Note that `item` is a list since when `batch_size` is set,

    fluentmap will concatenate multiple items into a batch before sending them to

    the function which is to be invoked.

    """

# An example of using `batch_size`

with ProcessPoolExecutor() as executor:

    for result in map(

        heavy_task_in_batch,

        items,

        executor=executor,

        batch_size=64,

    ):

        postprocessing(result)

# An example of using `chunk_size` 

with ProcessPoolExecutor() as executor:

    for result in map(

        heavy_task,

        items,

        executor=executor,

        chunk_size=32,

    ):

        postprocessing(result)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/leavers/fluentmap

Awesome Lists containing this project

README