Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/swansonk14/p_tqdm
Parallel processing with progress bars
https://github.com/swansonk14/p_tqdm
parallel-processing progress-bar tqdm
Last synced: 17 days ago
JSON representation
Parallel processing with progress bars
- Host: GitHub
- URL: https://github.com/swansonk14/p_tqdm
- Owner: swansonk14
- License: mit
- Created: 2017-08-16T01:23:38.000Z (about 7 years ago)
- Default Branch: main
- Last Pushed: 2024-08-08T09:32:53.000Z (3 months ago)
- Last Synced: 2024-08-09T11:44:48.631Z (3 months ago)
- Topics: parallel-processing, progress-bar, tqdm
- Language: Python
- Homepage:
- Size: 55.7 KB
- Stars: 447
- Watchers: 7
- Forks: 44
- Open Issues: 17
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# p_tqdm
[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/p-tqdm)](https://badge.fury.io/py/p-tqdm)
[![PyPI version](https://badge.fury.io/py/p-tqdm.svg)](https://badge.fury.io/py/p-tqdm)
[![Downloads](https://pepy.tech/badge/p_tqdm)](https://pepy.tech/project/p_tqdm)
[![Build Status](https://github.com/swansonk14/p_tqdm/workflows/tests/badge.svg)](https://github.com/swansonk14/p_tqdm)
[![codecov](https://codecov.io/gh/swansonk14/p_tqdm/branch/main/graph/badge.svg)](https://codecov.io/gh/swansonk14/p_tqdm)
[![license](https://img.shields.io/github/license/swansonk14/p_tqdm.svg)](https://github.com/swansonk14/p_tqdm/blob/main/LICENSE.txt)`p_tqdm` makes parallel processing with progress bars easy.
`p_tqdm` is a wrapper around [pathos.multiprocessing](https://github.com/uqfoundation/pathos/blob/master/pathos/multiprocessing.py) and [tqdm](https://github.com/tqdm/tqdm). Unlike Python's default multiprocessing library, pathos provides a more flexible parallel map which can apply almost any type of function, including lambda functions, nested functions, and class methods, and can easily handle functions with multiple arguments. tqdm is applied on top of pathos's parallel map and displays a progress bar including an estimated time to completion.
## Installation
```pip install p_tqdm```
## Example
Let's say you want to add two lists element by element. Without any parallelism, this can be done easily with a Python `map`.
```python
l1 = ['1', '2', '3']
l2 = ['a', 'b', 'c']def add(a, b):
return a + b
added = map(add, l1, l2)
# added == ['1a', '2b', '3c']
```But if the lists are much larger or the computation is more intense, parallelism becomes a necessity. However, the syntax is often cumbersome. `p_tqdm` makes it easy and adds a progress bar too.
```python
from p_tqdm import p_mapadded = p_map(add, l1, l2)
# added == ['1a', '2b', '3c']
``````
0%| | 0/3 [00:00, ?it/s]
33%|████████████ | 1/3 [00:01<00:02, 1.00s/it]
66%|████████████████████████ | 2/3 [00:02<00:01, 1.00s/it]
100%|████████████████████████████████████| 3/3 [00:03<00:00, 1.00s/it]
```## p_tqdm functions
### Parallel maps
* [p_map](#p_map) - parallel ordered map
* [p_imap](#p_imap) - iterator for parallel ordered map
* [p_umap](#p_umap) - parallel unordered map
* [p_uimap](#p_uimap) - iterator for parallel unordered map### Sequential maps
* [t_map](#t_map) - sequential ordered map
* [t_imap](#t_imap) - iterator for sequential ordered map### p_map
Performs an ordered map in parallel.
```python
from p_tqdm import p_mapdef add(a, b):
return a + badded = p_map(add, ['1', '2', '3'], ['a', 'b', 'c'])
# added = ['1a', '2b', '3c']
```### p_imap
Returns an iterator for an ordered map in parallel.
```python
from p_tqdm import p_imapdef add(a, b):
return a + biterator = p_imap(add, ['1', '2', '3'], ['a', 'b', 'c'])
for result in iterator:
print(result) # prints '1a', '2b', '3c'
```### p_umap
Performs an unordered map in parallel.
```python
from p_tqdm import p_umapdef add(a, b):
return a + badded = p_umap(add, ['1', '2', '3'], ['a', 'b', 'c'])
# added is an array with '1a', '2b', '3c' in any order
```### p_uimap
Returns an iterator for an unordered map in parallel.
```python
from p_tqdm import p_uimapdef add(a, b):
return a + biterator = p_uimap(add, ['1', '2', '3'], ['a', 'b', 'c'])
for result in iterator:
print(result) # prints '1a', '2b', '3c' in any order
```### t_map
Performs an ordered map sequentially.
```python
from p_tqdm import t_mapdef add(a, b):
return a + badded = t_map(add, ['1', '2', '3'], ['a', 'b', 'c'])
# added == ['1a', '2b', '3c']
```### t_imap
Returns an iterator for an ordered map to be performed sequentially.
```python
from p_tqdm import p_imapdef add(a, b):
return a + biterator = t_imap(add, ['1', '2', '3'], ['a', 'b', 'c'])
for result in iterator:
print(result) # prints '1a', '2b', '3c'
```## Shared properties
### Arguments
All `p_tqdm` functions accept any number of iterables as input, as long as the number of iterables matches the number of arguments of the function.
To repeat a non-iterable argument along with the iterables, use Python's [partial](https://docs.python.org/3/library/functools.html#functools.partial) from the [functools](https://docs.python.org/3/library/functools.html) library. See the example below.
```python
from functools import partiall1 = ['1', '2', '3']
l2 = ['a', 'b', 'c']def add(a, b, c=''):
return a + b + cadded = p_map(partial(add, c='!'), l1, l2)
# added == ['1a!', '2b!', '3c!']
```### CPUs
All the parallel `p_tqdm` functions can be passed the keyword `num_cpus` to indicate how many CPUs to use. The default is all CPUs. `num_cpus` can either be an integer to indicate the exact number of CPUs to use or a float to indicate the proportion of CPUs to use.
Note that the parallel Pool objects used by `p_tqdm` are automatically closed when the map finishes processing.
### tqdm instance
All the parallel `p_tqdm` functions can be passed the keyword `tqdm` to choose a specific flavor of tqdm. By default, this value is taken from `tqdm.auto`. The `tqdm` parameter can be used pass `p_tqdm` output to `tqdm.gui`, `tqdm.tk` or any customized subclass of `tqdm`.