Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sysread/coro-processpool

An asynchronous process pool for Perl and Coro
https://github.com/sysread/coro-processpool

Last synced: about 1 month ago
JSON representation

An asynchronous process pool for Perl and Coro

Awesome Lists containing this project

README

        

=pod

=encoding UTF-8

=head1 NAME

Coro::ProcessPool - An asynchronous pool of perl processes

=head1 VERSION

version 0.30

=head1 SYNOPSIS

use Coro::ProcessPool;
use Coro;

my $pool = Coro::ProcessPool->new(
max_procs => 4,
max_reqs => 100,
include => ['/path/to/my/task/classes', '/path/to/other/packages'],
);

my $double = sub { $_[0] * 2 };

#-----------------------------------------------------------------------
# Process in sequence, waiting for each result in turn
#-----------------------------------------------------------------------
my %result;
foreach my $i (1 .. 1000) {
$result{$i} = $pool->process($double, $i);
}

#-----------------------------------------------------------------------
# Process as a batch
#-----------------------------------------------------------------------
my @results = $pool->map($double, 1 .. 1000);

#-----------------------------------------------------------------------
# Defer waiting for result
#-----------------------------------------------------------------------
my %deferred;

$deferred{$_} = $pool->defer($double, $_)
foreach 1 .. 1000;

# Later
foreach my $i (keys %deferred) {
print "$i = " . $deferred{$i}->() . "\n";
}

#-----------------------------------------------------------------------
# Use a "task class" implementing 'new' and 'run'
#-----------------------------------------------------------------------
my $result = $pool->process('Task::Doubler', 21);

#-----------------------------------------------------------------------
# Pipelines (work queues)
#-----------------------------------------------------------------------
my $pipe = $pool->pipeline;

# Start producer thread to queue tasks
my $producer = async {
while (my $task = get_next_task()) {
$pipe->queue('Some::TaskClass', $task);
}

# Let the pipeline know no more tasks are coming
$pipe->shutdown;
};

# Collect the results of each task as they are received
while (my $result = $pipe->next) {
do_stuff_with($result);
}

$pool->shutdown;

=head1 DESCRIPTION

Processes tasks using a pool of external Perl processes.

=head1 CONSTRUCTOR

my $pool = Coro::ProcessPool->new(
max_procs => 4,
max_reqs => 100,
include => ['path/to/my/packages', 'some/more/packages'],
);

=head2 max_procs

The maximum number of processes to run within the process pool. Defaults
to the number of CPUs on the ssytem.

=head2 max_reqs

The maximum number of tasks a worker process may run before being terminated
and replaced with a fresh process. This is useful for tasks that might leak
memory over time.

=head2 include

An optional array ref of directory paths to prepend to the set of directories
the worker process will use to find Perl packages.

=head1 METHODS

=head2 join

Cedes control to the event loop until the pool has completed all remaining
tasks and woken up any threads watching them.

=head2 defer

Queues a task to be processed by the pool. Tasks may specified in either of two
forms, as a code ref or the fully qualified name of a perl class which
implements two methods, C and C. Any remaining arguments to C
are passed unchanged to the code ref or the C method of the task class.

C will immediately return an L that will wait for and
return the result of the task (or croak if the task generated an error).

# Using a code ref
my $cv = $pool->defer(\&func, $arg1, $arg2, $arg3);
my $result = $cv->recv;

# With a task class
my $cv = $pool->defer('Some::Task::Class', $arg1, $arg2, $arg3);
my $result = $cv->recv;

=head2 process

Calls defer and immediately calls C on the returned condvar, returning
the result. This is useful if your workflow includes multiple threads which
share the same pool. All arguments are passed unchanged to C.

=head2 map

Like perl's C, applies a code ref to a list of arguments. This method will
cede until all results have been returned by the pool, returning the result as
a list. The order of arguments and results is preserved as expected.

my @results = $pool->map(\&func, $arg1, $arg2, $arg3);

=head2 pipeline

Returns a L object which can be used to pipe
requests through to the process pool. Results then come out the other end of
the pipe, not necessarily in the order in which they were queued. It is up to
the calling code to perform task accounting (for example, by passing an id in
as one of the arguments to the task class).

my $pipe = $pool->pipeline;

my $producer = async {
foreach my $args (@tasks) {
$pipe->queue('Some::Class', $args);
}

$pipe->shutdown;
};

while (my $result = $pipe->next) {
...
}

All arguments to C are passed transparently to the constructor of
L. There is no limit to the number of pipelines
which may be created for a pool.

=head1 A NOTE ABOUT IMPORTS AND CLOSURES

Code refs are serialized using L, allowing closed over
variables to be available to the code being called in the sub-process. Mutated
variables are I updated when the result is returned.

See L for important
notes regarding closures.

=head2 Use versus require

The C pragma is run at compile time, whereas C is evaluated at
runtime. Because of this, the use of C in code passed directly to the
C method can fail in the worker process because the C statement
has already been evaluated in the parent process when the calling code was
compiled.

This will not work:

$pool->process(sub {
use Foo;
my $foo = Foo->new();
});

This will work:

$pool->process(sub {
require Foo;
my $foo = Foo->new();
});

If C is necessary (for example, to import a method or transform the
calling code via import), it is recommended to move the code into its own
module (or to expliticly call require and import in the subroutine), which can
then be called in the anonymous routine:

package Bar;

use Foo;

sub dostuff {
...
}

Then, in your caller:

$pool->process(sub {
require Bar;
Bar::dostuff();
});

Alternately, a task class may be used if dependency management is causing a
headaches:

my $result = $pool->process('Task::Class', @args);

=head1 COMPATIBILITY

C will likely break on Win32 due to missing support for
non-blocking file descriptors (Win32 can only call C and C on
actual network sockets). Without rewriting this as a network server, which
would impact performance and be really annoying, it is likely this module will
not support Win32 in the near future.

The following modules will get you started if you wish to explore a synchronous
process pool on Windows:

=over

=item L

=item L

=item L

=back

=head1 SEE ALSO

=over

=item L

=item L

=back

=head1 AUTHOR

Jeff Ober

=head1 COPYRIGHT AND LICENSE

This software is copyright (c) 2017 by Jeff Ober.

This is free software; you can redistribute it and/or modify it under
the same terms as the Perl 5 programming language system itself.

=cut