An open API service indexing awesome lists of open source software.

https://github.com/cesnet/torque.build

Build configuration for Torque packages.
https://github.com/cesnet/torque.build

Last synced: 26 days ago
JSON representation

Build configuration for Torque packages.

Awesome Lists containing this project

README

          

Torque was package for many distributions already. And it happened,
that slightly different approaches were taken for Ubuntu and Debian.
This package is the result of a joint effort of Ubuntu developer Morten
Kjeldgaard and Debian developer Steffen Moeller to come up with packaging
strategy that both distributions can easily share. The Debian packaging
was originally crafted by Daniel Bayer.

The most recent changes are being introduced by Dominique Belhachemi,
who uses Torque as the basis for his Google Summer of Code project
over mid 2010.

WIKI
====
Additional information can be found in our wiki:

* http://wiki.debian.org/Torque *

Short and incomplete pointers to setting up the torque batch system
===================================================================

The default torque server name is "torqueserver". This name is defined in
the file `/etc/torque/server_name`. You can either change this, add
respective entries to /etc/hosts or, if you have access to the nameserver,
add a torqueserver CNAME record pointing to the actual machine. To change
that name later is some error-prone process that you might want to avoid,
since all users need to update their SSH known hosts entries.

This README will describe a simple setup with a single head node that
does not itself act as a compute server. Submitted jobs will be
executed on distinct compute nodes.

In our setup, users have their home directories on NFS mounted shares
that are accesible from all workstations and also the compute nodes
(via autofs).

For the system to work smoothly, it is important that you have set up
passwordless ssh, so compute nodes can copy the batch queue log files back
to the users' disk area. Changing the name of the server will impose the
need to reperform that SSH setup.

Head node
---------

On the head node, you need to run the torque-server and torque-scheduler.
The scheduler is a simple one, you can substitute it with a more
sophisticated one like maui, also distributed from
http://www.clusterresources.com/[clusterressources.org] (unfortunately, maui
has a license too strict to distribute as packages.)

Compute node
------------

On each of the compute nodes, you need to install at least torque-mom.

Submit hosts
------------

You also need a number of submit hosts, that may or may not include the
head and compute nodes. On these, you need to install the software in
torque-client. In our case, users do their work on a desktop workstation in
their offices or in the computing room. You can submit jobs from each of
these, these jobs are forwarded to the head node (torque-server) and there
scheduled for execution.

Add to /etc/services
--------------------

If you want to have names associated with the ports that are used by
torque, you may want to patch /etc/services:

if grep -q '^pbs' /etc/services; then
echo "/etc/services already offers a pbs entry. Not touched: "
grep ^pbs /etc/services
else
cat >>/etc/services < /var/lib/torque/server_priv/nodes
hostname --long > /var/lib/torque/server_name
hostname --long > /var/lib/torque/mom_priv/config
ps aux | grep -v grep | grep -q pbs_server || pbs_server -t create
qmgr -c "s s scheduling=true"
qmgr -c "c q batch queue_type=execution"
qmgr -c "s q batch started=true"
qmgr -c "s q batch enabled=true"
qmgr -c "s q batch resources_default.nodes=1"
qmgr -c "s q batch resources_default.walltime=3600"
qmgr -c "s s default_queue=batch"
invoke-rc.d restart
invoke-rc.d restart
invoke-rc.d restart
qmgr -c "s n `/bin/hostname --long` state=free" -e

and enjoy your very own batch server!

Morten Kjeldgaard
January 2008

-- Morten Kjeldgaard , Sat, 9 Feb 2008 23:01:19 +0100