Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mesosphere-backup/mesos-hydra
MPICH2 Hydra scheduler for Apache Mesos.
https://github.com/mesosphere-backup/mesos-hydra
Last synced: 5 days ago
JSON representation
MPICH2 Hydra scheduler for Apache Mesos.
- Host: GitHub
- URL: https://github.com/mesosphere-backup/mesos-hydra
- Owner: mesosphere-backup
- License: apache-2.0
- Created: 2014-02-04T17:18:48.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2014-02-09T03:02:22.000Z (over 10 years ago)
- Last Synced: 2024-10-22T23:35:27.797Z (17 days ago)
- Language: Python
- Homepage:
- Size: 1.36 MB
- Stars: 29
- Watchers: 166
- Forks: 7
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-starred - mesosphere-backup/mesos-hydra - MPICH2 Hydra scheduler for Apache Mesos. (others)
README
mesos-hydra
===========MPICH2 Hydra scheduler for Apache Mesos.
The stock MPI framework in Mesos is targeted MPICH2 with MPD process management. This framework allows users to use the default Hydra process manager.
Usage:
$ ./mrun -N <#Nodes> -n <#MPI processes> -c <#Cores per MPI process>
This is still a experimental framework and any participation and feedback is appreciated.
## Installation on Elastic Mesos
First off, go ahead and launch a cluster at http://elastic.mesosphere.io.
Then log into one of the master nodes and fetch the Hydra framework:$ wget https://github.com/mesosphere/mesos-hydra/archive/master.zip
$ unzip master.zip
$ cd mesos-hydra-master
$ sudo aptitude install python2.7-protobuf python-distribute make g++ build-essential gfortran libcr0 default-jdk
$ wget http://www.cebacad.net/files/mpich/ubuntu/mpich-3.1rc2/mpich_3.1rc2-1ubuntu_amd64.deb
$ sudo dpkg -i mpich_3.1rc2-1ubuntu_amd64.deb
$ export HDFS_NAME_NODE=
$ make download_egg
$ make
$ ./mrun -N 3 -n 6 ./hello_world
I0209 02:54:30.842380 17588 sched.cpp:218] No credentials provided. Attempting to register without authentication
I0209 02:54:30.842560 17588 sched.cpp:230] Detecting new master
Number of tasks= 6 My rank= 1 Running on ec2-54-211-204-163.compute-1.amazonaws.com
Number of tasks= 6 My rank= 0 Running on ec2-54-204-134-8.compute-1.amazonaws.com
Number of tasks= 6 My rank= 3 Running on ec2-54-204-134-8.compute-1.amazonaws.com
Number of tasks= 6 My rank= 2 Running on ec2-107-21-190-250.compute-1.amazonaws.com
Number of tasks= 6 My rank= 5 Running on ec2-107-21-190-250.compute-1.amazonaws.com
Number of tasks= 6 My rank= 4 Running on ec2-54-211-204-163.compute-1.amazonaws.com
## Known issues### Missing library dependencies
MPICH2 usually expects a mounted parallel filesystem but mesos-hydra only use and depends on HDFS. This means that necessary libraries needs to be shipped with the MPI command to the slaves. This can be worked around by copying the needed libraries to export/libs and rerun make.
### mrun hangs with node, process configuration X
mesos-hydra will decline offers indefinitely if too greedy resource constraints have been set up (for example requiring more cores than nodes provide). This will make mrun hang and should be avoided if possible.