Skip to content

A sample setup for processing non linear workflows using docker swarm

License

Notifications You must be signed in to change notification settings

nehaljwani/ddag-sample

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ddag-sample

A sample setup for processing (non?) linear workflows using docker swarm

ddag

This project serves as a sample for setting up a distributed cluster of nodes which can be part of a (disconnected?) directed acyclic graph. Each node in this graph represents a module which performs some action in the scientific workflow. For example, in the context of NLP, it can be a Morph Analyzer written in C. Each node hosts an internal API (as a wrapper over the program) using the real-time web framework Mojolicious . This setup helps in processing the components of the given pipeline, which are at the same level, in a parallel manner. For example, in the above picture, {bcd, cde} are processed in parallel. The user is expected to POST a graph to the container hosting the public API, which then sets a JOB ID for each such graph and subscribes to it using a pub-sub channel. The graph to be processed can have more nodes than the number of nodes in the swarm, in case some module is supposed to be reused, for example, in the above picture, bcd is used twice. In order to facilitate that, each node in the graph (except the input nodes) follow the identification scheme: ${modulename}_${identifier}

Base image for all modules

Either build the image:

$ docker build -t ddag-sample .

Or simply pull it:

$ docker pull nehaljwani/ddag-sample

Setting up docker swarm

Either follow: https://docs.docker.com/engine/userguide/networking/get-started-overlay/ Or for a quick setup, do the following:

  • Make sure that the docker daemon is launched with the required arguments on all host machines.

    # For a machine running systemd, with network interface `ensp1s0` and IP address `10.1.65.241`, the line on the master host would look like:
    ExecStart=/usr/bin/docker daemon -H fd:// -H tcp://10.1.65.241:2375 --cluster-advertise enp1s0:2375 --cluster-store consul://10.1.65.241:8500
    #  Similarly, on a separate host, which is also supposed to host nodes of this swarm, running systemd, with network interface `eth0` and IP address `10.1.65.242`,  the line would look like:
    ExecStart=/usr/bin/docker daemon -H fd:// -H tcp://10.1.65.242:2375 --cluster-advertise eth0:2375 --cluster-store consul://10.1.65.241:8500
  • Launch a key-value store on the master host:

    $ docker run -d -p 8500:8500 --name=consul progrium/consul -server -bootstrap
  • Launch the swarm manager on the master host:

    $ docker run -d -p 4000:4000 swarm manage -H :4000 --replication --advertise 10.1.65.241:4000 consul://10.1.65.241:8500
  • Join the swarm. For example, ...

    # The master host (10.1.65.241) would do:
    $ docker run -d swarm join --advertise=10.1.65.241:2375 consul://10.1.65.241:8500
    # The other host (10.1.65.242) would do::
    $ docker run -d swarm join --advertise=10.1.65.242:2375 consul://10.1.65.241:8500
  • Create the overlay network on any one host (it will be available on all hosts which have joined the swarm):

    $ docker network create -d overlay ddag-net

Running modules as docker containers

For each module, do the following (replace abc with the name of the module and 10.1.65.241 with IP address of the swarm manager)

$ docker -H 10.1.65.241:4000 run -dit --name abc --hostname abc --net ddag-net nehaljwani/ddag-base:latest /bin/bash
$ docker -H 10.1.65.241:4000 cp modules/abc abc:/
$ docker -H 10.1.65.241:4000 exec -d abc bash -c 'cd /abc ; hypnotoad api.pl'

For the public API end point, do: (replace 10.1.65.241 with IP address of the swarm manager)

$ docker -H 10.1.65.241:4000 run -dit --name public --hostname public --net ddag-net nehaljwani/ddag-base:latest /bin/bash
$ docker -H 10.1.65.241:4000 cp modules/public.pl public:/
$ docker -H 10.1.65.241:4000 exec -d abc bash -c 'hypnotoad public.pl'

Launch the redis key-value store (replace 10.1.65.241 with IP address of the swarm manager)

$ docker -H 10.1.65.241:4000 run -dit --name redis --hostname redis --net ddag-net redis

Querying distributed modules

To process the disconnected directed acyclic graph as shown in the picture above, create the file: /tmp/input.txt with the contents:

$ cat /tmp/input.txt
{
  "edges": {
    "input1": [
      "abc_1"
    ],
    "input2": [
      "abc_1"
    ],
    "input3": [
      "efg_1"
    ],
    "abc_1": [
      "bcd_1",
      "cde_1"
    ],
    "bcd_1": [
      "def_1"
    ],
    "cde_1": [
      "def_1"
    ],
    "def_1": [
      "bcd_2"
    ]
  },
  "data": {
    "input1": "Hello! This is Nehal ",
    "input2": "Hi! This is J ",
    "input3": "Hola! This is Wani "
  }
}

and then type the following to find out the IP address of the container by the name 'public':

$ docker -H 10.1.65.241:4000 exec -it public bash -c 'ip a'

and finally, query it (replace 172.18.0.2 with the IP of the 'public' container). Sample run:

$ curl -s -H Expect: 172.18.0.2 --data "@/tmp/input.txt" | jq .
{
  "bcd_1": "hi! this is j hello! this is nehal ",
  "input2": "Hi! This is J ",
  "efg_1": "Hola! This Is Wani ",
  "cde_1": "HI! THIS IS J HELLO! THIS IS NEHAL ",
  "abc_1": "Hi! This is J Hello! This is Nehal ",
  "bcd_2": " lahen si siht !olleh j si siht !ih: lahen si siht !olleh j si siht !ih",
  "input1": "Hello! This is Nehal ",
  "input3": "Hola! This is Wani ",
  "def_1": " lahen si siht !olleh j si siht !ih: LAHEN SI SIHT !OLLEH J SI SIHT !IH"
}

Happy DDAG-ing! :D

About

A sample setup for processing non linear workflows using docker swarm

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages