Resque pool is a simple library for managing a pool of resque workers. Given a
a config file (resque-pool.yml
or config/resque-pool.yml
), it will manage
your workers for you, starting up the appropriate number of workers for each.
- Less memory consumption - If you are using Ruby Enterprise Edition, or any ruby with copy-on-write safe garbage collection, this could save you a lot of memory when you are managing many workers.
- Simpler (less) config - If you are using monit or an init script to start up your workers, you can start up one pool, and it will manage your workers for you.
- Faster startup - if you are starting many workers at once, you would normally have them competing for CPU as they load their environments. Resque-pool can load the environment once and almost instantaneously fork all of the workers.
To configure resque-pool, you can use either resque-pool.yml
or
config/resque-pool.yml
. To use resque-pool, require its rake tasks
(resque/pool/tasks
) in your rake file, and call the resque:pool
task.
The YAML file supports both using root level defaults as well as environment
specific overrides (RACK_ENV
, RAILS_ENV
, and RESQUE_ENV
environment
variables will be used to determine environment). For example, to use
resque-pool with rails, in config/resque-pool.yml
:
foo: 1
bar: 2
"foo,bar,baz": 1
production:
"foo,bar,baz": 4
and in lib/tasks/resque.rake
:
require 'resque/pool/tasks'
# this task will get called before resque:pool:setup
# preload the rails environment in the pool master
task "resque:setup" => :environment do
# generic worker setup, e.g. Hoptoad for failed jobs
end
task "resque:pool:setup" do
# close any sockets or files in pool master
ActiveRecord::Base.connection.disconnect!
# and re-open them in the resque worker parent
Resque::Pool.after_prefork do |job|
ActiveRecord::Base.establish_connection
end
end
Then you can start the queues via:
rake resque:pool RAILS_ENV=production VERBOSE=1
This will start up seven worker processes, one exclusively for the foo queue, two exclusively for the bar queue, and four workers looking at all queues in priority. This is similar to if you ran the following:
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo &
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=bar &
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=bar &
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz &
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz &
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz &
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz &
Resque already forks for its own child processes. The pool master will stay
around monitoring the resque worker parents, giving three levels: a single pool
master, many worker parents, and a worker child per worker (when the actual job
is being processed). For example, ps -ef f | grep [r]esque
(in Linux) might
return something like the following:
rails 13858 1 0 13:44 ? S 0:02 resque-pool-master: managing [13867, 13875, 13871, 13872, 13868, 13870, 13876]
rails 13867 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for foo
rails 13868 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for bar
rails 13870 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for bar
rails 13871 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for foo,bar,baz
rails 13872 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Forked 7481 at 1280343254
rails 7481 13872 0 14:54 ? S 0:00 \_ resque-1.9.9: Processing foo since 1280343254
rails 13875 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for foo,bar,baz
rails 13876 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Forked 7485 at 1280343255
rails 7485 13876 0 14:54 ? S 0:00 \_ resque-1.9.9: Processing bar since 1280343254
An example startup script, which redirects STDOUT and STDERR and creates a pid file, is given in the examples directory.
The pool master responds to the following signals:
HUP
- reload the config file, e.g. to change the number of workers per queue listQUIT
- sendQUIT
to each worker parent and shutdown the master after all workers are done.INT
- sendQUIT
to each worker parent and immediately shutdown masterTERM
- sendTERM
to each worker parent and immediately shutdown masterWINCH
- sendQUIT
to each worker, but keep master running (sendHUP
to reload config and restart workers)USR1
/USR2
/CONT
- send the signal on to all worker parents (see Resque docs).
After a HUP
, workers that are no longer needed will be gracefully shutdown
via QUIT
.
Workers will watch the pool master, and gracefully shutdown if the master process dies (for whatever reason) before them.
You can specify an alternate config file by setting the RESQUE_POOL_CONFIG
environment variable like so:
rake resque:pool RESQUE_ENV=production RESQUE_POOL_CONFIG=/path/to/my/config.yml
- do appropriate logging (e.g. all to one logfile, each queue to its own
logfile, or each worker to its own logfile). Logfile location must be
configurable, but default to
log/resque-pool.log
. Of course, since resque "logs" by writing to $stdout, this is really no more than redirecting stdout to the appropriate logfile. - (optionally) daemonize, setting a PID file somewhere. configurable, of
course, but default to
tmp/pids/resque-pool.pid
. - recover gracefully from a malformed config file (on startup and HUP)
- procline for malformed config file, graceful shutdown... and other states?
- figure out a good automated way to test this (cucumber or rspec?)
- clean up the code (I stole most of it from unicorn, and it's still a bit bastardized); excessive use of vim foldmarkers are a code smell.
- rdoc
- incorporate resque-batchworker features? (v2.0)
- web interface for adding and removing workers (etc) (v2.0)
- John Schult (config file can be split by environment)