This repository contains annotated example code to accompany the AWS re:Invent 2015 presentation DEV309: "Large Scale Metrics Analysis in Ruby".
These are executable Ruby files that can be used with Hadoop Streaming on Amazon Elastic MapReduce clusters. They're made to work with a particular log file format shown in the presentation. However, like most of these examples, they can serve as useful guideposts when you design your own purpose-built mappers and reducers.
Shows an example of a control plane for log processing jobs, that can support job batching. Includes an example of how this control plane might be called for a simple example job.
Shows some Redshift cluster management operations, based on the pattern used in the presentation.