Skip to content

A boilerplate for kickstarting a production Crunch processor

Notifications You must be signed in to change notification settings

jondot/crunch-starter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crunch-Starter

This is a starter project for Crunch, the Go based ETL toolkit for Hadoop.

Quick Start

Clone or download this project

$ git clone --depth 1 https://github.com/jondot/crunch-starter
$ rm -rf .git

Hack on main.go (or use a different starting point from /samples)

row := crunch.NewRow()
row.FieldWithDefault("ip", "", makeQuery("x-forwarded-for"), transform.AsIs)
row.FieldWithDefault("ev_ts", "", makeQuery("head.timestamp"), transform.AsIs)
row.FieldWithDefault("ev_json", "", makeQuery("action"), transform.AsJson)
:
:
etc.

Build and generate stubs

$ make
built "process"
$ make stubs
built "process".
2014/11/19 00:04:22 Generated: crunch.pig
2014/11/19 00:04:22 Generated: crunch.hql

If you want to tweak the default templates, their in /templates

About

A boilerplate for kickstarting a production Crunch processor

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages