Jae Hyeon Bae edited this page Nov 21, 2013 · 9 revisions

Suro is a distributed data pipeline that provides services for moving, aggregating, routing, and storing data. Its design focuses on easy configuration and operation across multiple data sources. It supports the following features:

  • The client can send any type of Java object to remote servers using a customizable SerDe, through either asynchronous or synchronous communication.
  • Message routing and sink behavior are determined by the routing key contained in each message, and both can be reconfigured dynamically without restarting the server.
  • Currently, the following sinks are provided:
    • Local file sink: Stores data on the local file system. It supports text files and Hadoop sequence files.
    • S3 file sink: Uploads files generated by the local file sink to AWS S3.
    • Kafka sink: Sends messages to a Kafka 0.8 broker.
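
To make the routing-key idea more concrete, here is a minimal Java sketch of how messages might be dispatched to sinks by routing key, with the routing table swapped at runtime instead of on restart. It is not Suro's actual API: the names `RoutingSketch`, `SerDe`, `StringSerDe`, `Message`, `Sink`, `InMemorySink`, and `Router` are all hypothetical and exist only for illustration, and the in-memory sink stands in for the real file, S3, and Kafka sinks.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CopyOnWriteArrayList;

public class RoutingSketch {
    /** Hypothetical SerDe: converts an object to bytes and back. */
    interface SerDe<T> {
        byte[] serialize(T value);
        T deserialize(byte[] payload);
    }

    /** Example SerDe for plain strings, encoded as UTF-8. */
    static final class StringSerDe implements SerDe<String> {
        public byte[] serialize(String value) {
            return value.getBytes(StandardCharsets.UTF_8);
        }
        public String deserialize(byte[] payload) {
            return new String(payload, StandardCharsets.UTF_8);
        }
    }

    /** Hypothetical message: a routing key plus a serialized payload. */
    static final class Message {
        final String routingKey;
        final byte[] payload;
        Message(String routingKey, byte[] payload) {
            this.routingKey = routingKey;
            this.payload = payload;
        }
    }

    /** Hypothetical sink: any destination that accepts messages. */
    interface Sink {
        void write(Message message);
    }

    /** Stand-in sink that only records what it receives. */
    static final class InMemorySink implements Sink {
        final List<Message> received = new CopyOnWriteArrayList<>();
        public void write(Message message) {
            received.add(message);
        }
    }

    /** Looks up sinks by routing key; the table can be replaced while running. */
    static final class Router {
        // The whole table is replaced in one volatile write, so readers
        // always see either the old or the new configuration, never a mix.
        private volatile Map<String, List<Sink>> routes = Collections.emptyMap();

        void reconfigure(Map<String, List<Sink>> newRoutes) {
            Map<String, List<Sink>> copy = new HashMap<>();
            for (Map.Entry<String, List<Sink>> e : newRoutes.entrySet()) {
                copy.put(e.getKey(), new ArrayList<>(e.getValue()));
            }
            routes = Collections.unmodifiableMap(copy);
        }

        /** Delivers the message to every matching sink; returns how many. */
        int route(Message message) {
            List<Sink> sinks =
                routes.getOrDefault(message.routingKey, Collections.<Sink>emptyList());
            for (Sink sink : sinks) {
                sink.write(message);
            }
            return sinks.size();
        }
    }

    public static void main(String[] args) {
        SerDe<String> serde = new StringSerDe();
        InMemorySink fileLike = new InMemorySink();
        InMemorySink kafkaLike = new InMemorySink();
        Router router = new Router();

        Map<String, List<Sink>> config = new HashMap<>();
        config.put("request_trace", Arrays.<Sink>asList(fileLike));
        router.reconfigure(config);
        router.route(new Message("request_trace", serde.serialize("hello")));

        // Add a second sink for the same key without restarting anything.
        config.put("request_trace", Arrays.<Sink>asList(fileLike, kafkaLike));
        router.reconfigure(config);
        router.route(new Message("request_trace", serde.serialize("world")));

        System.out.println(fileLike.received.size() + " " + kafkaLike.received.size());
    }
}
```

In this sketch the first message reaches only the file-like sink, and the second reaches both sinks after the table is swapped. A real deployment would presumably load the routing table from external configuration; that part is left out here.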