Skip to content

zerebral/aerospike-loader

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

87 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#Aerospike Loader

Aerospike Loader parses a set of .CSV files and loads the data into Aerospike server.

Prerequisites

  • Java 1.6 or higher
  • Maven 2.3.2

Installation

  • Source code is available on git-hub:

      $ git clone [email protected]:aerospike/aerospike-loader.git
    
  • Then build the utility by running following:

      $ cd aerospike-loader
      $ mvn clean install
    

##Dependencies Following dependencies are downloaded automatically:

  • Aerospike Java client 3.0.22 or greater
  • Apache commons cli 1.2
  • Log4j 1.2.14
  • Junit 4.4
  • Json-simple 1.1.1

Usage

Use run_loader script along with options and data files.

    $ ./run_loader <options> <data file name(s)/directory>

"data file name(s)/directory" can either be space delimited files or a directory name containing data files. See "Data Files" section for more details.

Options:

-c,--config <arg>                Column definition file name
-ec,--abort-error-count <arg>    Error threshold to abort (default: 0)
-et,--expiration-time <arg>      Expiration time of records in seconds (default: never expire)
-h,--host <arg>                  Server hostname (default: localhost)
-n,--namespace <arg>             Namespace (default: test)
-p,--port <arg>                  Server port (default: 3000)
-rt,--read-threads <arg>         Number of read threads (default: Number of cores * 1)
-s,--set <arg>                   Set name. (default: null)
-T,--timezone <arg>              Time zone of data backup source (default: local timezone)
-tt,--transaction-timeout <arg>  Write transaction timeout in milliseconds(default: No timeout)
-u,--usage                       Print usage
-v,--verbose                     Detailed log
-wa,--write-action <arg>         Write action (default: update)
-wt,--write-threads <arg>        Number of write threads (default: Number of cores * 5)

For more details, refer to Options.

###Sample usage of all options:

    $ ./run_loader -h nodex -p 3000 -n test -s demo -tt 3000 -et 3600 -et 2592000 -ec 100 -rt 4 -wt 20 -T PST -wa update -c ~/pathto/config.json datafiles/

Where:

Server IP:                                  nodex (-h)
Port:                                       3000 (-p)
Namespace:                                  test (-n) 
Set:                                        demo (-s)
Read Threads:                               4 (-rt)
Write Threads:                              20 (-wt)
Write Operation Timeout (in milliseconds):  3000 (-tt)
Write Error Threshold:                      100 (-ec)
Record Expiration:                          2592000 (-et)
Timezone:                                   PST (-T)
Write Action:                               update (-wa) 
Data Mapping:                               ~/pathto/config.json (-c)
Data Files:                                 datafiles/

Demo example

Example directory contains two files: allDatatype.json and data.csv. Run the following command to load data from data file data.csv.

./run_loader -h localhost -c example/allDatatype.json example/data.csv

For more examples, see the examples/README.md.

About

Aerospike Data Loader

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 100.0%