Assignment 2: Adding Durability to the Graph Store


Introduction

In this lab, we'll add durability to the graph store you built in Lab 1. We'll use a simple logging design, where the graph is stored entirely in main memory, but each incoming update is logged to raw disk. Occasionally the graph is checkpointed to disk and the log is garbage collected. We'll run the system in Google Cloud using a virtual block device in raw mode (i.e. reading and writing blocks directly without first mounting a filesystem on the device).

Change List

(updates are marked with (NEW) in the text)

  1. Added suggestion to use getopt in the coding tips.

  2. Added instructions on how to add firewall rules to enable remote HTTP requests.

Google Cloud Instructions

  1. Sign up for a Google Cloud account using your Gmail account.

  2. Redeem the education credit using the promotion code after you have signed up for Google Cloud; click here.

  3. Create your own VM (quick start guide).

The detailed specification of the VM is as follows:

[Screenshot: VM machine configuration]

Then click "Management, disk, networking, SSH keys" to add an additional disk, using the following specification:

[Screenshot: additional disk configuration]

After adding the extra disk, change the "When deleting instance" option from the default "Keep disk" to "Delete disk". After going through all of the steps above, your creation page should look like the following. Then click the "Create" button to create your VM.

[Screenshot: completed VM creation page]

  4. Update your SSH public-private key pair and SSH to your VM; click here for detailed instructions.

  5. (NEW) To enable remote HTTP requests, you need to go here and add new firewall rules allowing traffic to/from the specific port. For example, if your server is running on port 8000, add a rule that allows tcp:8000 traffic from all source IPs.
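
For example, assuming you have the gcloud CLI set up, a rule like the following (the rule name here is arbitrary) allows tcp:8000 from all source IPs:

$ gcloud compute firewall-rules create allow-graph-server --allow tcp:8000 --source-ranges 0.0.0.0/0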

API Changes

The service retains the API described in Lab 1, with two differences.

First, any mutating command (add_node, add_edge, remove_node, remove_edge) should return error code 507 if the log is full and there is no space for logging the new entry. This tells the client that checkpointing is required.

Second, we expose a new checkpoint command:

Function | Method | Arguments | Return
--- | --- | --- | ---
checkpoint | POST | | 200 on success; 507 if there is insufficient space for the checkpoint
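
As a rough sketch of where the 507s come from (the helper names here are hypothetical, and how a status code becomes an HTTP response depends on your Lab 1 server):

```c
/* Sketch of the two 507 cases; the extern helpers are assumptions, not a
 * required API. */
#include <stdbool.h>

#define HTTP_OK        200
#define HTTP_NO_SPACE  507

extern bool log_has_room(void);     /* room for one more entry in the log area?        */
extern bool checkpoint_fits(void);  /* serialized graph fits in the checkpoint area?   */
extern void append_log_entry(int opcode, unsigned long long a, unsigned long long b);
extern void write_checkpoint(void);

int handle_mutation(int opcode, unsigned long long a, unsigned long long b)
{
    if (!log_has_room())
        return HTTP_NO_SPACE;       /* client should issue a checkpoint */
    append_log_entry(opcode, a, b);
    /* ... also apply the mutation to the in-memory graph ... */
    return HTTP_OK;
}

int handle_checkpoint(void)
{
    if (!checkpoint_fits())
        return HTTP_NO_SPACE;
    write_checkpoint();             /* then bump generation and reset the log tail */
    return HTTP_OK;
}
```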

On the command line, accept an additional parameter specifying the device file in addition to the port:

$ ./cs426_graph_server <port> <devfile>

Note: To find the disk you added, use the lsblk command to list all of the disks in your VM. Typically, if you followed the Google Cloud instructions above, the disk serving as the checkpoint disk is /dev/sdb. To get permission to open the disk as a file and write to it, you may need to change its access permissions using chmod.
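
For example, a minimal sketch of opening the device in raw mode (the devfile path comes from the command line; O_DIRECT is discussed again under Code pointers):

```c
/* Sketch: open the raw block device for direct I/O.
 * devfile comes from argv, e.g. "/dev/sdb". */
#define _GNU_SOURCE           /* exposes O_DIRECT on Linux */
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>

int open_device(const char *devfile)
{
    int fd = open(devfile, O_RDWR | O_DIRECT);
    if (fd < 0) {
        /* Common causes: wrong path (check with lsblk), or missing
         * permissions (fix with chmod on the device node). */
        perror("open devfile");
        exit(1);
    }
    return fd;
}
```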

Also, support a -f flag that formats/initializes a new deployment (i.e. writes a fresh version of the superblock):

$ ./cs426_graph_server -f <port> <devfile>

Note: Since the disk you add starts out filled with arbitrary bits, you need to format it yourself first.
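
A possible getopt-based sketch of this command-line handling (see also the code pointers below); the exact structure is up to you:

```c
/* Sketch: parse [-f] <port> <devfile> with getopt. */
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>

int main(int argc, char **argv)
{
    int format = 0;
    int opt;

    while ((opt = getopt(argc, argv, "f")) != -1) {
        if (opt == 'f') {
            format = 1;
        } else {
            fprintf(stderr, "usage: %s [-f] <port> <devfile>\n", argv[0]);
            return 1;
        }
    }
    if (argc - optind != 2) {
        fprintf(stderr, "usage: %s [-f] <port> <devfile>\n", argv[0]);
        return 1;
    }

    int port = atoi(argv[optind]);
    const char *devfile = argv[optind + 1];

    printf("port=%d devfile=%s format=%d\n", port, devfile, format);
    /* ... open the device, format it if requested, start the server ... */
    return 0;
}
```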

Protocol Format

There are many different valid protocols. Here we outline one such protocol and a couple of alternatives. You are free to use your own protocol -- just make sure it'll continue to work under the failure and workload assumptions! In particular, for this homework, we assume failures will occur when there's no ongoing activity in the system.

The first block is a superblock for storing metadata. Here, store the location and size of the log segment. The remainder of the disk is used for the checkpoint. A 10GB disk has 2.5M 4KB blocks. Let block 0 be the metadata superblock. For this lab, initialize the log size to be the first 2GB (including the superblock), and use the remaining 8GB as the checkpoint area.
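
In code, that layout might be captured with constants like these (a sketch; the names are up to you, and 1GB is taken as 2^30 bytes):

```c
/* Sketch of on-disk layout constants for a 10GB device. */
#define BLOCK_SIZE        4096ULL                         /* 4KB blocks                  */
#define DISK_SIZE         (10ULL << 30)                   /* 10GB device                 */
#define NUM_BLOCKS        (DISK_SIZE / BLOCK_SIZE)        /* 2,621,440 (~2.5M) blocks    */

#define SUPERBLOCK_BLOCK  0ULL                            /* block 0: metadata           */
#define LOG_SIZE_BYTES    (2ULL << 30)                    /* first 2GB, incl. superblock */
#define LOG_SIZE_BLOCKS   (LOG_SIZE_BYTES / BLOCK_SIZE)   /* 524,288 blocks              */

#define CHECKPOINT_START  LOG_SIZE_BLOCKS                 /* checkpoint area begins here */
#define CHECKPOINT_BLOCKS (NUM_BLOCKS - LOG_SIZE_BLOCKS)  /* remaining ~8GB              */
```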

Protocol 0: Here we store the current generation number in the metadata block, and always reset the log tail to the start on a checkpoint or a format. The process maintains the current generation number and tail of the log in memory.

--- On a format, you read the superblock and check if it's valid. If valid, you read the current generation number from the superblock, increment it, and write it back out. If invalid, this store has never been used before; you write a brand new superblock with a generation number of 0. (There are corner cases where the superblock gets corrupted, and formatting it with a generation number of 0 can cause problems due to valid data blocks in the log from the earlier instance; for this homework, assume that superblock corruptions don't occur).

--- On a normal startup, you read the superblock and check if it's valid. If not valid, exit with an error code. If valid, locate the checkpoint using the superblock, and read it in if one exists. Read the generation number from the superblock. Play the log forward until you encounter the first log entry that's invalid or has a different generation number.

--- On a checkpoint call, write out the checkpoint. (You can assume that crashes don't occur during the checkpointing process). Increment the generation number in the superblock. Reset the in-memory tail counter to 0.

byte 0: 32-bit unsigned int: current generation (e.g. generation 5)
byte 4: 64-bit unsigned int: checksum (e.g. XOR of all 8-byte words in the block)
byte 12: 32-bit unsigned int: start of log segment (e.g. block 1)
byte 16: 32-bit unsigned int: size of log segment (e.g. 250K blocks)
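
A packed C struct matching this layout might look like the following (the names are illustrative; the packed attribute matters because the 64-bit checksum sits at the unaligned offset 4):

```c
#include <stdint.h>

/* Sketch of the superblock header as laid out above. */
struct superblock {
    uint32_t generation;   /* byte 0  */
    uint64_t checksum;     /* byte 4  */
    uint32_t log_start;    /* byte 12 */
    uint32_t log_size;     /* byte 16 */
} __attribute__((packed)); /* no compiler padding between fields */

_Static_assert(sizeof(struct superblock) == 20,
               "superblock header must match the on-disk layout");
```

The struct covers only the first 20 bytes; reads and writes still transfer the full 4KB block, with this struct overlaid on the (aligned) buffer.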

Each 4KB log entry block will have:

byte 0: 32-bit unsigned int: generation number
byte 4: 32-bit unsigned int: number of entries in block
byte 8: 64-bit unsigned int: checksum
byte 16: first 20-byte entry
byte 36: second 20-byte entry
...

Each log entry will have a 4-byte opcode (0 for add_node, 1 for add_edge, 2 for remove_node, 3 for remove_edge) and two 64-bit node IDs (only one of which is used for add_node and remove_node).
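
As a sketch (again with illustrative names), the entry and block layouts above translate into structs like these; the 16-byte header leaves room for 204 20-byte entries per 4KB block:

```c
#include <stdint.h>

enum { OP_ADD_NODE = 0, OP_ADD_EDGE = 1, OP_REMOVE_NODE = 2, OP_REMOVE_EDGE = 3 };

/* One 20-byte log entry. */
struct log_entry {
    uint32_t opcode;           /* 0..3 as defined above              */
    uint64_t node_a;           /* used by all operations             */
    uint64_t node_b;           /* unused for add_node / remove_node  */
} __attribute__((packed));

#define LOG_BLOCK_SIZE     4096
#define ENTRIES_PER_BLOCK  ((LOG_BLOCK_SIZE - 16) / sizeof(struct log_entry))  /* 204 */

/* One 4KB log block: 16-byte header followed by packed entries. */
struct log_block {
    uint32_t generation;       /* byte 0  */
    uint32_t num_entries;      /* byte 4  */
    uint64_t checksum;         /* byte 8  */
    struct log_entry entries[ENTRIES_PER_BLOCK];   /* byte 16 onwards */
} __attribute__((packed));

_Static_assert(sizeof(struct log_entry) == 20, "log entry is 20 bytes");
_Static_assert(sizeof(struct log_block) == LOG_BLOCK_SIZE, "log block fills 4KB");
```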

Note: the checksum can fail (i.e. say the block is valid when the block is actually invalid) for a small number of random bit patterns. Unfortunately the all-zero case -- which is a common bit pattern for unwritten blocks on disk -- happens to be one where the checksum fails. To get around this, either write a constant 'magic number' in each block (just a constant number in each valid block that ensures that all-zeros is not a valid block), or add a constant value to the checksum function (i.e., the checksum is the XOR of all previous words plus some fixed value).
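
For example, a seeded checksum might look like this (a sketch; the seed value is arbitrary, and it assumes the caller zeroes the block's checksum field before computing or verifying):

```c
#include <stddef.h>
#include <stdint.h>

#define CHECKSUM_SEED 0xC5426C5426ULL   /* arbitrary non-zero constant */

/* XOR of all 8-byte words in the block, seeded with a fixed constant, so
 * that an all-zero (unwritten) block never matches its stored checksum of 0. */
uint64_t block_checksum(const void *block, size_t block_size)
{
    const uint64_t *words = block;      /* block must be 8-byte aligned */
    size_t nwords = block_size / sizeof(uint64_t);
    uint64_t sum = CHECKSUM_SEED;

    for (size_t i = 0; i < nwords; i++)
        sum ^= words[i];
    return sum;
}
```

To verify a block, recompute the checksum with the stored field temporarily zeroed and compare it against the stored value.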

Alternative Schemes:

Here are two other schemes. One involves storing the tail persistently in the superblock and updating it on every append. This is actually not a bad scheme if each log block holds multiple log entries, since you only need to update the superblock when the block currently being appended to changes. In this scheme you don't need generation numbers if you reset the log to the start on checkpoints / formats. This scheme is okay for this homework; its weakness is that if the log entries are large, it becomes very inefficient.

A second scheme involves tracking the head of the log explicitly and updating it during checkpoints / formats. Handling wraparound requires generation numbers. We won't be testing wraparound in this homework.
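
Putting Protocol 0 together, here is a rough sketch of the format and startup paths. It assumes the superblock/log block structs, block_checksum(), and layout constants sketched above, plus hypothetical read_block/write_block, load_checkpoint, and apply_entry helpers:

```c
/* Rough sketch of Protocol 0's format and startup paths.
 *   read_block / write_block : aligned 4KB I/O at a given block number (hypothetical)
 *   load_checkpoint          : read the checkpoint area, if one exists (hypothetical)
 *   apply_entry              : replay one log entry against the in-memory graph (hypothetical) */
#include <stdint.h>
#include <string.h>

extern int  read_block(int fd, uint32_t blockno, void *buf);
extern int  write_block(int fd, uint32_t blockno, const void *buf);
extern void load_checkpoint(int fd);
extern void apply_entry(const struct log_entry *e);
extern uint64_t block_checksum(const void *block, size_t block_size);

static uint32_t current_generation;  /* in-memory generation number           */
static uint32_t log_tail;            /* next log block, relative to log_start */

static int superblock_valid(struct superblock *sb)   /* sb overlays a 4KB buffer */
{
    uint64_t stored = sb->checksum;
    sb->checksum = 0;
    int ok = (block_checksum(sb, BLOCK_SIZE) == stored);
    sb->checksum = stored;
    return ok;
}

static int log_block_valid(struct log_block *lb)
{
    uint64_t stored = lb->checksum;
    lb->checksum = 0;
    int ok = (block_checksum(lb, BLOCK_SIZE) == stored);
    lb->checksum = stored;
    return ok;
}

int do_format(int fd, void *buf)     /* buf: one page-aligned 4KB block */
{
    struct superblock *sb = buf;
    read_block(fd, 0, buf);
    if (superblock_valid(sb)) {
        sb->generation++;            /* device was used before: bump generation */
    } else {
        memset(buf, 0, BLOCK_SIZE);  /* never used: start at generation 0       */
        sb->log_start = 1;
        sb->log_size  = LOG_SIZE_BLOCKS;
    }
    sb->checksum = 0;
    sb->checksum = block_checksum(buf, BLOCK_SIZE);
    write_block(fd, 0, buf);
    current_generation = sb->generation;
    log_tail = 0;
    return 0;
}

int do_startup(int fd, void *buf)
{
    struct superblock *sb = buf;
    read_block(fd, 0, buf);
    if (!superblock_valid(sb))
        return -1;                   /* no valid superblock: exit with an error */

    current_generation = sb->generation;
    uint32_t log_start = sb->log_start;
    uint32_t log_size  = sb->log_size;     /* copy out before buf is reused */

    load_checkpoint(fd);             /* if a checkpoint exists */

    for (uint32_t i = 0; i + 1 < log_size; i++) {   /* replay the log */
        read_block(fd, log_start + i, buf);
        struct log_block *lb = buf;
        if (!log_block_valid(lb) || lb->generation != current_generation)
            break;                   /* first invalid or stale block ends replay */
        for (uint32_t j = 0; j < lb->num_entries; j++)
            apply_entry(&lb->entries[j]);
        log_tail = i + 1;
    }
    return 0;
}
```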

Testing Methodology

Provide a Makefile. Your code should run with the following command:

make && ./cs426_graph_server

Here is how we'll test your system:

Step 1: We run the binary with the -f flag and it doesn't crash.

Step 2: We run the Lab 2 tester (/c/cs426/scripts/lab2test.sh) from the zoo and point it at the cloud VM, and it completes without failures (some people may get timeouts on the last test case; this is okay).

Step 3: We run a number of reads and the data is there.

Step 4: We kill the process and restart it. It starts again without crashing.

Step 5: We run a number of reads and the data is there.

Step 6: We issue a checkpoint command. It completes successfully without crashing.

Step 7: We run a number of reads and the data is there.

Code pointers

  1. Use open/read/write (instead of fopen) to bypass the libc streaming functionality. Use the O_DIRECT flag to bypass OS buffering.
  2. Use mmap to allocate memory on page boundaries instead of malloc. Specifically: mmap(NULL, BLOCK_SIZE, PROT_READ | PROT_WRITE, MAP_PRIVATE | MAP_ANONYMOUS, 0, 0).
  3. To manipulate buffers easily, cast them to structs and then read/write members of the structs (a sketch combining tips 1-3 appears below).
  4. (NEW) Use getopt to process command line parameters.
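
Here is a small sketch combining the first three tips: a page-aligned buffer from mmap, a struct overlay, and a direct write of block 0. The superblock struct is the one sketched in the protocol section, and error handling is minimal:

```c
/* Sketch: allocate a page-aligned 4KB buffer with mmap, fill it through a
 * struct overlay, and write it to block 0 of an O_DIRECT file descriptor. */
#define _GNU_SOURCE
#include <fcntl.h>
#include <stdint.h>
#include <string.h>
#include <sys/mman.h>
#include <unistd.h>

#define BLOCK_SIZE 4096

/* As sketched in the protocol section. */
struct superblock {
    uint32_t generation;
    uint64_t checksum;
    uint32_t log_start;
    uint32_t log_size;
} __attribute__((packed));

int write_superblock(const char *devfile)
{
    int fd = open(devfile, O_RDWR | O_DIRECT);          /* tip 1 */
    if (fd < 0)
        return -1;

    /* tip 2: O_DIRECT needs aligned buffers; mmap returns page-aligned memory
     * (fd = -1 with MAP_ANONYMOUS is the portable form). */
    void *buf = mmap(NULL, BLOCK_SIZE, PROT_READ | PROT_WRITE,
                     MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (buf == MAP_FAILED) {
        close(fd);
        return -1;
    }
    memset(buf, 0, BLOCK_SIZE);

    struct superblock *sb = buf;                        /* tip 3: struct overlay */
    sb->generation = 0;
    sb->log_start  = 1;
    sb->log_size   = 524288;                            /* 2GB of 4KB blocks */
    /* A real format would also fill in the checksum (see Protocol Format). */

    ssize_t n = pwrite(fd, buf, BLOCK_SIZE, 0);         /* write block 0 */

    munmap(buf, BLOCK_SIZE);
    close(fd);
    return (n == BLOCK_SIZE) ? 0 : -1;
}
```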

Finding Your push Location

To find your push location, run git remote show origin in your repo from Lab 1. You should get output like:

> git remote show origin
...
* remote origin
  Fetch URL: /c/cs426/SUBMIT/<your dir>
  Push  URL: /c/cs426/SUBMIT/<your dir>
...

If you have already set up a repository, use git remote add origin ssh://<username>@node.zoo.cs.yale.edu/c/cs426/SUBMIT/<your dir> to add the directory you found as the origin for your local one. As in:

> git remote show origin
...
* remote origin
  Fetch URL: /c/cs426/SUBMIT/<your dir>
  Push  URL: /c/cs426/SUBMIT/<your dir>
...
> git remote add origin ssh://<username>@node.zoo.cs.yale.edu/c/cs426/SUBMIT/<your dir>

If you wish to create a new repository, you can use git clone ssh://<username>@node.zoo.cs.yale.edu/c/cs426/SUBMIT/<your dir>, which will set up the origin for you.

After this, git push will send changes back to your submit repo on the zoo.