
Multiple shards #2

wangkuiyi opened this issue Jan 28, 2016 · 5 comments

wangkuiyi (Contributor) commented Jan 28, 2016

Currently, the backend server runs as a single process that maintains the index data structure in memory. This should work for small-scale problems. But for applications with many documents to index and search, the index data structure may grow so large that we need to partition it into multiple shards and start multiple backend server instances -- each maintaining one shard. This also raises the question of how the client can make a single RPC call to add documents to the correct shard (or to all relevant shards).
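One common way to route a document to a shard is to hash its ID. A minimal sketch in Go (the function name `shardFor` and the shard count are illustrative, not part of any existing code here):

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// shardFor deterministically maps a document ID to one of n shards,
// so every client agrees on which backend instance owns a document
// without consulting a central lookup table.
func shardFor(docID string, n int) int {
	h := fnv.New32a()
	h.Write([]byte(docID))
	return int(h.Sum32() % uint32(n))
}

func main() {
	for _, id := range []string{"doc-1", "doc-2", "doc-3"} {
		fmt.Printf("%s -> shard %d\n", id, shardFor(id, 4))
	}
}
```

A trade-off to keep in mind: plain modulo hashing reshuffles most documents when the shard count changes, which is why systems that resize often prefer consistent hashing.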

wangkuiyi changed the title to Multiple shards and replica, then to Multiple shards, on Jan 28, 2016
wangkuiyi (Contributor, Author) commented:

Usually, people split the inverted index by documents. This makes each posting list shorter. Also, because we don't split by terms, we don't need to split a query and send its terms to multiple computers.


brjg commented Jan 29, 2016

Is this proposal good?

  1. launch a master server
  2. the master launches n index servers; each of them indexes one document shard
  3. the master server accepts queries; each query is sent to all index servers
  4. the term frequency on each index is calculated as usual
  5. the idf is calculated using the global number of documents
  6. the results returned by the index servers are merged and sorted by score
  7. when adding a document, it is assigned to a random index shard, and the corresponding index server is called to add it
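Step 3 above (fan out each query to all index servers, then merge) could look like the following sketch in Go. The `Hit` and `searchShard` types are hypothetical stand-ins for the real RPC interface:

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// Hit is one scored result returned by an index server.
type Hit struct {
	DocID string
	Score float64
}

// searchShard stands in for an RPC call to one index server.
type searchShard func(query string) []Hit

// fanOut sends the query to every index server concurrently and
// returns the merged results, sorted by descending score.
func fanOut(query string, shards []searchShard) []Hit {
	var mu sync.Mutex
	var merged []Hit
	var wg sync.WaitGroup
	for _, s := range shards {
		wg.Add(1)
		go func(s searchShard) {
			defer wg.Done()
			hits := s(query)
			mu.Lock()
			merged = append(merged, hits...)
			mu.Unlock()
		}(s)
	}
	wg.Wait()
	sort.Slice(merged, func(i, j int) bool { return merged[i].Score > merged[j].Score })
	return merged
}

func main() {
	// Two fake index servers standing in for RPC endpoints.
	shards := []searchShard{
		func(q string) []Hit { return []Hit{{"d1", 0.9}, {"d3", 0.2}} },
		func(q string) []Hit { return []Hit{{"d2", 0.5}} },
	}
	for _, h := range fanOut("hello", shards) {
		fmt.Println(h.DocID, h.Score)
	}
}
```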

Some questions here:

  1. does the master maintain a file that tells where each document shard is located?
  2. should every index server report its status periodically? If one dies, the master needs to restart it. And what if the master dies?
  3. does the master need a queue to process queries?


yitopic commented Jan 30, 2016

I thought about this for a while, and I agree with everything you proposed. A few thoughts that I hope are complementary:

About 2.

I am afraid the search master process (or root) has no way to create processes on remote computers; usually we'd have to use SSH, MPI, or Kubernetes to start remote processes. So there are generally two ways to let the root know about the index servers (or leaves):

  1. We start the root before starting the leaves, and the leaves, when started, send their network addresses to the root via RPC. The leaves can learn the root's network address via command-line flags. This way, the root must be started and ready to accept RPC calls before we start the leaves. We can use https://godoc.org/github.com/wangkuiyi/healthz#OK to check whether the root has started and is ready for RPC calls.
  2. All leaves register their network addresses under a certain etcd directory known to the root, and the root watches changes to that directory so as to learn about newly joined index servers. This way, the root can be started or restarted before or after the leaves start or restart. Notably, the official etcd client package in Go was too complex for me to use, so I wrote a much simplified version: https://github.com/wangkuiyi/etcd

It seems that the second approach also addresses your question 2, as etcd is highly available (a Raft-based service that keeps running as long as a quorum of its members survives).

An interesting detail of approach 2 is that when a leaf registers itself with etcd, it actually writes a key-value pair, which can have a TTL (time-to-live) attached. The leaf can start a goroutine that updates the key-value pair and its TTL periodically. Without this periodic update, etcd removes the registration information after the TTL expires, which indicates the death of the leaf.

About 4. and 5.

I know that weak-and is able to do TF-IDF based retrieval, though I have not done this in my previous work. It is great that you are thinking about this!

About 7.

I understand your point as: a new document can be added to any leaf, as long as every leaf keeps all of its posting lists sorted by document ID.
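Keeping a posting list sorted on insertion could be sketched as follows (using plain int document IDs; `insertPosting` is an illustrative name, not existing code):

```go
package main

import (
	"fmt"
	"sort"
)

// insertPosting inserts docID into a posting list while keeping the
// list sorted by document ID, so lists from different leaves can
// later be intersected or merged in a single linear pass.
func insertPosting(list []int, docID int) []int {
	i := sort.SearchInts(list, docID) // binary search for the slot
	if i < len(list) && list[i] == docID {
		return list // already present
	}
	list = append(list, 0)     // grow by one
	copy(list[i+1:], list[i:]) // shift the tail right
	list[i] = docID
	return list
}

func main() {
	list := []int{}
	for _, id := range []int{42, 7, 19, 7} {
		list = insertPosting(list, id)
	}
	fmt.Println(list) // IDs stay sorted and deduplicated: [7 19 42]
}
```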

In the future, we might want to implement more sophisticated logic -- adding a new document to the index server that consumes the least memory, or even starting a new leaf if all existing leaves are running out of their memory quota. But that is for the future.

About Question 1.

Does "file" here mean files on disk? I think the answer is yes -- at least, each index server should checkpoint its ForwardIndex periodically to disk files or to AWS S3 (which is actually HDFS, I think).

About Question 2.

I think etcd plus checkpointing looks like a solution -- the root checkpoints its status into etcd, the leaves register themselves in etcd, and the leaves checkpoint their indexes into S3/HDFS files.

About Question 3.

I don't feel that the root needs a queue of queries. But for each query, I think the root would need to maintain a heap to merge the result lists from all leaves.
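Assuming each leaf returns its hits already sorted by descending score, the root can merge the k sorted lists with `container/heap` keyed on the score at the head of each list. A sketch (all names here are illustrative):

```go
package main

import (
	"container/heap"
	"fmt"
)

// Hit is one scored result from a leaf.
type Hit struct {
	DocID string
	Score float64
}

// cursor tracks the current read position in one leaf's sorted list.
type cursor struct {
	list []Hit
	pos  int
}

// mergeHeap is a max-heap of cursors ordered by their current score.
type mergeHeap []*cursor

func (h mergeHeap) Len() int { return len(h) }
func (h mergeHeap) Less(i, j int) bool {
	return h[i].list[h[i].pos].Score > h[j].list[h[j].pos].Score
}
func (h mergeHeap) Swap(i, j int)       { h[i], h[j] = h[j], h[i] }
func (h *mergeHeap) Push(x interface{}) { *h = append(*h, x.(*cursor)) }
func (h *mergeHeap) Pop() interface{} {
	old := *h
	c := old[len(old)-1]
	*h = old[:len(old)-1]
	return c
}

// mergeResults merges per-leaf lists (each sorted by descending
// score) into one globally sorted list in O(total * log k) time.
func mergeResults(lists [][]Hit) []Hit {
	h := &mergeHeap{}
	for _, l := range lists {
		if len(l) > 0 {
			*h = append(*h, &cursor{list: l})
		}
	}
	heap.Init(h)
	var out []Hit
	for h.Len() > 0 {
		c := (*h)[0] // cursor with the highest current score
		out = append(out, c.list[c.pos])
		c.pos++
		if c.pos == len(c.list) {
			heap.Pop(h) // this leaf's list is exhausted
		} else {
			heap.Fix(h, 0) // re-order after advancing the cursor
		}
	}
	return out
}

func main() {
	merged := mergeResults([][]Hit{
		{{"a", 0.9}, {"c", 0.3}},
		{{"b", 0.7}, {"d", 0.1}},
	})
	for _, hit := range merged {
		fmt.Println(hit.DocID, hit.Score)
	}
}
```

In practice the root would stop early once it has the top-k hits, rather than draining every list.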


brjg commented Jan 31, 2016

When I "go get github.com/wangkuiyi/etcd", I got the following error:
../../wangkuiyi/etcd/etcd.go:33: not enough arguments in call to transport.NewTransport

@brjg Please review my fix: https://github.com/wangkuiyi/etcd/pull/1

@brjg I merged it. You should be able to remove the previous checkout of github.com/wangkuiyi/etcd and go get it again now.


yitopic commented Jan 31, 2016

That is because github.com/coreos/etcd has had some recent changes. I updated github.com/wangkuiyi/etcd to keep up with that change. Please review: https://github.com/wangkuiyi/etcd/pull/1

