forked from LLNL/magpie
-
Notifications
You must be signed in to change notification settings - Fork 0
/
TODO
30 lines (24 loc) · 718 Bytes
/
TODO
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Minor feature support
---------------------
Hadoop
- namenode failover w/ shared dir mechanism
- start w/ 2.1.1-beta
Ganglia
- if user wants aid in monitoring
Hbase
- multiple masters
Spark
- multiple masters
- "PER_NODE" variables, not correct
Storm + Trident
Tachyon
- fault tolerance (zookeeper + multiple masters, etc.)
- multiple layer hierarchy support
- localfilesystem underfs support
ideas
-----
failover support
- if too many nodes fail, recover through fake re-bringup?
- e.g. rank 10 node dies, have extra nodes to re-take over rank 10
- need to re-arch to make all nodes get rank based off file instead of actual rank
- would need daemon-ish thing to monitor situation, adjust config files, etc.?