You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Through the study on OpenAI's model, here is some useful information for developers who wrote their own version of mlstm and try to import OpenAI's model paramters. In mlstm function in encoder.py, defines the tensors' name, this is the baseline.
Computation Graphic and tensor
Under the name scope model, there are three sub name scope:
embedding
tensors: w
out
tensors: w, b
rnn
tensors: b, gh, gmb, gmx, gx, wh, wmh, wmx, wx
The tensors are listed as follow:
tensor_name: model/embedding/w
tensor_name: model/out/b
tensor_name: model/out/w
tensor_name: model/rnn/b
tensor_name: model/rnn/gh
tensor_name: model/rnn/gmh
tensor_name: model/rnn/gmx
tensor_name: model/rnn/gx
tensor_name: model/rnn/wh
tensor_name: model/rnn/wmh
tensor_name: model/rnn/wmx
tensor_name: model/rnn/wx
Table for the correlation between tensor and .npy files
For detailed information about each tensor and which .npy it is correlated, please check the table
Name
Correlated-tensor
Array Shape
npy file index
line of code
params[0]
embedding/w
(256,64)
0
embd, line 23
params[1]
rnn/wx
(64, 16384)
1
mlstm, line 47
params[2]
rnn/wh
(4096, 16384)
hstack 2-5
mlstm, line 48
params[3]
rnn/wmx
(64, 4096)
6
mlstm, line 49
params[4]
rnn/wmh
(4096, 4096)
7
mlstm, line 50
params[5]
rnn/b
(16384,)
8
mlstm, line 51
params[6]
rnn/gx
(16384,)
9
mlstm, line 53
params[7]
rnn/gh
(16384,)
10
mlstm, line 54
params[8]
rnn/gmx
(4096,)
11
mlstm, line 55
params[9]
rnn/gmh
(4096,)
12
mlstm, line 56
params[10]
out/w
(4096, 256)
13
fc, line 31
params[11]
out/b
(256,)
14
fc, line 38
Hopyfully this would help.
The text was updated successfully, but these errors were encountered:
Through the study on OpenAI's model, here is some useful information for developers who wrote their own version of mlstm and try to import OpenAI's model paramters. In mlstm function in encoder.py, defines the tensors' name, this is the baseline.
Computation Graphic and tensor
Under the name scope model, there are three sub name scope:
The tensors are listed as follow:
Table for the correlation between tensor and .npy files
For detailed information about each tensor and which .npy it is correlated, please check the table
Hopyfully this would help.
The text was updated successfully, but these errors were encountered: