Using data tensors as data sources: action plan #7503

fchollet · 2017-08-02T20:39:41Z

We want to add the ability to feed TensorFlow data tensors (e.g. input queues) into Keras models. A few days ago I met with @athundt and we discussed his previous efforts to make it happen. Here is how we will handle it:

First step [Update: done]

The following API:

# Get data tensors
data_tensor, target_tensor = ...

# Build model on top of the data tensor
inputs = Input(tensor=data_tensor)
outputs = Dense(...)(inputs)
model = Model(inputs, outputs)

# Add internal loss
loss = loss_fn(target_tensor, outputs)
model.add_loss(loss)

# Compile without external loss
model.compile(optimizer='sgd', loss=None)

# Fit without external data
model.fit(epochs=10, steps_per_epoch=1000)

This is already 90% supported. What is missing is the steps_per_epoch argument (currently fit would only draw a single batch, so you would have to use it in a loop).

NEEDED:

- [Update: done] PR introducing the steps_per_epoch argument in fit. Here's how it works:
  - Based on the arguments received, we determine whether training should be step-based (like in fit_generator) or sample-based (like in fit currently).
  - We have two independent code branches, one for each mode.
- [Update: done] PR introducing an MNIST example of how to use data tensors for inputs and targets, following the code snippet above. It should use the MNIST data tensors built into TF.
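
The choice between step-based and sample-based training described above could look roughly like the sketch below. This is only an illustration under stated assumptions, not the code from the PR: the helper name `resolve_training_mode` and its return values are hypothetical, and the real `fit` handles many more cases. The fallback batch size of 32 mirrors the default of `fit`.

```python
import math


def resolve_training_mode(num_samples=None, batch_size=None, steps_per_epoch=None):
    """Pick step-based or sample-based training (hypothetical helper).

    Returns a (mode, batches_per_epoch) tuple.
    """
    if steps_per_epoch is not None:
        # Step-based: the data comes from tensors, so the caller states
        # how many batches make up one epoch.
        if batch_size is not None:
            raise ValueError('batch_size has no meaning when steps_per_epoch is set')
        return 'steps', steps_per_epoch
    if num_samples is None:
        raise ValueError('Without external data, steps_per_epoch is required')
    # Sample-based: one epoch is one pass over the in-memory arrays.
    batch_size = batch_size or 32
    return 'samples', math.ceil(num_samples / batch_size)


print(resolve_training_mode(steps_per_epoch=1000))               # ('steps', 1000)
print(resolve_training_mode(num_samples=60000, batch_size=128))  # ('samples', 469)
```

Keeping the two modes in separate branches means the step-based path never has to reason about array lengths or shuffling indices.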

Second step

The following API:

# Get data tensors
data_tensor, target_tensor = ...

# Build model on top of the data tensor
inputs = Input(tensor=data_tensor)
outputs = Dense(...)(inputs)
model = Model(inputs, outputs)

# Compile as usual
model.compile(optimizer='sgd', loss='mse')

# Fit by passing the target tensor
model.fit(y=target_tensor, epochs=10, steps_per_epoch=1000)

Main issue: in compile, we create placeholders for the targets. We need to discard them (cache them, actually) and use the provided target tensor instead.

Solution: a model recompilation step inside fit in order to cache the previous target placeholder and replace it with our target tensor.

NEEDED:

PR adding support for a target tensor in the call to fit for a normally compiled model. Involves a recompilation step.
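
The cache-and-replace idea can be sketched with a toy stand-in for a model. Everything here is hypothetical (the `ToyModel` class, its attributes, and the string standing in for a placeholder); it only shows the control flow of "keep the old placeholder, recompile on the target tensor" and is not how Keras implements it.

```python
class ToyModel:
    """Stand-in for a compiled model; NOT the Keras implementation."""

    def __init__(self):
        self.target = None
        self.cached_placeholder = None
        self.compile_count = 0

    def compile(self, target=None):
        # A normal compile creates a fresh placeholder for the targets.
        self.target = target if target is not None else 'target_placeholder'
        self.compile_count += 1

    def fit(self, y=None):
        if y is not None and y is not self.target:
            # Cache the placeholder so array-based training can be restored
            # later, then recompile on top of the provided target tensor.
            self.cached_placeholder = self.target
            self.compile(target=y)
        return self.target


model = ToyModel()
model.compile()                          # usual compile: placeholder targets
used = model.fit(y='target_tensor')      # triggers the recompilation step
print(used, model.cached_placeholder, model.compile_count)
# target_tensor target_placeholder 2
```

Calling `fit` again with the same target object does not recompile, so the extra cost is paid once per new target tensor.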

Third step

The following API:

# Get data tensors
data_tensor, target_tensor = ...

# Build model on top of placeholders
inputs = Input(shape=(...))
outputs = Dense(...)(inputs)
model = Model(inputs, outputs)

# Compile as usual
model.compile(optimizer='sgd', loss='mse')

# Fit by passing the data tensor and target tensor
model.fit(data_tensor, target_tensor, epochs=10, steps_per_epoch=1000)

It's not 100% clear at this point how we will handle it, but we will figure it out. Most likely this will involve building a new TF graph inside fit, running training with it, then transferring weight values back to the initial graph. I'll handle it.

CC: @athundt @Dref360 @colinskow @TimZaman


TimZaman · 2017-08-03T09:23:31Z

LGTM. @athundt, do you take the lead in steps (1) and (2)? It seems you've mostly nailed those already.

Dref360 · 2017-08-03T19:18:10Z

Step 3 seems really "hacky". Could we ask the TF team if they are willing to handle feeding placeholder with Tensors?

For step 2, I was away for a while so I didn't keep up with @athundt's PR. But since the data_tensor is already there, I see no problem with doing: model.compile(y=target_tensor, optimizer='sgd', loss='mse').

That would save one compilation. If you've already talked about it in the PR, ignore this.

TimZaman · 2017-08-04T15:39:48Z

@Dref360

> Step 3 seems really "hacky".

Yes, it's a bit dirty. But I think Keras's APIs let us clean up the graph-surgery mess quite easily, so while it's a hack in principle, it's a great one. We'll see when we get there.

> Could we ask the TF team if they are willing to handle feeding placeholder with Tensors?

We did; see issue tensorflow/tensorflow#10837.

> model.compile(y=target_tensor, optimizer='sgd', loss='mse').

At first glance, that sounds pretty sane to me! I don't recall anyone suggesting this before.

ahundt · 2017-08-23T00:00:13Z

Sorry guys, I didn't see this until now because @ahundt is the account I actually use. I'm not sure I have access to the other one any more.

@Dref360 I submitted the request for the feature in TensorFlow a couple of months ago: tensorflow/tensorflow#10837.

The graph editing PR (#7505) might be a good way to implement the underlying functionality for API 3.

PBehr · 2017-09-20T13:52:54Z

Steps 2 and 3 will lead to issues with distributed training. Distributed TensorFlow finalizes the graph, so we get an error if we try to recompile the model. See #3997 for reference.

fchollet · 2017-09-20T17:05:39Z

For distributed training you should be using the TensorFlow estimator API. We are about to release an integration between the estimator API and Keras models. It will be in TF 1.4.


ahundt · 2017-10-01T22:20:22Z

How should we handle validation data? When a model uses input tensors the data being loaded is pre-defined, so it likely needs to be instantiated a second time or perhaps something like #7505 would be needed to reconnect the input tensors.

Thoughts?

fchollet · 2017-11-20T19:03:51Z

Yes, that's still in the pipeline, as well as the ability to call `fit`/`evaluate`/`predict` directly on data tensors for a model built on top of placeholders. You'll probably have it by TF 1.6.

On 20 November 2017 at 08:43, N-McA wrote:

> Maybe this is planned, but support for the automatic validation features (running a test on the validation set after each epoch, early stopping, learning rate adjustment based on val scores) that Keras allows would be great through this API as well. That in the pipeline?

ahundt · 2017-11-21T01:53:47Z

Cool, thanks! I saw the tf + keras estimator API is out with 1.4, perhaps there is an example somewhere?

ahundt · 2018-01-16T19:09:00Z

Found an example of estimators in Horovod. It seems that to convert a Keras model to a TF estimator you use model_to_estimator.

R-Miner · 2018-05-08T18:24:12Z

Do you have a fix for the ability to call fit/evaluate/predict directly on data tensors for a model built on top of placeholders?

sekharvth · 2018-05-23T06:29:55Z

@R-Miner I tried the 3rd step, where tensors are passed directly as input to the model, but it threw an AttributeError saying that 'Tensor' object has no attribute 'ndim'. I'm running Keras 2.1.6 on top of TensorFlow 1.8.
@fchollet said that the issue would most likely be resolved by TF 1.6, but since there haven't been any further updates on this thread, I'm not sure whether step 3 has been implemented.
It would be great to get an update regarding this, @ahundt.

UPDATE - I got the following dummy code to work:

import numpy as np
import tensorflow as tf
from keras.layers import Input, Dense, Lambda
from keras.models import Model

# 'embedding' is assumed to be an existing tensor of shape (num_examples, 512)
with tf.Session() as sess:
  # Non-deprecated equivalents of initialize_all_tables / initialize_all_variables
  sess.run(tf.tables_initializer())
  sess.run(tf.global_variables_initializer())
  inp = Input(tensor=embedding)
  # Cast to float32 so the Dense layer's MatMul sees matching dtypes
  inp1 = Lambda(lambda x: tf.cast(x, tf.float32))(inp)
  dense = Dense(1, activation='sigmoid')(inp1)

  model = Model(inp, dense)

  model.compile(loss='binary_crossentropy', metrics=['accuracy'], optimizer='adam')

  model.fit(embedding, np.array([5]), epochs=10)

The cast to float is done to avoid conflicting datatypes in the MatMul operation of the Dense layer. 'embedding' is a tensor of shape (num_examples, 512).

But it still doesn't support a multiple-input model, where one input is a tensor and the other an array. It then throws the same error shown earlier ('Tensor' object has no attribute 'ndim').

So it apparently works with exclusively tensor inputs, but doesn't yet support mixing input types. Is there a temporary hack or something that can solve this problem?

psoulos · 2018-06-15T18:42:34Z

Is there a way to save and load models that use data tensors as data sources? I am able to create the original model and save it, but I'm not sure how to load the model. If I call load_model(), how do I correctly specify the input tensor? I found this stackoverflow answer for replacing the input tensor, but this creates a dangling input which prevents me from saving the model.

TimZaman · 2018-06-15T18:51:16Z

@psoulos you cannot. A model that you load in sadly is always created on top of placeholders. The only thing you can do is:

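x = Input(tensor=your_input_tensor)  # your_input_tensor: the data tensor to build on
m1 = keras.models.load_model(...)    # the loaded model is always built on placeholders
m2 = Model(inputs=x, outputs=m1(x))  # call the loaded model on the tensor-backed input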

psoulos · 2018-06-15T18:58:39Z

@TimZaman Will that allow me to continue training without losing the state of my optimizer and configuration? Currently I'm re-creating the model architecture and calling model.load_weights but this makes it difficult to continue training.

was84san · 2018-06-19T19:01:10Z

I tried to use the third step, but then I got this error:
"When feeding symbolic tensors to a model, we expect thetensors to have a static batch size. Got tensor with shape: (None, 32, 64, 64, 3)"

I used the following strategy to fit the model:

training_filenames = [.....]
dataset = tf.data.TFRecordDataset(training_filenames)
dataset = dataset.map(_parse_function_all)  # Parse the record into tensors.

dataset = dataset.batch(20)
iterator = dataset.make_initializable_iterator()
next_element = iterator.get_next()
# videos will be next_element[0], labels will be next_element[1]

# Since the data comes in pairs, I use the first element of each pair for training
# and the second for validation:
# train_videos = next_element[0][:, 0], val_videos = next_element[0][:, 1]
# same with labels

model = create_base_network()
# input_dim = (None, 32, 64, 64, 3) for the model above
# output dimension will be (None, 10) for the model above

sgd = SGD(lr=0.01, decay=1e-6, momentum=0.9, nesterov=True)
model.compile(loss='categorical_crossentropy', optimizer=sgd)

model.fit(next_element[0][:, 0], next_element[1][:, 0],
          validation_data=(next_element[0][:, 1], next_element[1][:, 1]),
          epochs=10, steps_per_epoch=1000)

So can anyone tell me why I got this error?
Is the third step of fitting the model working now in Keras, or does it still have issues?

nmiculinic · 2018-06-20T20:38:42Z

Hmmm... how is this supposed to work with a validation dataset? Is it possible to inject both via those APIs, or do I have to resort to tf magic?

dillondaudert · 2018-06-20T20:58:46Z

I wanted to leave a comment here so others could see, but as of tensorflow 1.9, the tf.keras package supports using tf.data.Datasets and tf.data.Iterators as inputs to Model.fit()/evaluate()/predict(). See the documentation here.

For instance, this works as of tf1.9:

import tensorflow as tf
import numpy as np
from tensorflow import keras

inputs = np.zeros((10, 3))
targets = np.zeros((10, 4))
dataset = tf.data.Dataset.from_tensor_slices((inputs, targets))
dataset = dataset.repeat(100)
dataset = dataset.batch(5)

x = keras.layers.Input(shape=(3,), name='input')
y = keras.layers.Dense(4, name='dense')(x)

model = keras.Model(x, y)
model.compile(loss='mse', optimizer='rmsprop')

model.fit(dataset, epochs=1, steps_per_epoch=2, validation_data=dataset, validation_steps=2)

I'm not sure what the exact differences between keras-team/keras and tensorflow/keras are at this point, but it seems that tf.data.Dataset support is further along in the latter.
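
To see why `steps_per_epoch=2` and `validation_steps=2` are safe in the example above, it helps to count how many batches the dataset can yield. A minimal sketch in plain Python, assuming `repeat` is applied before `batch` as in the snippet; `batches_available` is a hypothetical helper for the arithmetic, not a tf.data function.

```python
import math


def batches_available(num_samples, repeat_count, batch_size):
    """Batches a repeat-then-batch pipeline can yield (hypothetical helper)."""
    return math.ceil(num_samples * repeat_count / batch_size)


# Values taken from the example: 10 samples, repeat(100), batch(5).
available = batches_available(num_samples=10, repeat_count=100, batch_size=5)

# Batches requested by fit(epochs=1, steps_per_epoch=2, validation_steps=2).
epochs, steps_per_epoch, validation_steps = 1, 2, 2
train_needed = epochs * steps_per_epoch
val_needed = epochs * validation_steps

print(available)                                            # 200
print(train_needed <= available, val_needed <= available)   # True True
```

If the requested steps exceeded the available batches, the dataset would run out before training finished, which is why the example calls `repeat` first.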

was84san · 2018-06-21T18:15:08Z

@dillondaudert So that means I can't use it with TensorFlow 1.8?

lminer · 2018-07-11T18:56:38Z

@was84san seems to work if you call .set_shape((YOUR SHAPE INCLUDING BATCH SIZE)) on the tensors you get from .get_next()

Edit: Actually even better seems to be to set drop_remainder=True in the batch method.
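
One reason the batch dimension shows up as `None`: when the number of examples is not a multiple of the batch size, the last batch is smaller, so the batch size cannot be known statically. Dropping the remainder makes every batch the same size. A plain-Python sketch of that arithmetic follows; the helper `batch_sizes` is hypothetical and only mimics the batching, it does not call tf.data, and the sample count of 50 is made up for illustration.

```python
def batch_sizes(num_samples, batch_size, drop_remainder=False):
    """Sizes of the batches produced from num_samples examples (hypothetical helper)."""
    full, rest = divmod(num_samples, batch_size)
    sizes = [batch_size] * full
    if rest and not drop_remainder:
        # The trailing partial batch is what makes the batch dimension dynamic.
        sizes.append(rest)
    return sizes


print(batch_sizes(50, 20))                       # [20, 20, 10]
print(batch_sizes(50, 20, drop_remainder=True))  # [20, 20]
```

With the remainder dropped, every batch has exactly 20 examples, so a static batch size can be reported.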

was84san · 2018-08-12T20:33:30Z

@lminer, I did that and still get this error:
AttributeError: "'Tensor' object has no attribute 'ndim'"

jandono · 2018-11-14T15:03:52Z

What's the current support for Model.predict(some_data) if I have hard-wired a tf.data.Dataset iterator as an input tensor to my model? Namely, I have something similar to the following:

# dataset = Some tf.data.Dataset
dataset_iterator = dataset.make_one_shot_iterator()
input_tensor_x, input_tensor_y = dataset_iterator.get_next()
inputs = Input(tensor=input_tensor_x)  # wrap the iterator output as the model input
outputs = Dense(10)(inputs)
model = Model(inputs=[inputs], outputs=[outputs])
model.compile(
        optimizer='adam',
        loss='categorical_crossentropy',
        metrics=['categorical_accuracy'],
        target_tensors=[input_tensor_y]
)

How can I call model.predict(data_to_be_predicted) on such a model?

eaplatanios mentioned this issue Aug 11, 2017

[Enhancement] Redesigning TensorFlow's input pipelines tensorflow/tensorflow#7951

Closed

This was referenced Aug 23, 2017

[API DESIGN REVIEW] Keras Input Tensor API #7102

Closed

TFRecord integration with Keras API tensorflow/tensorflow#8787

Closed

penguinmenac3 mentioned this issue Apr 30, 2018

Keras TFRecord support? penguinmenac3/starttf#6

Closed

fchollet closed this as completed Jun 24, 2021
