-
Notifications
You must be signed in to change notification settings - Fork 246
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix: tflite conversion and inference
- Loading branch information
Showing
20 changed files
with
317 additions
and
230 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,12 +1,66 @@ | ||
# TFLite Conversion Tutorial | ||
- [TFLite Tutorial](#tflite-tutorial) | ||
- [Conversion](#conversion) | ||
- [Inference](#inference) | ||
- [1. Input](#1-input) | ||
- [2. Output](#2-output) | ||
- [3. Example script](#3-example-script) | ||
|
||
## Run | ||
|
||
# TFLite Tutorial | ||
|
||
## Conversion | ||
|
||
```bash | ||
python examples/train.py \ | ||
python3 examples/train.py \ | ||
--config-path=/path/to/config.yml.j2 \ | ||
--h5=/path/to/weight.h5 \ | ||
--bs=1 \ # Batch size | ||
--beam-width=0 \ # Beam width, set >0 to enable beam search | ||
--output=/path/to/output.tflite | ||
## See others params | ||
python examples/tflite.py --help | ||
``` | ||
``` | ||
|
||
## Inference | ||
|
||
### 1. Input | ||
|
||
Input of each tflite depends on the models' parameters and configs. | ||
|
||
The `inputs`, `inputs_length` and `previous_tokens` are still the same as bellow for all models. | ||
|
||
```python | ||
schemas.PredictInput( | ||
inputs=tf.TensorSpec([batch_size, None], dtype=tf.float32), | ||
inputs_length=tf.TensorSpec([batch_size], dtype=tf.int32), | ||
previous_tokens=tf.TensorSpec.from_tensor(self.get_initial_tokens(batch_size)), | ||
previous_encoder_states=tf.TensorSpec.from_tensor(self.get_initial_encoder_states(batch_size)), | ||
previous_decoder_states=tf.TensorSpec.from_tensor(self.get_initial_decoder_states(batch_size)), | ||
) | ||
``` | ||
|
||
For models that don't have encoder states or decoder states, the default values are `tf.zeros([], dtype=self.dtype)` tensors for `previous_encoder_states` and `previous_decoder_states`. This is just for tflite conversion because tflite does not allow `None` value in `input_signature`. However, the output `next_encoder_states` and `next_decoder_states` are still `None`, so we can simply ignore those outputs. | ||
|
||
### 2. Output | ||
|
||
```python | ||
schemas.PredictOutputWithTranscript( | ||
transcript=self.tokenizer.detokenize(outputs.tokens), | ||
tokens=outputs.tokens, | ||
next_tokens=outputs.next_tokens, | ||
next_encoder_states=outputs.next_encoder_states, | ||
next_decoder_states=outputs.next_decoder_states, | ||
) | ||
``` | ||
|
||
This is for supporting streaming inference. | ||
|
||
Each output corresponds to the input = each chunk of audio signal. | ||
|
||
Then we can overwrite `previous_tokens`, `previous_encoder_states` and `previous_decoder_states` with `next_tokens`, `next_encoder_states` and `next_decoder_states` for the next chunk of audio signal. | ||
|
||
And continue until the end of the audio signal. | ||
|
||
### 3. Example script | ||
|
||
See [examples/inferences/tflite.py](../../examples/inferences/tflite.py) for more details. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
47 changes: 0 additions & 47 deletions
47
examples/models/transducer/conformer/inference/run_tflite_model.py
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -42,7 +42,7 @@ def parse_requirements(lines: List[str]): | |
|
||
setup( | ||
name="TensorFlowASR", | ||
version="2.0.0", | ||
version="2.0.1", | ||
author="Huy Le Nguyen", | ||
author_email="[email protected]", | ||
description="Almost State-of-the-art Automatic Speech Recognition using Tensorflow 2", | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.