Features:
- Added Android inference support
- Built Android artifacts for "impl", "onnx" and "visualization" modules #422
- Added Android-specific models to the model zoo
- Implemented preprocessing operations working on Android
Bitmap
#416 #478:Resize
Rotate
Crop
ConvertToFloatArray
- Added utility functions to convert
ImageProxy
toBitmap
#458 - Added
NNAPI
execution provider #420 - Added api to create
OnnxInferenceModel
from theByteArray
representation #415 - Introduced a gradle task to download model hub models before the build #444
- Added utility functions to draw detection results on Android
Canvas
#450
- Implemented new preprocessing API #425
- Introduced an
Operation
interface to represent a preprocessing operation for any input and output - Added
PreprocessingPipeline
class to combine operations together in a type-safe manner - Re-implemented old operations with the new API
- Added convenience functions such as
pipeline
to start a new preprocessing pipeline,call
to invoke operations defined elsewhere,onResult
to access intermediate preprocessing results - Converted
ModelType#preprocessInput
function toOperation
#429 - Converted common preprocessing functions for models trained on ImageNet to
Operation
#429
- Introduced an
- Added new ONNX features
- Added execution providers support (
CPU
,CUDA
,NNAPI
) and convenient extensions for inference with them #386 - Introduced
OnnxInferenceModel#predictRaw
function which allows customOrtSession.Result
processing and extension functions to extract common data types from the result #465 - Added validation of input shape #385
- Added execution providers support (
- Added
Imagenet
enum to represent different Imagenet dataset labels and added support for zero indexed COCO labels #438 #446 - Implemented unified summary printing for Tensorflow and ONNX models #368
- Added
FlatShape
interface to allow manipulating the detected shapes in a unified way #480 - Introduced
DataLoader
interface for loading and preprocessing data for dataset implementations #424 - Improved swing visualization utilities #379 #388
- Simplified
Layer
interface to leave onlybuild
function to be implemented and remove explicit output shape computation #408
Breaking changes:
- Refactored module structure and packages #412 #469
- Extracted "tensorflow" module for learning and inference with Tensorflow backend
- Extracted "impl" module for implementation classes and utilities
- Moved preprocessing operation implementations to the "impl" module
- Removed dependency of "api" module on the "dataset" module
- Changed packages for "api", "impl", "dataset" and "onnx" so that they match the corresponding module name
- Preprocessing classes such as
Preprocessing
,ImagePreprocessing
,ImagePreprocessor
,ImageSaver
,ImageShape
,TensorPreprocessing
,Preprocessor
got removed in favor of the new preprocessing API #425 - Removed
Sharpen
preprocessor since theModelType#preprocessor
field was introduced, which can be used in the preprocessing pipeline using thecall
function #429
Bugfixes:
- Fix loading of jpeg files not supported by standard java ImageIO #384
- Updated ONNX Runtime version to enable inference on M1 chips #361
- Fixed channel ordering in for image recognition models #400
- Avoid warnings from
loadWeightsForFrozenLayers
function for layers without parameters #382
New documentation and examples:
- Inference with KotlinDL and ONNX Runtime on desktop and Android
- KotlinDL ONNX Model Zoo
- Sample Android App
0.4.0 (01/06/2022) Pose Detection, EfficientDet for Object Detection and EfficientNet for Image Recognition
Features:
- Added the PoseNet model family to the ONNX Model Hub. #269
- MoveNetSinglePoseLighting
- MoveNetMultiPoseLighting
- MoveNetSinglePoseThunder
- Added the EfficientDet model family to the ONNX Model Hub. #304
- EfficientDetD0
- EfficientDetD1
- EfficientDetD2
- EfficientDetD3
- EfficientDetD4
- EfficientDetD5
- EfficientDetD6
- Added SSD-Mobilenet-v1 model to the ONNX Model Hub. #296
- Added EfficientNet model family to the ONNX Model Hub. #264
- EfficientNetB0 ( + noTop model)
- EfficientNetB1 ( + noTop model)
- EfficientNetB2 ( + noTop model)
- EfficientNetB3 ( + noTop model)
- EfficientNetB4 ( + noTop model)
- EfficientNetB5 ( + noTop model)
- EfficientNetB6 ( + noTop model)
- EfficientNetB7 ( + noTop model)
- Added NoTop models to the TensorFlow Model Hub. #281
- VGG'16
- VGG'19
- ResNet50
- ResNet101
- ResNet152
- ResNet50V2
- ResNet101V2
- ResNet152V2
- MobileNet
- MobileNetV2
- NasNetMobile
- NasNetLarge
- DenseNet121
- DenseNet169
- DenseNet201
- Xception
- Inception
- Added new
Dot
layer andConv1DTranspose
,Conv2DTranspose
,Conv3DTranspose
layers. #144 #124 - Added new activation functions:
SparsemaxActivation
andSoftShrinkActivation
. #171 #170 - Added new
Padding
,CenterCrop
,Convert
,Grayscale
image preprocessors andNormalizing
tensor preprocessor. #203 #201 #202 #204
Examples and tutorials:
- Added an example of Image preprocessing DSL usage with TensorFlow model. #292.
- Added examples for Object Detection with EfficientDet2.
- Added examples for Object Detection with SSD-Mobilenet-v1.
- Added examples for Pose Detection with different models.
- Added examples for Image Recognition with different models from EfficientNet model family.
- Added examples for fine-tuning of noTop ResNet model.
- Added an example for new Image Preprocessing DSL operator Normalize.
- Added an example for Linear Regression model training with two metrics.
- Added an example for new Functional API DSL with ToyResNet model.
- Added an example for training LeNet model with multiple callbacks support.
API changes:
- Introduced new DSL for creating
Sequential
andFunctional
models. #133 - Added support for multiple
Callbacks
infit()
,evaluate()
,predict()
instead ofcompile()
. #270 - Added support for multiple metrics. #298
- Added support for the model reset. #271
- Replaced
Long
parameters withInteger
ones in convolutional, average pool and max pool layers. #273 - Moved loading section out of image preprocessing. #322
- Remove obsolete
CustomPreprocessor
interface. #257 - Supported exporting tensor data to
BufferedImage
#293
Internal API changes:
- Introduced new abstraction for layer parameters --
KVariable
. #324 - Moved some of the
Layer
functionality to the new interfacesParametrizedLayer
andTrainableLayer
. #217
Bug fixes:
- Added support for correct loading of
isTrainable
status from Keras. #153 - Generalized
Reshape
layer to higher dimensions. #249 - Fixed incorrect bounding box coordinates in ObjectDetection and FaceDetection examples. #279
- Fixed
toString
methods for layer classes. #301 - Set all the optimizers to have
useLocking = True
#305 - Fixed a bug with silently skipped layers in topological sort. #314
- Fixed
loadModelLayersFromConfiguration
recursively calling itself. #319 - Fixed
GraphTrainableModel#internalPredict
for multi-dimensional predictions. #327 - Fixed
Orthogonal
initializer. #348 - Fixed initialization of the variables in
GraphTrainableModel
. #355 - Add model output type checks for
OnnxInferenceModel
. #356 - Fixed
IndexOutOfBoundsException
in theDot
layer. #357 - Fixed import and export issues:
- Updated log4j version.
- Fixed the group of examples with ToyResNet.
Features:
- Implemented the copying for the Functional and Sequential models
- Implemented the copying for the TensorFlow-based Inference Model
- Implemented the experimental ONNX integration:
- added new 'onnx' module
- added the ONNXModel implementing the common InferenceModel interface
- ONNX model could be used as a preprocessing stage for the TensorFlow model
- prepared ONNX model without top layers could be fine-tuned via training of top layers implemented with TensorFlow-based layers
- Added SSD and YOLOv4 object detection models to the Model Hub
- Added Fan2D106 face alignment model to the Model Hub
- Added SSDObjectDetectionModel with the easy API for object detection, including pre- and post-processing
- Added a few models in ONNX format to the Model Hub
- ResNet18
- ResNet34
- ResNet50
- ResNet101
- ResNet152
- ResNet18V2
- ResNet34V2
- ResNet50V2
- ResNet101V2
- ResNet152V2
- EfficientNetV4
- Added new TensorFlow-based models to the Model Zoo (or Model Hub):
- NasNetMobile
- NasNetLarge
- DenseNet121
- DenseNet169
- DenseNet201
- Xception
- Added ResNet18 and ResNet34 TensorFlow-based models to ModelZoo
- Added L1 and L2 regularization to the layers
- Added Identity initializer
- Added Orthogonal initializer
- Added Softmax activation layer
- Added LeakyReLU activation layer
- Added PReLU activation layer
- Added ELU activation layer
- Added ThresholdedReLU activation layer
- Added Conv1D layer
- Added MaxPooling1D layer
- Added AveragePooling1D layer
- Added GlobalMaxPooling1D layer
- Added GlobalAveragePooling1D layer
- Added Conv3D layer
- Added MaxPooling3D layer
- Added AveragePooling3D layer
- Added GlobalAveragePooling3D layer
- Added GlobalMaxPool2D layer
- Added GlobalMaxPool3D layer
- Added Cropping1D and Cropping3D layers
- Added Permute layer
- Added RepeatVector layer
- Added UpSampling1D, UpSampling2D and UpSampling3D layers
- Added Gelu activation function
- Added HardShrink activation function
- Added LiSHT activation function
- Added Mish activation function
- Added Snake activation function
- Added Tanh shrink activation function
- Added TimeStopping callback
Bugs:
- Added missed loaders for the ReLU and ELU activation layers
- Add model export for a few layers (Concatenate, DepthwiseConv2D, SeparableConv2D) missed in ModelSaver.kt
- Fixed the use-case when ModelSaver fails on saving Input with 2d and 3d tensors
- Fixed a StackOverflowError in objectDetectionSSD.kt example
- Fixed a problem with the confusing logs during weights loading from .h5 file
- Fixed the Windows separator usage instead of File.separator in the Save and Load preprocessors
- Fixed the incorrect temporary folder for the cat-vs-dogs dataset
- Fixed the problem when ImageConverter and Loading do not close opened streams
- Fixed the Image Preprocessing DSL issues
- Reduced time complexity of FloatArray::argmax to linear
API breaking changes:
- Renamed ModelZoo to the ModelHub
- Changed the ImagePreprocessing DSL: loading and saving are moved to the separate level of DSL
- Changed the TrainableModel::summary API to return ModelSummary
Infrastructure:
- Loaded the weights and JSON configurations of the newly added ModelHub models to S3 storage
- Moved ImageDSL and Dataset API to the separate 'dataset' module
- Added a new 'visualization' module with the basic support for painting on Swing and in Jupyter Notebook with lets-plot
- Transformed the project from the single-module project to the multi-module project
Docs:
- Created website with API Documentation from KDoc via Dokka
- Added support for the multiple version API Documentation from KDoc via Dokka
- Updated all existing tutorials
- Updated the Readme.md
- Updated the existing KDocs
- Added a new tutorial about ONNX models usage
- Added a new tutorial about Transfer Learning with ONNX ResNet no-top model and TensorFlow
Examples:
- Added an example of SSDObjectDetectionModel usage and visualisation of the detected objects on the Swing panel
- Added an example of Fan2D106 (face alignment) model and landmarks visualisation on the Swing panel
- Added an example where the prepared ONNX model without top layers is fine-tuned via training of top layers implemented with TensorFlow-based layers
- Added a lot of examples for the newly added to the ModelHub models (ONNX-based and TensorFlow-based)
- Added an example with the model SoundNet trained on Free Spoken Digits Dataset to classify the audio
- Updated 'visualization' examples with the new Batik and lets-plot support
Tests:
- Added tests for ModelLoading
- Added tests for InputLayer
- Added tests for all newly added layers
Features:
- Added support for Functional API
- Added BatchNorm layer for inference
- Added GlobalAveragePooling2D layer
- Added 7 Merge layers (Add, Average, Concatenate, Maximum, Minimum, Multiply, Subtract)
- Added Activation layer
- Added ReLU layer
- Added DepthwiseConv2D layer
- Added SeparableConv2D layer
- Added Reshape layer
- Added Cropping2D layer
- Added ZeroPadding2D layer
- Added NoGradients interface to indicate layers whose weights cannot be updated during training due to the lack of gradients in TensorFlow
- Added Model Zoo with the following models:
- VGG'16
- VGG'19
- ResNet50
- ResNet101
- ResNet152
- ResNet50V2
- ResNet101V2
- ResNet152V2
- MobileNet
- MobileNetV2
- Added ImageNet related preprocessing for each of the ModelZoo supported models: available in ModelZoo object and as a
sharpen
stage in the image preprocessing DSL - Added model descriptions for models from ModelZoo (excluding MobileNet) designed with the Functional API in org.jetbrains.kotlinx.dl.api.core.model package
- Added two implementations of the Dataset class: OnFlyImageDataset and OnHeapDataset
- Added topological sort for layers as nodes in the DAG model representation
- Added
shuffle
function support for both Dataset implementations - Added the Kotlin-idiomatic DSL for image preprocessing with the following operations:
- Load
- Crop
- Resize
- Rotate
- Rescale
- Sharpen
- Save
- Implemented label generation on the fly from the names of image folders
- Implemented
summary
method for the Functional API - Added embedded datasets support (MNIST, FashionMNIST, Cifar'10, Cats & Dogs)
Bugs:
- Fixed a bug with BGR and RGB preprocessing in examples
- Fixed missed
useBias
field in convolutional layers
Internals improvements:
- Refactored: both Sequential and Functional models now inherit the GraphTrainableModel class
- Completed the Klaxon migration from 5.0.1 to 5.5
- Removed useless labels and data transformations before sending to
Tensor.create(...)
Infrastructure:
- Loaded the weights and JSON configurations of ModelZoo models to S3 storage
- Added a TeamCity build for the examples
- Loaded embedded datasets to S3 storage
- Removed dependencies from
jcenter
- Moved an artifact to the Maven Central Repository
- Changed the groupId and artifactId
- Reduced the size of the downloaded
api
artifact from 65 MB to 650 KB by cleaning up resources and migrating the model and datasets to the S3 storage
Docs:
- Updated all the tutorials
- Updated the Readme.md
Examples:
- Renamed all the example's packages
- Regrouped examples between packages
- Added examples for training all ResNet models from scratch on the Cats & Dogs dataset
- Tuned hyper-parameters in all examples with VGG-like architecture to achieve convergence
- Added examples for the Image Preprocessing DSL
- Added examples for all available ModelZoo models, including additional training on the subset of the Cats & Dogs dataset
- Added ToyResNet examples (trained on the FashionMnist dataset)
Tests:
- Converted all examples to integration tests by refactoring
main
functions
Features:
- Added support batch processing for predictSoftly in #28
- Converted getXXX functions to properties in public API (layers and model classes)
- Removed a flag
verbose
from public API - Made logging based on a configuration.
Bugs:
- Fixed #25 Suspiciously slow calls to
Sequential.predictSoftly
- Fixed #24 reshapeFunction not initialized after model load
- Fixed #22 Exception in combination with Log4J
- Added permission 'executable' making gradle wraper script runnable on Linux/Mac OS X systems
Internals improvements:
- Removed unnecessary copying of FloatArray to off-heap memory in
internalPredict
method
Docs:
- Added "Release check list" for release managers
- Updated Readme.md with new chapters about logging and fatJar issue
- Minor updates in "Tutorials"
Features:
- Added @JvmStatic for companion methods
Examples:
- Provided support for VGG'16 and VGG'19 weights and models in examples
- Added links for loading all models and weights used in examples
- Moved direct file paths to property file
- Removed duplicated resources
- Transfer Learning examples are merged and improved
- Added description for all examples
Docs:
- Minor updates in "Transfer Learning Tutorial"
Features:
- ReductionType support for loss function: SUM and SUM_OVER_BATCH_SIZE
- Added new loss functions: LogCosh, BinaryCrossEntropy, SquaredHinge
Tests:
- Added tests for all loss functions (in Eager Mode)
- Added tests for new scenarios of Keras weights loading
Docs:
- "Transfer Learning Tutorial" added
- Code of conduct and ChangeLog documents added