[CPU] Deconvolution int8 support (ngraph) #63

antonvor · 2021-05-05T11:43:20Z

Tickets:

48331

PR in oneDNN: openvinotoolkit/oneDNN#50
PR with tests: openvinotoolkit#5348

TODO

support post ops for int8 deconvolution

* implement argmin extractors * reconsidering argmax to topk * arg ops refactoring * rename ArgMaxToTopK * added unittests * update docs * move unittest file to new folder * conversations resolving * revert changes with argmax.py, move argmin op to a new file * rename ArgMaxSqueeze * updated BOM file * little fix * code refactoring in ArgMaxOp, updated unittests Co-authored-by: yegor.kruglov <[email protected]>

* Added info on DockerHub CI Framework * Feature/azaytsev/change layout (openvinotoolkit#3295) * Changes according to feedback comments * Replaced @ref's with html links * Fixed links, added a title page for installing from repos and images, fixed formatting issues * Added links * minor fix * Added DL Streamer to the list of components installed by default * Link fixes * Link fixes * ovms doc fix (openvinotoolkit#2988) * added OpenVINO Model Server * ovms doc fixes Co-authored-by: Trawinski, Dariusz <[email protected]> * Updated openvino_docs.xml * Edits to MO Per findings spreadsheet * macOS changes per issue spreadsheet * Fixes from review spreadsheet Mostly IE_DG fixes * Consistency changes * Make doc fixes from last round of review * integrate changes from baychub/master * Update Intro.md * Update Cutting_Model.md * Update Cutting_Model.md * Fixed link to Customize_Model_Optimizer.md Co-authored-by: Trawinski, Dariusz <[email protected]> Co-authored-by: baychub <[email protected]>

* Allow negative values for batch_dims * Update formula * Update spec according to comments * Clarified cases when batch_dims and axis are less than zero and enhanced restriction for index types Co-authored-by: Pavel Esir <[email protected]>

…it#5437) * Add sys_platform environment marker * Update sys_platform check * Add unit tests for sys_platform marker * apply review comments * Fix typo * Update checker and tests, apply comments * Update comments parsing and tests * Fix comment * Resolve comments and update check logic * Update tests and fix bug with negative tests Co-authored-by: achetver <[email protected]>

* Removed constant DDR_MAX_SIZE = 512. Removed the DDR_MAX_SIZE constant as it could potentially lead to incorrect behavior of devices with a different DDR size (Prism Creek can be up to 2 GB in size). Removed the use of this constant in methods.

* Written MO classes for DFT and IDFT operations. * Added class to read TF (I)FFT operations. * Written extractors for TF operations FFT, FFT2D, FFT3D, IFFT, IFFT2D, IFFT3D. * Written MO Roll operation and TF Roll operation extractor. * Started to write needed transformations. * Written transformation StridedSlices + Complex + Roll + (i)FFTxD + Roll + (Imag, Real) + Pack -> Roll + (I)DFT + Roll. * Written transformation for Complex + ComplexAbs. * Written correction of axes of Roll. * Small fix. * Small fix. * Some fixes. * Some changes. * Now TF Roll is read as TFRoll. Written inserting Transposes before and after (I)DFT. * Small fix. * Written tests for the transformation TFRollToRoll. * Added comments to some transformations. * Deleted redundant import. * Written tests for the transformation TransposeDFT. * Fixes in MO IR Reader to read/write (I)DFT. * Fixes in the list of supported TF layers. * Started to write tests for SSliceComplexRolledFFTPackBlockReplacement transformation. * Written tests for the MO transformation SSliceComplexRolledFFTPackBlockReplacement. * Written tests for the MO transformation ComplexAbs. * Tests for transformations were moved into unit_tests directory. * All extractors for (I)FFTxD are in one file now. * Deleted redundant transformations. * Fixed extractor for TF Roll: now this operation is read as MO Roll. * Added comments to TFFFT operation. * The method insert_transpose of classes TransposeDFT and LayoutChangeForGatherND was moved into the separate function in the file model-optimizer/extensions/middle/InsertLayoutPropagationTransposes.py. * Fixed comment for the transformation TransposeDFT. * Small fix. * Some fixes. * Deleted shape infer function for the operation TFFFT. Sorted imports in complex_abs.py. * Small fixes. * Deleted redundant import. * Fixes in some asserts. * Small fix. * Added names for created nodes in the transformation ComplexAbs. * Added comments to the method canonicalize_axes. 
* The transformation SSliceComplexRolledFFTPackBlockReplacement was split into the sequence of transformations SSliceComplexRollReplacement -> RollRealImagPackReplacement -> TFFFTToDFT. * Written tests for the transformation SSliceComplexRollReplacement. * Written tests for the transformation RollRealImagPackReplacement. * Written tests for the transformation TFFFTToDFT. * Deleted commented code. * Fixed types of constants in the transformation ComplexAbs. * Written tests for canonicalization of signal_size value. * Deleted 'Replacement' from names of files and classes. * Used comarison of ids, not names. * replace_sub_graph was replaced with find_and_replace_pattern. * Now the transformation RollRealImagPack is executed before running transformation model-optimizer/extensions/front/Pack.py. * The body of the function create_dft_from_tffft is a part of the transformation TFFFTToDFT body now. * Now method correct_roll_axes of classes RollRealImagPack and SSliceComplexRoll is moved to the function in mo/front/tf/graph_utils.py. * Small changes. * Added comment before mark_input_as_in_correct_layout(roll, 2). * Now the functions correct_roll_axes generates sub-graph in the input port 2 of Roll. * Corrected tests for the transformation SSliceComplexRoll. * Corrected tests for the transformation RollRealImagPack. * Deleted commented code. * Some renaming. * Added decomposition of the separate operation ComplexAbs (without Complex before it). * Added comment to the transformation ComplexAbsAfterComplex. * Optimized imports for the transformation TFFFTToDFT. * The transformation SSliceComplexRoll was split into the sequence SSliceComplex -> CorrectRollAxes and disabled. * Written tests for the transformation ComplexAbs. * Written tests for the transformation SSliceComplex. * Written tests for the transformation CorrectRollAxes. * Deleted the transformation SSliceComplexRoll. * Deleted renaming nodes. * Fixed comment. * Small fixes. * Small fix. 
* The attribute need_correction was renamed as input_rank_changed. * Small fixes. * Deleted commented code. * Now we iterate over all complex_node.out_port(0).get_connection().get_destinations() input ports and mark the corresponding nodes with the marker attribute. * Added the attribute 'in_ports_count' into the class FFTBase. * Tests for the transformation TransposeDFT were rewritten using helper functions. * Now the transformation RollRealImagPack uses existing Roll node instead of creating new one. * Small fixes. * Fix in the documentation. * Written class to read MxNet (I)FFT operations. Written corresponding extractors. * Corrected shape infer function for MXFFT operation. Written transformation to convert MXFFT to (I)DFT. * Fixed shape infer function. * Fixed the conversion MXFFT to (I)DFT. * Written tests for the transformation MXFFTToDFT. * The function correct_roll_axes was replaced with more generic function add_constant_to_negative_values. * Fixes in classes TFFFT, FFTBase, DFT, IDFT, MXFFT. * Added asserts in constructors of operations TFFFT and MXFFT. * Refactored transformation MXFFTToDFT: conversion of DFT and IDFT were moved into separated functions. * Moved some commented code. * Fixed BOM file. * Written function convert_ifft_to_dft. * Started to rewrite tests for MXFFTToDFT transformations, in the case is_inverse=False. * Small fixes. * Fixes in the transformation RollRealImagPack. * Renaming tests class for the transformation SSliceComplex. * Fixes in the function compare_graphs. Now we get all output nodes of op node, and these output nodes are sorted by names. * Fixed tests for the transformation MXFFTToDFT. * Fix in the transformation ThresholdedReluDecomposition: added disconnect for trelu input port. * Fixes in test for the transformation TFSliceToSlice. * Small fix in the transformation ObjectDetectionAPIPreprocessor2Replacement. * Small fix in comment. * Optimized imports. 
* Used remove_node in the transformation ThresholdedReluDecomposition and remove_nodes_from in the transformation RollRealImagPack, instead of ports disconnection. * Deleted commented code. * Deleted test case test_slice_replacer_begin_with_2_inputs.

…toolkit#5399) * [GNA] Additional PWL segments are added to avoid saturation. After the design phase for PWL segments has finished, additional segments are added to avoid saturation. This commit also reduces the number of PWL segments created for some layer types. * [GNA] Make PWL unit tests take into account saturation errata

…openvinotoolkit#5541) It is a known internal issue in gtest that holding a shared_ptr to a mocked object sometimes causes a memory leak to be reported. It is recommended to call Mock::VerifyAndClearExpectations at the end of each test, once the mock object is no longer needed. After adding this, an issue with incorrect TestThrowOnImport expectations is observed.

* Exclude xbyak from install * Added automatically generated InferenceEngineConfig.cmake * Reverted a version back * Fixed issues with target aliases * Make TBB dependency private * Made ie_parallel.cmake self-sufficient * Don't expose ie_parallel.cmake to end users * Fixed compilation with TBB * Fixes for TBB * Fixed vpu_graph_transformer compilation * Fixed tests compilation * Added install of ie_parallel.cmake * Switched ENABLE_ALTERNATIVE_TEMP to OFF. Fixed COMPONENTS for TBB * Fixed file name in install rules * Added find_dependency for TBB in ie_parallel.cmake * WA for cmake bug with PACKAGE_PREFIX_DIR * Fixed no-deprecation to fix speech-library build * Reverted version from 2.1.0 to 2.1 * Revert "Reverted version from 2.1.0 to 2.1" This reverts commit 7cb5d15. * Returned custom version file back * Added InferenceEngineConfig-version.cmake to share as well * Disabled one more GPU test * Added one more WA for CI * WA for CI issue for C API * WIP

* BatchNormInference specification refactoring * Address review comments * Remove the term Transform from definition * Add title of the paper where this operation is introduced * Add missing backticks * Remove redundant information in attribute epsilon range of values * Refinement of spec Remove more mentions of transformation to avoid confusion * Corrected typos and added changes to improve readability * Use third person to express operation steps

antonvor changed the base branch from master to feature/cpu_migration_on_ngraph May 5, 2021 11:43

antonvor force-pushed the feature/deconvolution_int8_support_ngraph branch 2 times, most recently from 520d10f to edfcf24 May 5, 2021 12:31

antonvor marked this pull request as ready for review May 5, 2021 12:34

dmitry-gorokhov force-pushed the feature/cpu_migration_on_ngraph branch from 586fd7a to 7d4fd2f May 5, 2021 21:19

v-Golubev and others added 25 commits May 6, 2021 10:58

ConcatTransformation fix (openvinotoolkit#5482)

49a5385

* [LPT] ConcatTransformation: fixed naming of outputs after split * [LPT][TESTS] Concat with split tests: added verification of output names

[IE CLDNN] Fixed FQ in byxf layout and pooling in fsv32 with int8 input (openvinotoolkit#5431)

fa4a67a

[IE CLDNN] Disabled vectorized ocl path for modes with bool output (openvinotoolkit#5521)

30b9d2b

Change model_path to model_name in timeline report (openvinotoolkit#5457)

5834eef

[IE TESTS] Remove dummy file for beh tests (openvinotoolkit#5436)

935405a

fix incorrect input names for mean values (openvinotoolkit#5508)

e3ea9bf

Extend filling roi data for other precisions (openvinotoolkit#5432)

5bd6343

by making it template.

[IE][VPU]: Fix for crash in Myriad plugin during LoadNetwork with Hetero plugin (openvinotoolkit#5222)

a411af1

Add time_tests dir path to sys.path (openvinotoolkit#5498)

4790c79

[LPT] Improve Etlwise branch selection logic (openvinotoolkit#5208)

ec7b1f4

Convert op specification refactoring. (openvinotoolkit#5530)

2896b3a

* Convert op specification refactoring. * Minor readability improvements. * Fixed 'category' formatting.

[CPU] Plugin migration on ngraph (openvinotoolkit#4344)

a19413c

Fix incorrect plural: childs -> children (openvinotoolkit#5532)

2c755aa

Update opencv package for yocto (openvinotoolkit#5536)

a8b5f1f

Fixed TBBBind_2.4 usage for RelWithDebInfo (openvinotoolkit#5535)

a8289b5

[LPT] Zero point insertion in case of zero value on FQ output high (openvinotoolkit#5467)

8645c08

* [LPT] Zero point insertion in case of zero value on FQ output high * [LPT] Change precision in test on the real default precision[0]

elilobanova and others added 25 commits May 7, 2021 12:01

[GNA] Fix compilation of topologies with only 2 functional layers: convolution and pooling (openvinotoolkit#5501)

696aa37

Updated list of supported OS for PyPi (openvinotoolkit#5525)

d339cbe

* Updated list of supported OS for PyPi * merge of Helena branch * 55103: add instructions to install Microsoft* Visual C++ Redistributable Package

Revert "Reuse existing cmake variables" (openvinotoolkit#5550)

39717ae

This reverts commit ff583ce.

ReduceSum specification refactoring (openvinotoolkit#5527)

edfec91

[GNA] New flags to select GNA target generation (openvinotoolkit#5429)

7c07c61

* New flags for GNA target * Add new flags to configuration * Update utest * Update speech_Sample * Add unit tests * Guard for wrong values * Use gna execution target as consistent device * Apply review

Azure: Add installing setuptools in linux_onnxruntime.yml (openvinotoolkit#5547)

102e95f

[IE CLDNN] Fix segmentation fault for hetero plugin mode (openvinotoolkit#5548)

d4a8834

ConvertLike specification refactoring (openvinotoolkit#5534)

b9812a4

* ConvertLike specification refactoring * Corrected typos and clean up * Changed supported types of Convert to align with ConvertLike

ReverseInputChannels mapping fix (openvinotoolkit#5523)

4ea09b1

* Fixed attributes saving to keep tensor debug info in Parameter node. * Added comment and unit tests. * Small correction. * Small correction of unit test. * Comment corrected.

Moved ie_thread_affinity.hpp to private API (openvinotoolkit#5554)

3e25539

Fuse mul transformation fix (openvinotoolkit#5518)

84b94c9

* Changed fuse_mul behaviour for proper data node connection. * Corrected the comment. * Corrected the comment. * Added permutation attribute saving. * Added comment. * Added unit tests, comments corrections.

Revise Unsqueeze op - specification (openvinotoolkit#5526)

af0aa5f

* Revise unsqueeze op spec * Second input value boundaries fix * adjust first input description

[LPT] ConvolutionBackpropData support

334d14c

minor fixes

77a7e78

[Transformations] Legacy subtract precision keep

1b17a7f

[LPT] ConvolutionBackpropData tests improvements

c187075

[LPT] ConvolutionBackpropData weights folding when can't be transformed

4700604

[LPT] CanBeTransformed unification and convolution weights folding

0dee43f

[LPT] GPU INT8 optimizations condition flag

3f99fb1

[LPT] Concat precision predict improvement

7e1e827

[LPT] Turn off asymmetric quantization for Deconvolution on GPU

144b38d

[LPT] Improvements from review

007a984

[LPT] Check if layer after concat isQuantized and require per-tensor quantize

02972bf

[CPU] int8 deconvolution support

e065229

antonvor force-pushed the feature/deconvolution_int8_support_ngraph branch from edfcf24 to e065229 May 7, 2021 15:39

antonvor closed this May 7, 2021
