merge paddle #1

b3602sss · 2021-01-18T07:44:07Z

PR types

PR changes

Describe

…h are supported) (#29772)

* oneps (3/4) Co-authored-by: MrChengmo <[email protected]> Co-authored-by: malin10 <[email protected]> Co-authored-by: chengmo <[email protected]>

This reverts commit 067d7f1.

… unit test (#29739) * reopen python coverage --include for test, test=develop * if no .py file modified, not use coverage run, test=develop * remove test code, test=develop * add WITH_INCREMENTAL_COVERAGE, test=develop * refine if else, test=develop

* feat: support check_nan_inf for kunlun device * support kunlun stack * minor

* Fix Windows unittest of test_fetch_lod_tensor_array

* support some shape in matmul and cast * modify matmul

* add support for complex grad accumulated * add unittest for coverage * update test dtype * remove useless blank line

* try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments

* Add the ipipe log param prefix 1. add the prefix; 2. using Colon before the metric values; * 增加效率云日志指标收集前缀暂未验证windows bat的这个字符串替换是否正常 * Preserve The Old Format Metrics During The Transition Period Please DELETE the old format metrics log finally. The period man last for a week. * ipipe_log_param + ccache and clcache ..

* enable bilateral_slice unittest on windows platform * reduce max threads

* update, test=develop

* split callback unittest * rm test_callback from timeout list

* fix conv_transpose bug when padding=same

…29574)

…or (#29842) * Revert "[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)" This reverts commit b10ecd9. * Support ShareInplaceVersionCounterWith to share the same inplace version counter for VarBase

* register op version for grid_sampler

* Support MIPS arch.

… executor (#29926)

* fix a bug in reset_tensor_array, test=develop * ci coverage, test=develop

…29923) * add op version for fake_quant and fake_dequant ops, test=op_version, test=develop

…remove unnecessay code (#29851)

* flush denormals to zero, test=develop * add comments, test=develop

* rm check_import_scipy, rm chunk_eval and mean_iou in paddle.metric.__init__.py * Revert "rm check_import_scipy, rm chunk_eval and mean_iou in paddle.metric.__init__.py" This reverts commit 179ba8c. * delete paddle.metric.chunk_eval and paddle.metric.mean_iou * delete paddle.nn.clip and paddle.nn.clip_by_norm * delete paddle.nn.functional.activation.hard_sigmoid and paddle.nn.functional.activation.hard_swish * delete paddle.nn.Pool2D, paddle.nn.BilinearTensorProduct, paddle.nn.RowConv, paddle.nn.functional.row_conv * fix extension import error * fix unittest for row_conv and Pool2D

built-in `rsqrt` is shadowed

* add view strategy on squeeze,unsqueeze,reshape,flatten * add squeeze unittest * add unittests * use View strategy as name rather than Reuse Allacation * fix view api doc * fix format * use core.ops when input of reshape2 is Tensor * fix test_cross_entropy_loss error because of reshape2 * fix test_cross_entropy_loss error because of reshape2 * add inplace strategy * add elementwise_add sub * let backward op not use inplace * grad op do not use inplace * fix memory increase error and add leaf error message * delete selected_rows * change op_function * little change * solve HandleViewBetweenInputAndOutput * add unittest and leaf error message * merge view error * optimize op_function_generator format and support sum inplace op * fix format of basic_engine * fix format for framework * little change of variable wrapper * add reshape, squeeze, unsqueeze, scatter api * add relu elu tanh softmax inplace api * fix test_squeeze_op unittest * fix test_relu_op unittest * fix comment problems * delete sample code of inplace api * add reference of grad_pending_nodes in basic_engine * fix unittest name * add inplace apis into wlist * fix error message * add PADDLE_ENFORCE for set grad op twice * fix head file error

* update voc url

* Move pass tester helper functions to single common place. * Use helper functions in two more fuse pass tests.

* Modify the calculation logic of LambOptimizer

…28701) * Collect weight threshold of lstm, test=develop

* Avoid bug on 'MAC python3.5/6'. * Choose the saving method according to the OS. * smaller length of '_unpack_saved_dict' for MAC OS. * add version information of Python. * Edit comment.

* initial commit: simple demo * polish copyright format * add grap op simple demo * adapt uncertain number of argument * change trait marco name * add place & dtype support for add kernel * add dispath and infershape func * poish code & add notes * add dynamic_loader dep for paddle_framework * add new custom op test dir * polish impl details * add unittest for new custom op * fix failed unittest * Costum op (#1) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * make place const * make Tensor copy * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * Remove ShareData from user && Change CustomTensor to Tensor && Support more data type (#2) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * make place const * make Tensor copy * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * hid share data from and to * rename CustomTensor to Tensor * refactor register design & add test * change op_funtion to op_meta_info * split op meta info into .h and .cc * move get methods into friend class * move OpMetaInfoHelper into framework space * move CustomTensorUtils into framework space * change pybind api name * move PD C API into op meta info * add register custom op api * remove inference cmake change * refactor copy to api && change Reshape to lowercase && support more dtype && add more test (#3) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * make place const * make Tensor copy * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * hid share data from and to * rename CustomTensor to Tensor * support multi dtype * remove lod, make reshape lowercase, add copy test and refactor copy api * remove lod, make reshape lowercase, add copy test and refactor copy api * remove lod, make reshape lowercase, add copy test and refactor copy api * remove lod, make reshape lowercase, add copy test and refactor copy api * fix copy to error * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * polish detail & error message * polish test details * Add cast api && Change copy related api to copy_to && add more test (#4) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * fix compile error * make place const * make Tensor copy * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * adjust ut code of custom op * hid share data from and to * rename CustomTensor to Tensor * support multi dtype * remove lod, make reshape lowercase, add copy test and refactor copy api * remove lod, make reshape lowercase, add copy test and refactor copy api * remove lod, make reshape lowercase, add copy test and refactor copy api * remove lod, make reshape lowercase, add copy test and refactor copy api * fix copy to error * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add more test * add type cast * add cast and make copy to api * add cast and make copy to api * add cast and make copy to api * add cast and make copy to api * merge cwh code * merge cwh code * merge cwh code * merge cwh code * merge cwh code * add more error log * add more error log * polish code * used for test * remove test comment * remove test comment * fix uint8 type error * fix lost uint8 type error * add test for coverage * polish details by reviewer comments * add prefix for DISABLE_COPY_AND_ASSIGN Co-authored-by: Jiabin Yang <[email protected]>

…y::Allocation> for Storage (PaddlePaddle#38301) * Added shared_ptr<Allocation> member & corresponding interfaces to Storage * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly * Fixed issues with storage offset * Used place to malloc allocation for TensorStorage

* Enabled Eager OpTest #1 * Enabled Eager OpTest #1 * Fixed get_tensor method for EagerTensor

* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: xiongkun <[email protected]>

* [Refactor] refactored eager_gen.py PR #1 * [Refactor] refactored eager_gen.py PR #1 * Refactored version 2 * Added automatic code generation utils * Fixed merge issues

…rdFunctions and GradNodes (PaddlePaddle#40937) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue

…enerateForwardDefinition (PaddlePaddle#41016) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Fixed minor issue

…Paddle#41051) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * Fixed yaml typo

…e#41121) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation * Fixed yaml typo * Fixed yaml typo * fixed minor issues * Fixed minor issue

…sed to paddle.grad() (PaddlePaddle#41198) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation * Fixed yaml typo * Fixed yaml typo * fixed minor issues * [DoubleGrad PR #5] Enabled gradient computations for grad_tensors passed to paddle.grad() * Fixed minor issue * Fixed CI-Inference issue * Fixed CI-inference issues

…efore backward run (PaddlePaddle#41306) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation * Fixed yaml typo * Fixed yaml typo * fixed minor issues * [DoubleGrad PR #5] Enabled gradient computations for grad_tensors passed to paddle.grad() * Fixed minor issue * Fixed CI-Inference issue * Fixed CI-inference issues * [DoubleGrad PR PaddlePaddle#7] paddle.grad() to copy backward graph before backward run * Fixed minor issues * Fixed issue with backward graph construction logic * Fixed implementation issues with backward graph reconstruction * Fixed unittest issue * Fixed issues

…atmul (PaddlePaddle#41387) * [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation * Fixed yaml typo * Fixed yaml typo * fixed minor issues * [DoubleGrad PR #5] Enabled gradient computations for grad_tensors passed to paddle.grad() * Fixed minor issue * Fixed CI-Inference issue * Fixed CI-inference issues * [DoubleGrad PR PaddlePaddle#7] paddle.grad() to copy backward graph before backward run * Fixed minor issues * Fixed issue with backward graph construction logic * Fixed implementation issues with backward graph reconstruction * Fixed unittest issue * Fixed issues * [DoubleGrad PR PaddlePaddle#8] Enabled triple grads for sigmoid and matmul * Fixed issues with phi kernel * Added triple grad test case * Fixed minor issue

chenwhql and others added 30 commits December 24, 2020 11:20

replace exit method (#29862)

0e0bb1b

Added fc + activation fuse pass (currently only gelu, sigmoid and tan…

edc06c6

…h are supported) (#29772)

[Feature] one ps (3/4) (#29604)

032414c

* oneps (3/4) Co-authored-by: MrChengmo <[email protected]> Co-authored-by: malin10 <[email protected]> Co-authored-by: chengmo <[email protected]>

Revert "fix conv2d int8 windows UT (#29528)" (#29869)

5d130d5

This reverts commit 067d7f1.

add the cumsum unit test for the develop (#29881)

7498df2

feat: support check_nan_inf for kunlun/xpu device (#29694)

59b47f3

* feat: support check_nan_inf for kunlun device * support kunlun stack * minor

fix TransferInplaceBack (#29830)

6b25831

Skip Windows Multi-GPU test of test_fetch_lod_tensor_array (#29508)

80eb777

* Fix Windows unittest of test_fetch_lod_tensor_array

support some shape for matmul and cast in xpu place (#29900)

c7acad9

* support some shape in matmul and cast * modify matmul

[Complex] Add support for complex grad accumulated (#29889)

1a304e6

* add support for complex grad accumulated * add unittest for coverage * update test dtype * remove useless blank line

remove duplicate ut names (#29809)

2a01756

Enable bilateral_slice unittest on windows platform (#29896)

0f4b218

* enable bilateral_slice unittest on windows platform * reduce max threads

fix the bug in pipeline data parallelism (#29731)

01950ce

* update, test=develop

Split callbacks unittest (#29914)

11de384

* split callback unittest * rm test_callback from timeout list

Fix Conv2DTanspose bug when padding='same' (#29915)

0b74428

* fix conv_transpose bug when padding=same

[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#…

4427df3

…29574)

fix optimizer dtype (#29917)

a4b9daf

fix reg api ut fail (#29921)

5c162fe

Register op version for grid_sampler, test=op_version (#29916)

eab0b60

* register op version for grid_sampler

Support mips arch (#29903)

332da13

* Support MIPS arch.

[Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel…

3d1741b

… executor (#29926)

fix a bug in reset_tensor_array, test=develop (#29620)

acb5e86

* fix a bug in reset_tensor_array, test=develop * ci coverage, test=develop

add op version for fake_quant and fake_dequant ops, test=op_version (#…

7667e59

…29923) * add op version for fake_quant and fake_dequant ops, test=op_version, test=develop

[Windows CI test] Enable unittest test_optimizer_in_control_flow and …

14bd77f

…remove unnecessay code (#29851)

flush denormals to zero, test=develop (#29924)

181ea18

* flush denormals to zero, test=develop * add comments, test=develop

Shixiaowei02 and others added 14 commits January 15, 2021 13:44

export global google flags to users, test=develop (#30448)

715d862

Fix float64 bug in layer norm (#30452)

008b0a8

built-in `rsqrt` is shadowed

add VecCastCUDAKernel (#30296)

c9a334e

Update voc dataset url (#30450)

1d7bf1d

* update voc url

[oneDNN] Refactor fuse pass helper functions to one place. (#30460)

c5ffad1

* Move pass tester helper functions to single common place. * Use helper functions in two more fuse pass tests.

Modify the calculation logic of LambOptimizer (#29313)

11e78eb

* Modify the calculation logic of LambOptimizer

Collect weight threshold for lstm op in post_training_quantization (#…

5d8d463

…28701) * Collect weight threshold of lstm, test=develop

fix random seed in nll_loss unittest test=develop (#30468)

d8a9ba5

Recompute Offload: fixed bug in memcpy (#30484)

16ba0ab

Avoid bug on 'MAC python3.5/6'. (#30485)

18ecd43

* Avoid bug on 'MAC python3.5/6'. * Choose the saving method according to the OS. * smaller length of '_unpack_saved_dict' for MAC OS. * add version information of Python. * Edit comment.

fix range op crash in dygraph xpu place (#30469)

5e5c282

if pybind.cc changed, generate total report, test=develop (#30514)

bd97192

optimize batch_norm & pool op for kunlun (#30490)

8489d4f

b3602sss merged commit 10ca907 into b3602sss:develop Jan 18, 2021

b3602sss pushed a commit that referenced this pull request Nov 30, 2021

Added Eager Dygraph AutoCodeGen dependencies #1 (PaddlePaddle#37574)

fcd44b5

b3602sss pushed a commit that referenced this pull request Nov 30, 2021

Added performance tests for Eager Dygraph #1 (PaddlePaddle#37638)

7df301f

b3602sss pushed a commit that referenced this pull request Feb 16, 2022

infershaped autogen (PR #1), test=develop (PaddlePaddle#39405)

b3e049f

b3602sss pushed a commit that referenced this pull request Feb 16, 2022

Fixed get_tensor method for EagerTensor (PaddlePaddle#39414)

9722994

* Enabled Eager OpTest #1 * Enabled Eager OpTest #1 * Fixed get_tensor method for EagerTensor

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

merge paddle #1

merge paddle #1

b3602sss commented Jan 18, 2021

merge paddle #1

merge paddle #1

Conversation

b3602sss commented Jan 18, 2021

PR types

PR changes

Describe