Fix compatibility checks for 18.04 container #4

cbcase · 2018-05-23T17:49:53Z

Our 18.04 container is weird -- kind of an intermediate state of tensor / variable merge. This handles it better.
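As a rough illustration of what a check for an "intermediate state" could look like, here is a minimal sketch. It is not the code from this PR. The helper names (`variable_is_tensor`, `tensor_is_variable`, `merge_state`) and the stand-in classes are hypothetical. The helpers take the module as an argument, so the stand-ins can exercise them without PyTorch installed.

```python
from types import SimpleNamespace


def variable_is_tensor(torch_like):
    """True if a freshly built Variable is an instance of the Tensor class."""
    return isinstance(torch_like.autograd.Variable(), torch_like.Tensor)


def tensor_is_variable(torch_like):
    """True if constructing a Tensor hands back exactly the Variable class."""
    return type(torch_like.Tensor()) is torch_like.autograd.Variable


def merge_state(torch_like):
    """Classify the module by probing behavior instead of parsing a version string."""
    v_is_t = variable_is_tensor(torch_like)
    t_is_v = tensor_is_variable(torch_like)
    if v_is_t and t_is_v:
        return "merged"
    if v_is_t or t_is_v:
        return "intermediate"
    return "separate"


# Hypothetical stand-ins for three possible states of a torch-like module.
class SepTensor:
    pass


class SepVariable:
    pass


class HalfTensor:
    pass


class HalfVariable(HalfTensor):  # Variable subclasses Tensor, but not vice versa
    pass


class Unified:
    pass


separate = SimpleNamespace(
    Tensor=SepTensor, autograd=SimpleNamespace(Variable=SepVariable))
half = SimpleNamespace(
    Tensor=HalfTensor, autograd=SimpleNamespace(Variable=HalfVariable))
merged = SimpleNamespace(
    Tensor=Unified, autograd=SimpleNamespace(Variable=Unified))

for name, mod in [("separate", separate), ("half", half), ("merged", merged)]:
    print(name, "->", merge_state(mod))
```

Probing behavior rather than comparing version numbers is one way to cope with a build that sits between two releases, which is the situation the description mentions.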

* fix dropout scaling from p to 1/(1-p) (NVIDIA#816)
  Co-authored-by: Sukru Eryilmaz <[email protected]>
* Improvements to apex.mlp (NVIDIA#804)
* update fused bias relu backward kernel
* adding support for not require first layer dgrad
* fix bug: wrong layer in requires grad
* add infrastructure for optional bias and activation, currently only support no bias and no relu
* make bias and relu optional separately
* add sigmoid activation option
* enable wider load/store for multi_tensor_apply kernels (NVIDIA#763)
* modify MTA axpby for wider load/store
* Make scale/axpby/l2/adam/lamb multi_tensor uses wider load
* Changes to make xentropysoftmax load/store vectorized when possible: (NVIDIA#725)
* Changes to make xentropysoftmax load/store vectorized when possible:
  Increase default ILP so that each thread handle 16 Bytes data in one step
  Make thread load/store longest vector possible
  Make unroll case handle adjacent data instead of strided, so same order compare to vector case
* Add shift for not aligned case. Remove less than 16 bytes aligned access

Co-authored-by: Burc Eryilmaz <[email protected]>
Co-authored-by: Sukru Eryilmaz <[email protected]>
Co-authored-by: Deyu Fu <[email protected]>

Accept custom (layer type:param name) to include in sparse_parameter …

* Added support for fused ReLU and dropout into transducer joint
* Reorganized code selection path in transducer joint fwd
* Added support for fused ReLU+dropout into transducer joint
* Vectorize transducer loss backward with fused softmax (#3)
* Nanz/transducer loss (#4)
* Vectorize transducer loss backward with fused softmax
* Added a predicate to avoid potential IMA
* Nanz/transducer loss (#5)
* Vectorize transducer loss backward with fused softmax
* Added a predicate to avoid potential IMA
* Added more predicates to avoid IMAs
* Updated documentation for newly added features.
* Fixed an error in transducer.py

fix compatibility checks for 18.04 container

9ce3a33

mcarilli merged commit 1737ce1 into master May 23, 2018

cbcase deleted the amp_compat_fix branch July 11, 2018 00:15

jinserk mentioned this pull request Aug 30, 2018

'RNN' KeyError #36

Closed

Solacex mentioned this pull request Jan 13, 2019

RuntimeError: cuda runtime error (74) : misaligned address at /pytorch/aten/src/THC/THCTensorCopy.cu:84 #124

Open

adrienchaton mentioned this pull request Jun 27, 2019

RuntimeError and speed loss with opt_level = O1, O2 or O3 #373

Open

cizhenshi mentioned this pull request Aug 14, 2019

nms error Expected object of scalar type Half but got scalar type Float for sequence elment 1 in sequence argument at position #1 'tensors' #430

Closed

FabianIsensee mentioned this pull request Oct 7, 2019

AMP will crash with non-tensorcore GPUs #528

Closed

liulhdarks mentioned this pull request Oct 21, 2019

fail to use O1 level for AdaptiveLogSoftmaxWithLoss #556

Open

chengmengli06 mentioned this pull request Nov 12, 2019

apex hangs on cudaFree #599

Open

keloemma mentioned this pull request Mar 10, 2020

Installing apex display this error : warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++ #749

Open

quanpn90 mentioned this pull request Mar 17, 2020

Possible bug with O1 and FusedLayerNorm #760

Closed

matlabninja mentioned this pull request May 20, 2020

Use O1 opt_lv leads to RuntimeError: CUDA error: no kernel image is available for execution on the device #842

Closed

thorjohnsen pushed a commit that referenced this pull request Sep 15, 2020

Merge pull request #4 from a-maci/ASP_sparse_param_dict_update

eb5e96c

Accept custom (layer type:param name) to include in sparse_parameter …

tigerccx mentioned this pull request Oct 20, 2020

RuntimeError: CUDA error: invalid device function #982

Open
