This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

vectorize min/max/logsumexp/nan/reduce_sum #145

Merged
merged 2 commits into from
Sep 22, 2017

Conversation

dsmilkov
Contributor

@dsmilkov dsmilkov commented Sep 22, 2017

2x improvement for min/max and 4x for LogSumExp.

Old Max on Linux:
max

Vectorized Max on Linux:
max-vec4

Old LogSumExp on Linux:
logsumexp

Vectorized LogSumExp on Linux:
logsumexp-vec4
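
For readers unfamiliar with the approach, the idea behind a 4-wide reduction can be sketched roughly as follows. This is a CPU-side TypeScript illustration only, not the GLSL shader code from this PR. The function names `logSumExpScalar` and `logSumExpVec4` are hypothetical, and the assumption that the shaders handle four values per loop iteration is inferred from the `vec4` branch name.

```typescript
// Hypothetical illustration: scalar LogSumExp, one element per iteration.
function logSumExpScalar(x: Float32Array): number {
  let max = -Infinity;
  for (let i = 0; i < x.length; i++) {
    max = Math.max(max, x[i]);
  }
  let sum = 0;
  for (let i = 0; i < x.length; i++) {
    sum += Math.exp(x[i] - max);
  }
  return max + Math.log(sum);
}

// Hypothetical illustration: the same reduction with four independent
// accumulators per pass, mimicking what a vec4 does in a shader.
function logSumExpVec4(x: Float32Array): number {
  const n4 = Math.floor(x.length / 4) * 4;

  // Pass 1: running max in four lanes, then combine the lanes.
  let m0 = -Infinity, m1 = -Infinity, m2 = -Infinity, m3 = -Infinity;
  for (let i = 0; i < n4; i += 4) {
    m0 = Math.max(m0, x[i]);
    m1 = Math.max(m1, x[i + 1]);
    m2 = Math.max(m2, x[i + 2]);
    m3 = Math.max(m3, x[i + 3]);
  }
  let max = Math.max(Math.max(m0, m1), Math.max(m2, m3));
  // Remainder elements that do not fill a full group of four.
  for (let i = n4; i < x.length; i++) {
    max = Math.max(max, x[i]);
  }

  // Pass 2: sum of exp(x - max) in four lanes, then combine the lanes.
  let s0 = 0, s1 = 0, s2 = 0, s3 = 0;
  for (let i = 0; i < n4; i += 4) {
    s0 += Math.exp(x[i] - max);
    s1 += Math.exp(x[i + 1] - max);
    s2 += Math.exp(x[i + 2] - max);
    s3 += Math.exp(x[i + 3] - max);
  }
  let sum = s0 + s1 + s2 + s3;
  for (let i = n4; i < x.length; i++) {
    sum += Math.exp(x[i] - max);
  }
  return max + Math.log(sum);
}
```

Both functions should agree up to floating-point rounding. The 4-wide version cuts the loop trip count to roughly a quarter, which is the kind of saving a vec4 shader loop is meant to capture.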



@nsthorat
Contributor

Reviewed 6 of 6 files at r1.
Review status: all files reviewed at latest revision, all discussions resolved.



@nsthorat
Contributor

:lgtm_strong:


Review status: all files reviewed at latest revision, all discussions resolved.



@dsmilkov dsmilkov merged commit 50c3805 into master Sep 22, 2017
@dsmilkov dsmilkov deleted the vec4 branch September 22, 2017 22:39
mnottheone pushed a commit to mnottheone/deeplearnjs that referenced this pull request Dec 1, 2018
* vectorize min/max/logsumexp/nan shaders

* vectorize reduce sum