improve performance of _determineActualTypes #78

davidchambers · 2016-06-25T00:01:07Z

Commit message:

The current algorithm is inefficient for unary and binary types. Given an array containing 1000 elements, for example, we first determine the types of each element, resulting in an array of 1000 arrays of types. We then find the intersection of these "sets". This commit introduces a different approach: for each element, we refine the set of types of which all previous elements are members rather than filtering the whole environment each time. This is significantly more efficient for large arrays.

I ran the following command to observe the performance improvement:

$ time node --eval 'const R = require("ramda"), $ = require("."), a = $.TypeVariable("a"), def = $.create({checkTypes: true, env: $.env}), id = def("id", {}, [a, a], x => x); id(R.range(0, 100 * 1000))'

On my computer this takes about 1.5 seconds on dc-perf compared with about 5 seconds on master.

100,000 elements was about as large as I could make the array without blowing the stack on master.

davidchambers · 2016-07-04T05:03:16Z

test/index.js

+                   '\n' +
+                   '2)  Left(/XXX/) :: Either RegExp ???\n' +
+                   '\n' +
+                   'Since there is no type of which all the above values are members, the type-variable constraint has been violated.\n'));


This is incorrect. We should underline the first and third arguments in this case.

This is fixed in #79. Rather than spend time backporting the fix to this branch, I updated the expected output then commented out the failing assertion.

The current algorithm is inefficient for unary and binary types. Given an array containing 1000 elements, for example, we first determine the types of each element, resulting in an array of 1000 arrays of types. We then find the intersection of these "sets". This commit introduces a different approach: for each element, we refine the set of types of which all previous elements are members rather than filtering the whole environment each time. This is significantly more efficient for large arrays.

davidchambers mentioned this pull request Jun 26, 2016

define $.Function and $.UnaryTypeVariable #79

Merged

davidchambers reviewed Jul 4, 2016
View reviewed changes

davidchambers force-pushed the dc-perf branch from c3fa873 to 2f94dbc Compare July 4, 2016 05:21

davidchambers merged commit 0b4db18 into master Jul 4, 2016

davidchambers mentioned this pull request Feb 23, 2017

Remove one of the env arguments from _determineActualTypes #122

Merged

davidchambers mentioned this pull request Dec 10, 2017

prevent unnecessary recursive applications of _determineActualTypes #177

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve performance of _determineActualTypes #78

improve performance of _determineActualTypes #78

davidchambers commented Jun 25, 2016

davidchambers Jul 4, 2016

davidchambers Jul 4, 2016

improve performance of _determineActualTypes #78

improve performance of _determineActualTypes #78

Conversation

davidchambers commented Jun 25, 2016

davidchambers Jul 4, 2016

Choose a reason for hiding this comment

davidchambers Jul 4, 2016

Choose a reason for hiding this comment