Numba improvements #144

sglyon · 2015-05-04T13:29:34Z

I'm just opening this as a PR so that it is easy to see what has been changed. We can still push to the branch to make more changes to the PR.

I have no intention of merging this myself.

…here numba is not installed

`numba_installed` flag introduced

This reverts commit 8ea1206.

Following the issue and fix numba/numba#1103 and numba/numba#1104 Test should fail for this commit

…version that works in 0.18.2

Test should fail for this commit

For use with Numba <= 0.18.2

… a[n-1] <= v Docstring and test added

…dom seeds

coveralls · 2015-05-04T13:41:01Z

Coverage decreased (-1.06%) to 89.37% when pulling 89302ec on numba_improvements into 8a19047 on master.

oyamad · 2015-05-04T14:34:01Z

@spencerlyon2 Thanks for issuing the PR!

Summary of changes and issues to discuss:

gth_solve.py: The main part is jitted in a new function called _gth_solve_jit with nopython mode; if not numba_installed, the previous NumPy version is executed. The test does not cover the NumPy version. See GTH_SOLVE: Numba version #103 for details (this in particular).
mc_tools.py: mc_sample_path is autojitted if numba_installed; a NumPy version mc_sample_path_numpy is also implemented, which overwrites mc_sample_path if not numba_installed. The only difference between the two version is that the former uses the custom searchsorted, which is contained in utilities.py, while the latter uses np.searchsorted. See Numba version of mc_sample_path #137 for details.
utilities.py: Contains searchsorted, which is a jitted and simplified version of np.searchsorted; it is an alias of np.searchsorted if not numba_installed. This function will no longer be need and should be discarded once np.searchsorted is supported by Numba.
numba_import_fail_message is in common_messages.py; numba_installed is defined with try from Numba import jit except ... in external.py.
Issues on cartesian are to be discussed in [Update] Cartesian Module #143.
The issue of keeping/preparing a NumPy counterpart for each routine implemented with Numba, one of the main issues in Numpy version to fall back on in lss.py #132. Will we drop support for NumPy version sometime in the future, and if so, at what occasion? (I feel somehow uneasy when I see the duplications...)
These may belong to other PRs:
1. Possible changes in the API of MarkovChain/mc_sample_path: see this and this.
2. Support for sparse matrices in MarkovChain (I have code for mc_sample_path but not yet for gth_solve).
This is not really important, but the "coverage" as displayed on the top page gets decreased as we write more Numba implemented code, since the coverage module inspects the jitted functions to not covered by the tests. See Code coverage of jitted functions - Google Groups.

mmcky · 2015-05-05T03:17:57Z

@oyamad Excellent summary. I think most of these discussions points are captured in other open issues now - so I don't see any reason not to merge this in. Do you?

Point 7 (i). I have opened an issue to track this (#146) based on content in #137
Point 7 (ii). I have opened an issue to track this (#145).

Point 8. Interesting issue - I agree not entirely important at this stage - but good to be aware of. Thanks for pointing this out.

oyamad · 2015-05-05T07:22:30Z

@mmcky thanks.

I still have a concern about the duplication in mc_tools.py. I think mc_sample_path_numpy is not necessary and the following is sufficient:

def mc_sample_path(P, init=0, sample_size=1000):
    """
    See Section: DocStrings below
    """
    n = len(P)

    # CDFs, one for each row of P
    cdfs = np.empty((n, n), order='C')  # see issue #137#issuecomment-96128186
    np.cumsum(P, axis=-1, out=cdfs)

    # Random values, uniformly sampled from [0, 1)
    u = np.random.random(size=sample_size)

    # === set up array to store output === #
    X = np.empty(sample_size, dtype=int)
    if isinstance(init, int):
        X[0] = init
    else:
        cdf0 = np.cumsum(init)
        X[0] = searchsorted(cdf0, u[0])

    # === generate the sample path === #
    for t in range(sample_size-1):
        X[t+1] = searchsorted(cdfs[X[t]], u[t+1])

    return X

if numba_installed:
    mc_sample_path = jit(mc_sample_path)

If not numba_installed, searchsorted works as np.searchsorted (with side='right'), as defined in utilities.py.

mmcky · 2015-05-05T11:57:57Z

@oyamad I see what you are saying now - thanks. I think this is a good idea. Effectively it boils down to moving the numpy vs numba comparison down a level to the now internal searchsorted utility functions. So we should update the test to compare jitted searchsorted with the numpy searchsorted (with side="right") to ensure both give the same result over time. In your implementation above the rest of mc_sample_path is exactly the same logically jitted or not.

The only downside I see is in this implementation we won't be able to check the function mc_sample_path final output between the jit version and the non jit version easily - but that would only effectively be checking for jit errors which are probably less of an issue versus logic errors etc.

@jstac I will leave the final decision with you. But @oyamad suggestion does reduce duplication in this special case.

Performance:
I had a quick look at performance (http://nbviewer.ipython.org/github/mmcky/work-notebooks/blob/master/quantecon/PerformanceCheckMCTools.ipynb). Sorry didn't suppress %timeit text but the graphs in Out[7] and Out[15] suggests that numba function is best for anything greater than a sample size of ~70. localB is what would happen in the case of no numba installed and is the slowest. Interestingly the non-numpy looped version of searchsorted runs faster even in pure python which is localA. I am not sure why this is.

oyamad · 2015-05-05T14:15:45Z

@mmcky Thank you for the performance comparison. It is very interesting.
I suspect that qe.numpy and localB would be faster than localA for matrices of larger size.

mmcky · 2015-05-05T23:44:03Z

@oyamad Yeah I think you are right. Last night I had a quick look in our library for a simple utility for producing a Transition Matrix of size n. I didn't see one - I could produce some performance checks through the matrix dimensions this afternoon.

mmcky · 2015-05-06T08:42:57Z

@oyamad Indeed. I added a really simple test which is just using a random matrix P with each row normalised to 1. localB is much better than localA. (http://nbviewer.ipython.org/github/mmcky/work-notebooks/blob/master/quantecon/PerformanceCheckMCTools.ipynb)

jstac · 2015-05-06T10:50:49Z

@oyamad @mmcky You've done a lot of work. It's looking really good. I like the look of the code and I'm keen to merge this.

I agree with @oyamad 's suggestion about cutting mc_sample_path_numpy. There's no need for it. As long as the home grown version of searchsorted is properly tested --- which it is --- there's no need to keep mc_sample_path_numpy just for the sake of testing.

So I propose that we adopt @oyamad 's suggestion, cut this function, and then merge.

coveralls · 2015-05-06T12:36:17Z

Coverage decreased (-1.05%) to 89.38% when pulling 008231c on numba_improvements into 8a19047 on master.

oyamad · 2015-05-06T12:38:32Z

I deleted mc_sample_path_numpy and test_mc_sample_path_functions. Now no test is there for mc_sample_path.

jstac · 2015-05-07T07:24:15Z

Thanks for all this work. I'll merge now and at the same time open an issue to add a test for mc_sample_path. Great to have this merged!

Numba improvements

sglyon · 2015-05-07T16:06:57Z

oyamad and others added 30 commits November 28, 2014 12:34

GTH_SOLVE: Replaced with its Numba version

d057523

Minor edit

0bbb5f9

Numba version of mc_sample_path: Preliminary version

53a3451

Update LSS with common numba warning message

cb37391

Adding numpy version and adjusting @jit to using jit() for the case w…

02f7664

…here numba is not installed

Merge branch 'numba_improvements' into gth_solve_jit

d7ce11d

Numpy version added back

c51ce3f

`numba_installed` flag introduced

Change the order of some of the dict elements in testset

8ea1206

Revert "Change the order of some of the dict elements in testset"

0ace6ef

This reverts commit 8ea1206.

Add test for matrices with C- and F-contiguous orders

2055102

Following the issue and fix numba/numba#1103 and numba/numba#1104 Test should fail for this commit

Fix: Add order='C' when making a copy A1 of the input A

3665910

Minor edits

4cd7e7a

Updated numba import statements to use the common external module

3127ca2

Removing mc_sample_path() numba 0.17 version in preference for split …

45004f0

…version that works in 0.18.2

Add test for simulate with matrices with C- and F-orders

617ab43

Test should fail for this commit

Minor edit

2eaec7d

Fix: Preallocate cdfs with order='C'

0e24d87

For use with Numba <= 0.18.2

Minor edit

d726b14

search_cdf renamed to searchsorted; work properly for inputs s.t.…

f0bae7f

… a[n-1] <= v Docstring and test added

Adding test for numpy and numba functions ... needs discussion on ran…

17e7100

…dom seeds

Adding Utilities Module and migrated searchsorted

52a67f6

Updates to docstrings for both mc_sample_path functions

4dec740

Removing Tabs in Preference for Spaces

972e258

Bug fix - forgot an underscore

16fec88

Updating test with 1 x seed for each function run

cdf0b09

test_searchsorted moved from test_mc_tools.py to test_utilities.py

6e951e3

Missing import added; minor edits to comply with PEP 8

36e7518

The first searchsorted renamed to _searchsorted; docstring copied

99ad757

PEP 8 compliance

2c70760

Merge branch 'numba_improvements' into gth_solve_jit

8b7f4af

Numba import statement changed to use the external module

89302ec

oyamad added 4 commits May 6, 2015 21:03

mc_sample_path_numpy and test_mc_sample_path_functions deleted

c47e1aa

Minor edits

c1bbd76

mc_sample_path_numpy deleted from imports in test_mc_tools.py

bd8dfb4

mc_sample_path_numpy.__doc__ deleted

008231c

jstac added a commit that referenced this pull request May 7, 2015

Merge pull request #144 from QuantEcon/numba_improvements

4ee517e

Numba improvements

jstac merged commit 4ee517e into master May 7, 2015

jstac deleted the numba_improvements branch May 7, 2015 07:24

oyamad mentioned this pull request Apr 12, 2017

TEST: Add tests for game_theory/lemke_howson.py #302

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Numba improvements #144

Numba improvements #144

sglyon commented May 4, 2015

coveralls commented May 4, 2015

oyamad commented May 4, 2015

mmcky commented May 5, 2015

oyamad commented May 5, 2015

mmcky commented May 5, 2015

oyamad commented May 5, 2015

mmcky commented May 5, 2015

mmcky commented May 6, 2015

jstac commented May 6, 2015

coveralls commented May 6, 2015

oyamad commented May 6, 2015

jstac commented May 7, 2015

sglyon commented May 7, 2015

Numba improvements #144

Numba improvements #144

Conversation

sglyon commented May 4, 2015

coveralls commented May 4, 2015

oyamad commented May 4, 2015

mmcky commented May 5, 2015

oyamad commented May 5, 2015

mmcky commented May 5, 2015

oyamad commented May 5, 2015

mmcky commented May 5, 2015

mmcky commented May 6, 2015

jstac commented May 6, 2015

coveralls commented May 6, 2015

oyamad commented May 6, 2015

jstac commented May 7, 2015

sglyon commented May 7, 2015