
Performance of Code Completion for LARGE files #23

Closed
wiredprairie opened this issue Nov 9, 2017 · 15 comments
Labels
area-intellisense LSP-related functionality: auto-complete, docstrings, navigation, refactoring, etc. feature-request Request for new features or functionality

Comments

@wiredprairie

wiredprairie commented Nov 9, 2017

Environment data

VS Code version: 1.18.0
Python Extension version: 0.7.0
Python Version: 3.6.0
OS and version: Microsoft Windows [Version 10.0.15063]

Actual behavior

I've written a tool that, given a tabular data structure with many thousands of columns, generates a new Python class definition (into a .py file). The resulting file is used by data scientists to access the columns of the data structure programmatically.

Each column currently results in generated code following this pattern:

@property
def color(self):
    """
    This is the documentation.
    This is the second line of documentation
    """
    return DataColumn(self, 'COLOR')
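The kind of generator described above could be sketched along these lines. This is a hypothetical illustration, not the author's actual tool; `generate_class` and its signature are made up for the example, and `DataColumn` is only referenced in the emitted source text:

```python
def generate_class(class_name, columns):
    """Return Python source for a class with one @property per column.

    `columns` is a list of (NAME, docstring) pairs; each becomes a
    property that wraps DataColumn access, matching the pattern above.
    """
    lines = [f"class {class_name}:"]
    for name, doc in columns:
        lines += [
            "    @property",
            f"    def {name.lower()}(self):",
            '        """',
            f"        {doc}",
            '        """',
            f"        return DataColumn(self, {name!r})",
            "",  # blank line between properties
        ]
    return "\n".join(lines)

source = generate_class("Table", [("COLOR", "The color column."),
                                  ("SIZE", "The size column.")])
print(source.splitlines()[1])  # → "    @property"
```

With tens of thousands of columns, the same loop simply emits tens of thousands of these property blocks, which is how the 40,000+ line files arise.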

For generated files with hundreds of columns, the VS Code Python extension works spectacularly. However, for large files, it slows to a crawl. For example, one data structure results in a Python file with 40,000+ lines.

When using code completion in VS Code, it takes 20+ seconds to show the list of properties. Needless to say, that's too slow. Unfortunately, it's not a one-time cost either: every invocation of code completion incurs the same delay.

I can adjust the output of my code generation tool, but I don't know what I could change to reduce the parse time for code completion.

As an experiment, I removed the docstrings from the Python file; that had little, if any, noticeable effect on the time taken to show matches.

Expected behavior

Fast code completion suggestion list (under 1 second).

Steps to reproduce:

A very large Python file containing a class with thousands of properties is needed. I've created this public gist with an example.

Logs

Output from Python output panel
Empty.
Output from Console window (Help->Developer Tools menu)
Empty

@brettcannon brettcannon added awaiting 1-decision area-intellisense LSP-related functionality: auto-complete, docstrings, navigation, refactoring, etc. feature-request Request for new features or functionality labels Nov 14, 2017
@brettcannon
Member

Unfortunately, 40,000 lines of Python code is simply a large file to process. We use Jedi for our intellisense, and it doesn't seem to handle a file of that magnitude well. I'm assuming you still want intellisense, just faster? Or would you rather turn it off to avoid the penalty?

Otherwise, we have discussed trying to share an intellisense engine between us and PTVS (it would probably be a new one, not the one it currently has), but it would require downloading .NET, and no concrete plans beyond "it's a possibility" have been made yet.

@wiredprairie
Author

Jedi is unfortunately slow. I just confirmed that with a simple test.

import time
import jedi

with open('example.py', 'r') as example:
    source = example.read()

script = jedi.Script(source)
start = time.perf_counter()
completions = script.completions()  # <<< WAIT (newer Jedi versions use script.complete() instead)
print(time.perf_counter() - start)

This may not be an adequate comparison or usage, but with the code above it takes less than 5 seconds to build the list of completions.

We'd definitely want Intellisense. The classes wouldn't be usable without Intellisense.

One issue is that the penalty for extracting the completion data from a Python file appears to be paid every time Intellisense is requested, rather than the results being cached.

However, I'm not sure users would accept a 20-second delay, even once per VS Code launch per file. (And even a 5-second delay, if that's the fastest Jedi can manage, probably isn't acceptable more than once a session.)

Ideally, the Jedi results would be cached until they became stale or were invalidated. Since the files I'm talking about are generated and change very infrequently, a one-time Jedi-to-cache-file step would be a very effective way of reducing the time taken by further uses of Intellisense. Further, in our case, it would be great if the cache files could be deployed to a cache directory or as side-car files: we'd Jedi-compile them once per Python file generation, and then no user's VS Code instance would ever need to process the file locally.

If the results were to be cached, especially for larger Python files, I would expect many Python extension users in VS Code would benefit.
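The side-car cache idea above could be sketched as follows. This is a minimal illustration, not anything the extension actually implements; the `.completions.json` layout, `cache_path`, and `load_or_build` names are all made up for the example, and it assumes the completion *names* alone are enough to rebuild a suggestion list:

```python
import hashlib
import json
import pathlib


def cache_path(py_file):
    """Side-car cache file next to the generated .py file (hypothetical layout)."""
    return pathlib.Path(py_file).with_suffix(".completions.json")


def load_or_build(py_file, build):
    """Return cached completion names, rebuilding only when the source changes.

    `build` is a callable taking the source text and returning a list of
    names (e.g. a one-time Jedi run).
    """
    src = pathlib.Path(py_file).read_bytes()
    digest = hashlib.sha256(src).hexdigest()
    cache = cache_path(py_file)
    if cache.exists():
        data = json.loads(cache.read_text())
        if data.get("digest") == digest:  # source unchanged: skip the slow build
            return data["names"]
    names = build(src.decode("utf-8"))
    cache.write_text(json.dumps({"digest": digest, "names": names}))
    return names
```

Because the cache is keyed on a content hash, the side-car file could be generated once alongside the .py file and shipped with it, so individual editors would never pay the parse cost.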

@wiredprairie
Author

@brettcannon -- any thoughts about this issue?

@brettcannon
Member

In the new year we're hoping to start looking into our intellisense performance and quality so we can begin to address the issues you and others are running into with it.

@MikhailArkhipov

This may be improved by the fix for #152 (method docs are no longer loaded immediately, just names).

@MikhailArkhipov

@wiredprairie - is it better in 0.9?

@MikhailArkhipov MikhailArkhipov self-assigned this Jan 5, 2018
@wiredprairie
Author

@MikhailArkhipov - unfortunately, it still takes 20+ seconds.

@brettcannon
Member

@wiredprairie would it be possible to upload a test file for us to benchmark against?

(BTW, we are upgrading Jedi to 0.12.0; it probably won't resolve this issue, but you never know 😉)

@wiredprairie
Author

@brettcannon This is still a good example.

@brettcannon
Member

@wiredprairie Oops, sorry for not noticing the hyperlink in the initial message!

@brettcannon brettcannon added type-perf and removed bug Issue identified by VS Code Team member as probable bug feature-request Request for new features or functionality labels Apr 20, 2018
@MilkyHearts

MilkyHearts commented Jun 15, 2018

It's been almost a year and IntelliSense is still so slow...
This is really holding me back from using VS Code.

@brettcannon
Member

[ @MilkyHearts I edited your ✉️ ⬆️ to come off as more 😃 and less 😠 ]

@brettcannon
Member

@MilkyHearts @wiredprairie I can't make any promises about stability, accuracy, etc., but if you go into your settings and set "python.jediEnabled": false, VS Code will prompt you to restart and then download our experimental language server. It's written in C# and taken from our Python workload for Visual Studio, so it should be much faster.
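In settings.json, that looks like the following (a minimal fragment; the rest of your settings file is unaffected):

```json
{
    "python.jediEnabled": false
}
```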

We are also working on it regularly, so if you end up wanting to try newer bits later before we release a public preview, just go into the extension's directory and delete the Analysis directory; the extension will download the latest bits when you restart VS Code again.

But as I said, this is not even public preview for a reason, so no promises about anything. 😁

@MilkyHearts

@brettcannon OMG, THANK YOU! The autocomplete is instant after setting jediEnabled to false. I'm so happy.

rchiodo pushed a commit that referenced this issue Oct 26, 2018
@brettcannon brettcannon added the feature-request Request for new features or functionality label Jun 3, 2019
@brettcannon
Member

Closing as this is an upstream issue which we don't have direct control or influence over.

@ghost ghost removed the needs upstream fix label Jul 29, 2019
@lock lock bot locked as resolved and limited conversation to collaborators Aug 5, 2019