feat: profiling in EDA tool #954

SmiteDeluxe · 2024-03-30T14:39:40Z

Closes #929

Summary of Changes

Communication from the EDA tool to the Runner now lives in its own file. It allows appending code to the executed pipeline in order to compute and fetch new information.
As a result, full profiling is already computed, fetched when needed, and displayed in the webview.
This PR also includes a first version of actions triggered from the webview that append code, in the form of filters. That work is to be picked up in a new issue.

Other changes include refactorings and fixes for visual and edge case bugs.

Important to run this:

Some environments might need a change in the Runner for images (plots) to work. That change will be pushed to the Runner repo in PR Safe-DS/Runner#63.

Add to PipelineManager:

from safeds.data.image.containers import Image
import torch

Add to the beginning of the save_placeholder method:

# Recreate images on the CPU device before they are saved
if isinstance(value, Image):
    value = Image(value._image_tensor, torch.device("cpu"))

- runnerApi now handles all EDA requests from VS Code to the Runner, for a start getPlaceholderValue
- in edaPanel, panelsMap is renamed to instancesMap and also stores the RunnerApi instance
- the RunnerApi instance takes services and pipelinePath (which is needed to get the document of the pipeline that is to be extended)
- along with the above, PanelIdentifier and pipelineId can no longer be undefined; this is not needed anymore without the dev methods that start blank EDA sessions, and it is safer and more solid this way

- fixed the bug where revealing an already existing panel resulted in an undefined state; the cause was that the _update() method is only fully executed after the current state has already been looked up
- new variable updateHtmlDone is set to false on either creating a new panel or revealing the current one, and to true once _update() is done; constructCurrentState is only executed once that variable is true or after a timeout of 10s
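The wait on updateHtmlDone could be sketched roughly as follows. This is a minimal sketch and not the code from this PR: the helper `waitForFlag`, its polling interval and its boolean result are assumptions made for illustration; only the flag and the 10s timeout come from the description above.

```typescript
// Hypothetical helper (not from the PR): resolves to true once isDone()
// returns true, or to false if timeoutMs elapses first.
function waitForFlag(isDone: () => boolean, timeoutMs: number, pollMs: number = 50): Promise<boolean> {
    return new Promise<boolean>((resolve) => {
        const start = Date.now();
        const check = (): void => {
            if (isDone()) {
                resolve(true);
            } else if (Date.now() - start >= timeoutMs) {
                resolve(false);
            } else {
                // Not done yet and not timed out: poll again later.
                setTimeout(check, pollMs);
            }
        };
        check();
    });
}
```

A caller would await something like `waitForFlag(() => updateHtmlDone, 10_000)` before constructing the current state, and decide what to do when the result is false.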

- filters:
  - to choose between 1. search string, 2. value range and 3. distinct value, the tableView must decide for each column: numerical => value range; categorical with many values => search string; categorical with few values => distinct value
  - for that, numerical profiling items (that show a % for example) now have a new property "interpretation", which can be "warn" (for missing values), "category" (for this purpose) or "default"
  - this is faster than iterating through categorical columns to count values for huge data, since we already have this information when the profiling is generated
  - for the value range, however, we find min and max ourselves, since otherwise more code in the pipeline and more placeholder value queries would be needed, and this should be faster
  - filters are then in their own component, which decides what to show and calls vscode to initiate the runner code execution
- as deliberated before: selections are no longer cleared by clicks anywhere but only by clicks on main cells, as the global window listener for clearing was getting too convoluted; the rowClicked and columnClicked params are not needed anymore
- the preventClicks store is now set to false on context menu close with a 100ms delay, to allow time to prevent clicks
- handleRightClickEnd decides per menu whether to close and run the cleanup or not; for the filter menu, for example, it does not close if the click was anywhere inside the context menu, which it checks by looping over the HTML elements and their parents
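The filter decision above can be sketched like this. It is only an illustration under assumptions: the type and function names are hypothetical, and the sketch assumes that "category" profiling items are only present when a categorical column has few distinct values.

```typescript
// Names and shapes below are assumptions for illustration, not the PR's types.
type Interpretation = "warn" | "category" | "default";

interface ProfilingItem {
    value: string;
    interpretation: Interpretation;
}

type FilterKind = "valueRange" | "searchString" | "distinctValue";

// Numerical columns get a value range filter. For categorical columns the
// sketch assumes profiling contains "category" items only when the column
// has few distinct values; otherwise a search string filter is used.
function chooseFilterKind(isNumerical: boolean, profiling: ProfilingItem[]): FilterKind {
    if (isNumerical) {
        return "valueRange";
    }
    const hasCategories = profiling.some((item) => item.interpretation === "category");
    return hasCategories ? "distinctValue" : "searchString";
}

// Min and max are computed locally instead of through extra pipeline code.
function minMax(values: number[]): { min: number; max: number } | undefined {
    if (values.length === 0) {
        return undefined;
    }
    let min = Infinity;
    let max = -Infinity;
    for (const v of values) {
        if (v < min) min = v;
        if (v > max) max = v;
    }
    return { min, max };
}
```

Reusing the profiling result avoids a second pass over large categorical columns, which is the performance argument made in the commit message.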

- pipelineId renamed to pipelineExecutionId, which makes more sense
- multiple pipelines:
  - eda from context now gets the exact AST node of the placeholder using the range of the executed context, and from there the pipeline container => pipeline name
  - the pipeline name is passed to execute pipeline, where it is now required
  - it is also sent to eda, where 1. it is used for the tableIdentifier as pipelineName + '.' + tableName (tableName is a new param, used and passed outside of tableIdentifier) and 2. it is passed to the runner for pipeline execution
  - in Runner, the pipeline in question is found and the new code is added in front of its closing }
- getStateByPlaceholder is now getTableByPlaceholder; the transformation to a state object happens in the eda class, where table name and table identifier are known, so this method now only contains the relevant stuff
- createOrShow is now async, as are the calling register command methods
- runnerApi is now an instance variable of the panel
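The identifier scheme and the insertion in front of the closing brace can be sketched as follows. This is a sketch under assumptions: both function names are hypothetical, and `pipelineEndOffset` is assumed to be the text offset just past the closing `}` of the pipeline in question.

```typescript
// Builds the identifier described above: pipelineName + '.' + tableName.
function tableIdentifier(pipelineName: string, tableName: string): string {
    return pipelineName + "." + tableName;
}

// Inserts newCode in front of the closing brace of one pipeline.
// Assumption: pipelineEndOffset is the offset just past that pipeline's "}".
function insertBeforeClosingBrace(documentText: string, pipelineEndOffset: number, newCode: string): string {
    // Search backwards, starting at the last character of the pipeline.
    const braceIndex = documentText.lastIndexOf("}", pipelineEndOffset - 1);
    if (braceIndex === -1) {
        throw new Error("No closing brace found for the pipeline");
    }
    return documentText.slice(0, braceIndex) + newCode + "\n" + documentText.slice(braceIndex);
}
```

Working from a known end offset instead of matching the whole document keeps the insertion inside the right pipeline when a file contains several of them.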

- got rid of an excess profiling banner div:
  - apparently no methods for table resize/TableSpace are needed anymore; everything is handled by HTML itself
  - this means I also cannot change much about that mechanism, other than the min width, which is set by setting the width of the elements on startup
  - it also means a lot less code and complexity!
- savedColumnWidths is now a svelte store that the table subscribes to, so it updates correctly; it is only needed on resize and reorder (when letting the column go); there is also no manual setting of style for this anymore
- the automatic handling by HTML meant that reordering did not properly take the column out of the table, so there is now a "reorderPrototype" of a column header that is used for the under-cursor display and updates with the relevant data, while the column is set to "display: none" in the table
- maybe use the min width as initial width if the table is being stretched, so that increasing the size of a column does not shrink the others, but how to decide whether it is in full view or not?
- also, full view makes scrolling for fixed stuff lag?? Maybe a visible scroll bar or an extra tiny div
- fixed that full view makes fixed stuff lag by making the table width 100.1% instead of 100%, so it is always a tiny bit out of view, which causes a scroll to exist
- increased the scroll buffer a bit to make scrolling more fluent
- the table text can no longer be seen through the borders of the headers/profiling when the table is scrolled:
  - there is an absolute div at 100% width at the top with the bright bg color and a normal height of 2 * rowHeight
  - if profiling is expanded, the height is set with a delay (because of the height animation of profilingInfo) to 2 * rowHeight + profilingInfo height; this is not the complete height, as it does not include borders for example, but it is enough to cover all bg space that lets text through
  - the top prop of this div also = scrollTop
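The height of the covering div follows directly from the numbers above. A minimal sketch, assuming pixel values and a hypothetical function name:

```typescript
// Hypothetical helper: height of the absolute div that hides table text
// behind the headers/profiling. Borders are deliberately not included.
function coverHeight(rowHeight: number, profilingExpanded: boolean, profilingInfoHeight: number): number {
    const base = 2 * rowHeight;
    return profilingExpanded ? base + profilingInfoHeight : base;
}
```

For example, with a row height of 30 and an expanded profiling area of 120, the div would be 180 high; collapsed, it would be 60.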

- changed profiling to always include value, not name, as we display values
- thus the image string is also stored as "value", as is the string when just using the "text" type (prev. "name" type)

- profiling placeholder name gen now not random but with codegen prefix + incr counter number
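Counter-based placeholder names could look roughly like this. The prefix value, the class name and the exact name layout are assumptions for illustration; the real CODEGEN_PREFIX value is not shown in this PR description.

```typescript
// Hypothetical prefix value; the real CODEGEN_PREFIX is not shown here.
const CODEGEN_PREFIX = "__gen_";

// Generates deterministic placeholder names from a prefix and a counter,
// instead of random names.
class PlaceholderNameGenerator {
    private counter = 0;

    next(baseName: string): string {
        this.counter += 1;
        return CODEGEN_PREFIX + baseName + "_" + this.counter;
    }
}
```

Compared to random names, an incrementing counter gives reproducible names within a session and cannot collide with itself.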

SmiteDeluxe · 2024-04-03T13:10:19Z

@lars-reimann @WinPlay02 The Table hashing now appears to be working, but the getColumn() call fails for another reason:

My code is still:

package b

from safeds.data.tabular.containers import Table

pipeline mainpipeline {
    val Tithhghgani = Table.fromCsvFile("/home/jonas/titanic.csv");
    val test = Tithhghgani.getColumn("Sex");
    val test2 = test.missingValueRatio();
    val Cereaddfkf5fl = Table.fromCsvFile("/home/jonas/cereal.csv");
}

pipeline secondary {
    val Titadssfsffjgfg3d832 = Table.fromCsvFile("/home/jonas/netflix_titles.csv");
    val Cereaf2gfgfff8dl = Table.fromCsvFile("/home/jonas/tweets.csv");
}

My steps were to merge main into this branch, run npm i and npm run langium:generate, then pull Runner and run poetry install.

WinPlay02 · 2024-04-03T13:20:24Z

@SmiteDeluxe Could you attach the full log here? The messages before the error would be helpful to find the issue

SmiteDeluxe · 2024-04-03T13:26:40Z

@WinPlay02 Here you go

WinPlay02 · 2024-04-03T13:51:11Z

@SmiteDeluxe I see, thanks. I'm looking into a fix

packages/safe-ds-vscode/src/extension/mainClient.ts

packages/safe-ds-vscode/src/extension/eda/apis/runnerApi.ts

packages/safe-ds-eda/src/components/TableView.svelte

- fix: generating memoized class member calls (on non-static members) should still use the class for Python code generation
- added test case

Fixes the (last) issue in #954

Co-authored-by: megalinter-bot <[email protected]>

- addToAndExecutePipeline now also rejects on runtime error, so we don't have to wait for the timeout
- RunnerAPI now gets passed the pipelineNode, from which the end of the pipeline is found, as the previous approach could lead to bugs if the pattern did not match exactly
- some refactoring

SmiteDeluxe · 2024-04-04T11:12:10Z

@WinPlay02 Now the getColumn() call on its own works, just like missingValueRatio(). The problem is that a chained call of .getColumn("Sex").missingValueRatio() does not work and throws the error below:

code:

package b

from safeds.data.tabular.containers import Table
from safeds.data.tabular.containers import Column

pipeline mainpipeline {
    val Titanghijjihfgk = Table.fromCsvFile("/home/jonas/titanic.csv");
    val test = Titanghijjihfgk.getColumn("Sex").missingValueRatio();
    val Cereaddfkf5fl = Table.fromCsvFile("/home/jonas/cereal.csv");
}

pipeline secondary {
    val Titadssfsffjgfg3d832 = Table.fromCsvFile("/home/jonas/netflix_titles.csv");
    val Cereaf2gfgfff8dl = Table.fromCsvFile("/home/jonas/tweets.csv");
}

This worked before with my own fake stubs.

WinPlay02 · 2024-04-04T11:56:46Z

@SmiteDeluxe A fix is prepared in #987

SmiteDeluxe · 2024-04-05T09:44:24Z

@WinPlay02 @lars-reimann profiling is working again now!

@lars-reimann Whenever you have time, could you look over the open conversations again and see if anything else is needed before we can merge?

lars-reimann

Congratulations on finishing this. It looks great and works well.

## [0.10.0](v0.9.0...v0.10.0) (2024-04-06)

### Features

* add settings to enable inlay hints individually ([#992](#992)) ([b0f3e62](b0f3e62))
* filter suggestions by node type ([#999](#999)) ([8d22e67](8d22e67)), closes [#998](#998)
* forbid instance and static class members with same name ([#988](#988)) ([7fa6fd4](7fa6fd4))
* improved completion provider ([#997](#997)) ([61e776b](61e776b)), closes [#41](#41)
* inlay hints for inferred types of lambda parameters ([#993](#993)) ([c064e0e](c064e0e))
* mark entire type cast as wrong if cast is impossible ([#991](#991)) ([72d4e2e](72d4e2e))
* profiling in EDA tool ([#954](#954)) ([854122c](854122c)), closes [#929](#929)
* require `safe-ds-runner>=0.8.0,<0.9.0` ([#976](#976)) ([1003e6c](1003e6c))
* resolve name paths in `{@link}` tags in documentation ([#978](#978)) ([b59d6f0](b59d6f0))

### Bug Fixes

* catch internal errors caused by wrong synthetic nodes created by completion provider ([#1001](#1001)) ([8a6ab99](8a6ab99))
* chained memoized calls ([#987](#987)) ([df89291](df89291))
* correctly import declarations for member functions ([#983](#983)) ([79f9b08](79f9b08))
* error in Python generator for assignments with class/enum variant call as RHS ([#977](#977)) ([46b2bb2](46b2bb2)), closes [#975](#975)
* generation of memoized class member calls ([#982](#982)) ([ed06aef](ed06aef))
* generation of Python imports ([#979](#979)) ([f69d836](f69d836)), closes [#974](#974)
* invalid Python code generated for constructor calls ([#981](#981)) ([c7d006f](c7d006f)), closes [#980](#980)
* Python generation for type casts ([#1000](#1000)) ([621ab86](621ab86))

lars-reimann · 2024-04-06T18:46:16Z

🎉 This PR is included in version 0.10.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

SmiteDeluxe added 30 commits February 29, 2024 20:52

fix: waitForUpdateHtmlDone now not part of constructCurrentState anymore

bc1ac24

fix: small things & feat: first runnerApi methods that append pipeline

5afb4df

fix: properly show "0" values in table

00a491d

feat: get and set basic profiling for new table

4709cc3

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

7a0481b

fix: col/row deselection if click row/col again and only one

926d79c

fix: profiling minimal better styling

0b368f7

feat: optionally pass table to getProfiling for performance

f220bf5

feat: more profiling info & profiling styling

e953262

fix: ProfilingInfo own comp & better profiling height calc

cebbda6

fix: profiling fetch also if state existing but no profiling info

2789859

fix: icon colors & profiling height bug & warn icon if missing vals

2cb1da6

feat: filter and sort icons in col headers

6ee653e

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

0d476f8

feat: CODEGEN_PREFIX in front of generated placeholders

8350e3c

fix: rename current warn to error & no color in state, decided in svelte

5a532c9

fix: updateTableSpace column names trimmed & minor things

18a24e0

fix: webview reload and profiling semantic

95734a4

fix: change state profiling semantics

8126561

fix: placeholder generation now incr number

608cce2

feat: profiling histograms

a4fd31c

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

d27a4de

refactor: some comments and code, more readable

7306942

fix: image in top of profiling

6e3ab4f

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

5e951a6

SmiteDeluxe added 2 commits April 3, 2024 14:55

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

131aab7

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

5784f35

Merge branch 'main' into 929-eda-append-code-to-pipelines

cf64b65

WinPlay02 mentioned this pull request Apr 3, 2024

fix: generation of memoized class member calls #982

Merged

lars-reimann reviewed Apr 3, 2024

packages/safe-ds-eda/src/components/TableView.svelte (outdated)

WinPlay02 and others added 4 commits April 3, 2024 22:17

Merge branch 'main' into 929-eda-append-code-to-pipelines

7f5b64b

style: apply automated linter fixes

43ff883

Merge branch 'main' into 929-eda-append-code-to-pipelines

1d98097

SmiteDeluxe added 2 commits April 5, 2024 10:57

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

98c113d

Merge remote-tracking branch 'origin/main' into 929-eda-append-code-to-pipelines

12e8fd6

Merge branch 'main' into 929-eda-append-code-to-pipelines

2b8cbf5

lars-reimann changed the title ~~feat: add code to pipelines (#929) & in the end full profiling and refactors/fixes~~ feat: profiling in EDA tool Apr 6, 2024

Merge branch 'main' into 929-eda-append-code-to-pipelines

697c66f

lars-reimann approved these changes Apr 6, 2024

lars-reimann merged commit 854122c into main Apr 6, 2024
7 checks passed

lars-reimann deleted the 929-eda-append-code-to-pipelines branch April 6, 2024 17:53

lars-reimann added the released label Apr 6, 2024
