Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding custom plots to Dask Dashboard #3503

Open
mronda opened this issue Feb 20, 2020 · 6 comments
Open

Adding custom plots to Dask Dashboard #3503

mronda opened this issue Feb 20, 2020 · 6 comments

Comments

@mronda
Copy link

mronda commented Feb 20, 2020

Hi,
I am running multiple tasks that takes a while to return, so I want to be able to graph some custom data in between to monitor their progress. Can you guys point me to the documentation/code so I can see where to make changes to the Bokeh Dashboard? Thanks in advance!
-Max

@quasiben
Copy link
Member

@mronda I don't think we have docs on customizing the dashboard (though that would be welcome). Instead, I would recommend reading through 2 recent pull requests:

Can I ask what additional data you were hoping visualize ?

@mronda
Copy link
Author

mronda commented Feb 20, 2020

Hi @quasiben , thanks for the response. I checked the first pr but still not getting how I should connect to that server and pass, from my script, the data to be received by Bokeh server. What handler would I need in my script to make that communication? Not too familiar with Bokeh, any help would be a huge time saver right now.
Oh and I am trying to plot the moving average of a model for monitoring purposes.
Thanks in advance !
-Max

@jacobtomlinson
Copy link
Member

I wonder if a section of the dashboard showing published datasets would be useful? If a dataset is already persisted and has a type of int, float or str it could previewed in the dashboard too.

That way you could continuously persist the moving average and view it in the dashboard?

Perhaps this is abusing the published datasets feature a little too much though...

@mronda
Copy link
Author

mronda commented Feb 20, 2020

Hi @jacobtomlinson , So I've tried using client.publish_datasets to do some visualization of my data. The problem I encountered, and correct me if I am wrong, is that I needed to constantly unpublish and publish the data as I could not persist it to the client. I've also tried using a dask.distributed Queue, which worked fine but I am looking into a more pipeline approach where I can use the dashboard provided to monitor the running average. Any recommendations? Maybe a better way to use those collections? Thanks!
-Max

@mrocklin
Copy link
Member

mrocklin commented Feb 20, 2020 via email

@quasiben
Copy link
Member

I think this is the PR, #2169, @mrocklin is referring to.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants