Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Core Feature][flytekit] Ability to track resource utilization and gpu utilization of a python task as a plugin #1509

Open
kumare3 opened this issue Sep 21, 2021 · 6 comments
Labels
enhancement New feature or request stale

Comments

@kumare3
Copy link
Contributor

kumare3 commented Sep 21, 2021

Motivation: Why do you think this is important?
When users are running tasks that use GPU's, CPUs and Memory, it might be useful to allow some meta-outputs that render this information

Goal: What should the final outcome look like, ideally?
There are many libraries like https://github.com/gpuopenanalytics/pynvml etc that can be used to collect stats for various resources and we could either stream or outputs these stats as a rendered PNG at the end of the run. The rendering can be done either in python or on the server and the outputs should be meta outputs.

Describe alternatives you've considered
Use other external libraries like Wandb etc

[Optional] Propose: Link/Inline OR Additional context
Write a new plugin flytekit-resource-monitoring that can load components that can output certain meta outputs.
For this, we will need support for meta-outputs.

@kumare3 kumare3 added enhancement New feature or request untriaged This issues has not yet been looked at by the Maintainers and removed untriaged This issues has not yet been looked at by the Maintainers labels Sep 21, 2021
@kumare3
Copy link
Contributor Author

kumare3 commented Sep 21, 2021

cc @EngHabu / @cosmicBboy / @eapolinario
This will be extremely useful in debugging ML jobs. We could use the same technique later to render loss functions etc for a job

@eapolinario
Copy link
Contributor

Just to be clear, we could separate the work to enable meta-outputs from the Logical types work in order to work on this new flytekit plugin, right, @kumare3 ?

@kumare3
Copy link
Contributor Author

kumare3 commented Sep 21, 2021

Yup, I think we need meta outputs first and then this if at all

@kumare3 kumare3 changed the title [Core Feature][flytekit] Ability to track resource utilization and gpu utilization as a plugin [Core Feature][flytekit] Ability to track resource utilization and gpu utilization of a python task as a plugin Oct 2, 2021
@github-actions
Copy link

Hello 👋, This issue has been inactive for over 9 months. To help maintain a clean and focused backlog, we'll be marking this issue as stale and will close the issue if we detect no activity in the next 7 days. Thank you for your contribution and understanding! 🙏

@github-actions github-actions bot added the stale label Aug 27, 2023
@github-actions
Copy link

github-actions bot commented Sep 4, 2023

Hello 👋, This issue has been inactive for over 9 months and hasn't received any updates since it was marked as stale. We'll be closing this issue for now, but if you believe this issue is still relevant, please feel free to reopen it. Thank you for your contribution and understanding! 🙏

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 4, 2023
@eapolinario eapolinario reopened this Nov 2, 2023
@github-actions github-actions bot removed the stale label Nov 3, 2023
Copy link

Hello 👋, this issue has been inactive for over 9 months. To help maintain a clean and focused backlog, we'll be marking this issue as stale and will engage on it to decide if it is still applicable.
Thank you for your contribution and understanding! 🙏

@github-actions github-actions bot added the stale label Jul 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request stale
Projects
None yet
Development

No branches or pull requests

2 participants