Generalize resource management #2

Open
w568w opened this issue Sep 30, 2024 · 0 comments
Labels
complexity: high Requires fundamental changes or thorough insight into the whole project. enhancement New feature or request

Comments

Owner

w568w commented Sep 30, 2024

Currently, node resources are modeled solely as NVIDIA GPUs. All scheduling revolves around one or more GPUs, and every resource is abstracted as a Device struct that holds information specific to NVIDIA GPUs. It is not possible to specify other kinds of resources, or to submit a task that requires no GPU at all.

Device needs to be generalized into something like Resource, which should at least cover the memory available on each node.
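To make the idea concrete, here is a minimal sketch of what a generalized per-node resource model might look like. It is written in Python purely for illustration and does not reflect the project's actual code or language. `NodeResources`, `TaskRequest`, and `try_allocate` are hypothetical names, and the megabyte unit for memory is an assumption.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class NodeResources:
    """Hypothetical per-node view: free GPU indices plus free memory."""
    free_gpus: set[int] = field(default_factory=set)
    free_memory_mb: int = 0  # unit is an assumption


@dataclass(frozen=True)
class TaskRequest:
    """Hypothetical task requirement; gpus may be 0 for a GPU-less task."""
    gpus: int = 0
    memory_mb: int = 0


def try_allocate(node: NodeResources, req: TaskRequest) -> list[int] | None:
    """Reserve resources on `node` if the request fits.

    Returns the granted GPU indices (possibly empty), or None if the
    request does not fit, in which case `node` is left unchanged.
    """
    if req.gpus > len(node.free_gpus) or req.memory_mb > node.free_memory_mb:
        return None
    granted = sorted(node.free_gpus)[: req.gpus]
    node.free_gpus.difference_update(granted)
    node.free_memory_mb -= req.memory_mb
    return granted
```

With a model along these lines, a request such as `TaskRequest(gpus=0, memory_mb=400)` becomes valid, which is exactly the case the current Device abstraction cannot express.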

P.S. As for CPU, I believe it is easier to implement than memory. Moreover, once memory resource management is implemented, its logic can easily be reused for CPU, so we can set CPU aside for now.
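The reuse argument can be sketched as follows, assuming both memory and CPU are treated as plain countable quantities. This is an illustrative Python sketch only; `CountablePool` and its methods are hypothetical names, not part of the project.

```python
class CountablePool:
    """Hypothetical accounting for any resource measured as a single number."""

    def __init__(self, name: str, capacity: int) -> None:
        if capacity < 0:
            raise ValueError("capacity must be non-negative")
        self.name = name
        self.capacity = capacity
        self.used = 0

    @property
    def free(self) -> int:
        return self.capacity - self.used

    def reserve(self, amount: int) -> bool:
        """Reserve `amount` units; return False (and change nothing) if it does not fit."""
        if amount < 0 or amount > self.free:
            return False
        self.used += amount
        return True

    def release(self, amount: int) -> None:
        """Give back `amount` units, never dropping below zero usage."""
        self.used = max(0, self.used - amount)


# The same class serves both resources; only the name and unit differ.
memory = CountablePool("memory_mb", 64 * 1024)
cpu = CountablePool("cpu_cores", 16)
```

Under this assumption, adding CPU later amounts to creating one more pool, which is why it can be postponed until memory accounting exists.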

@w568w w568w added enhancement New feature or request complexity: high Requires fundamental changes or thorough insight into the whole project. labels Sep 30, 2024