Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add env param KV_CACHE_LOCATION to control kv cache memory numanode location #462

Open
wants to merge 4 commits into
base: main
Choose a base branch
from

Conversation

a3213105
Copy link
Contributor

@a3213105 a3213105 commented Jun 28, 2024

Usage:
before you run instance
export KV_CACHE_LOCATION=#memory_numa_node_id_you_want_to_use_for_kv_cache

by defaults, kv_cache location is the same as other parts of instance.

…ocation

Usage:
before you run instance
export KV_CACHE_LOCATION=#memory_numa_node_id_you_want_to_use_for_kv_cache

by defaults, kv_cache location is the same as other parts of instance.
@pujiang2018
Copy link
Contributor

@a3213105 we need to put the environment reading to a unified place, please put it to environment.h

@a3213105
Copy link
Contributor Author

a3213105 commented Jul 2, 2024

@a3213105 we need to put the environment reading to a unified place, please put it to environment.h

DONE

@@ -71,6 +71,9 @@ class Env {
// get Primitive Cache M
int getPrimitiveCacheM() { return primitiveCacheM; }

// get KV Cache Location
int getKVCacheLocation() { return primitiveCacheM; }
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should be kvCacheLocation? and initKVCacheLocation is never called?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

src/common/kvcache_mgr.h Outdated Show resolved Hide resolved
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants