feat: add bedrock client #1

takipipo · 2024-04-24T10:31:18Z

Feature

Add bedrock client

takipipo · 2024-04-24T10:36:13Z

src/llmperf/ray_clients/bedrock_client.py

+        body = {
+            "prompt": prompt,
+            "temperature": 0.5,
+            "top_p": 0.9,
+            "max_gen_len": 512,


Still not sure about this; please confirm.

If request_config.sampling_params is not supplied when running the command,
it is set to request_config.sampling_params = {"max_tokens": num_output_tokens}
ref:

llmperf/token_benchmark_ray.py

Lines 67 to 69 in 7512357

if not additional_sampling_params:
    additional_sampling_params = {}

llmperf/token_benchmark_ray.py

Lines 92 to 93 in 7512357

default_sampling_params = {"max_tokens": num_output_tokens}
default_sampling_params.update(additional_sampling_params)

llmperf/token_benchmark_ray.py

Lines 147 to 159 in 7512357

metadata = {
    "model": model,
    "mean_input_tokens": mean_input_tokens,
    "stddev_input_tokens": stddev_input_tokens,
    "mean_output_tokens": mean_output_tokens,
    "stddev_output_tokens": stddev_output_tokens,
    "num_concurrent_requests": num_concurrent_requests,
    "additional_sampling_params": additional_sampling_params,
}

metadata["results"] = ret

return metadata, completed_requests
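To make the default behaviour in the referenced lines concrete, here is a minimal sketch of the merge. The wrapper name `build_sampling_params` is hypothetical and used only for illustration; the two statements inside it mirror the quoted lines from token_benchmark_ray.py.

```python
def build_sampling_params(num_output_tokens, additional_sampling_params=None):
    """Hypothetical wrapper around the merge logic quoted above."""
    # No extra params passed on the command line: fall back to an empty dict.
    if not additional_sampling_params:
        additional_sampling_params = {}

    # "max_tokens" is always present by default; user-supplied params
    # are layered on top and may override it.
    default_sampling_params = {"max_tokens": num_output_tokens}
    default_sampling_params.update(additional_sampling_params)
    return default_sampling_params


if __name__ == "__main__":
    print(build_sampling_params(512))
    print(build_sampling_params(512, {"temperature": 0.2}))
```

So every client receives a "max_tokens" key unless the caller overrides it, which is why each client has to translate that key into its own parameter name.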

soln:
Map "max_tokens" to whichever parameter each client uses for the maximum token count; for example, sagemaker uses "max_new_tokens"

llmperf/src/llmperf/ray_clients/sagemaker_client.py

Lines 45 to 47 in 7512357

if "max_tokens" in sampling_params:
    sampling_params["max_new_tokens"] = sampling_params["max_tokens"]
    del sampling_params["max_tokens"]
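The same idea applied to the Bedrock request body could look roughly like the sketch below. It is not the code from this PR: the helper name `build_bedrock_body` is hypothetical, the defaults are copied from the diff under review, and passing the remaining sampling params straight through to the body is an assumption.

```python
def build_bedrock_body(prompt, sampling_params):
    """Hypothetical helper: build a request body using "max_gen_len"."""
    # Work on a copy so the caller's dict is not mutated.
    params = dict(sampling_params)

    # Defaults taken from the diff under review.
    body = {
        "prompt": prompt,
        "temperature": 0.5,
        "top_p": 0.9,
        "max_gen_len": 512,
    }

    # Rename the generic "max_tokens" key to the key used in the body.
    if "max_tokens" in params:
        body["max_gen_len"] = params.pop("max_tokens")

    # Assumption: any other sampling params override the defaults as-is.
    body.update(params)
    return body


if __name__ == "__main__":
    print(build_bedrock_body("Hello", {"max_tokens": 128}))
```

Unlike the sagemaker snippet, this sketch copies the dict first, so the request config can be reused across requests without losing its "max_tokens" entry.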

takipipo added 2 commits April 24, 2024 17:28

feat: add bedrock client

0dcb822

feat: add bedrock choice in common.py

7512357

takipipo commented Apr 24, 2024


feat(bedrock): utilize max token len from request config

251a0b4

mhokchuekchuek approved these changes Apr 25, 2024

