Bedrock now supports the use of prompt routers to choose between multiple models based on the input. This can reduce cost and latency for simple questions while still using more powerful models for more complex inputs.
When a prompt router is used, the Bedrock API will include information about the selected model alongside other response metadata, either in `response['trace']['promptRouter']` for invocation with the `Converse` endpoint or as part of the final metadata event in `event['metadata']['trace']['promptRouter']` when using the `ConverseStream` endpoint.
It would be helpful to return this metadata to the caller when accessing routed LLMs through `langchain-aws`, since the model choice determines the execution cost of the query, which may matter to the caller.