add Completions API support #88

mattf · 2024-08-16T11:05:17Z

        Create a new NVIDIA LLM for Completions APIs.

        This class provides access to a NVIDIA NIM for completions. By default, it
        connects to a hosted NIM, but can be configured to connect to a local NIM
        using the `base_url` parameter. An API key is required to connect to the
        hosted NIM.

        Args:
            model (str): The model to use for reranking.
            nvidia_api_key (str): The API key to use for connecting to the hosted NIM.
            api_key (str): Alternative to nvidia_api_key.
            base_url (str): The base URL of the NIM to connect to.

        API Key:
        - The recommended way to provide the API key is through the `NVIDIA_API_KEY`
            environment variable.

        Additional arguments that can be passed to the Completions API:
        - max_tokens (int): The maximum number of tokens to generate.
        - stop (str or List[str]): The stop sequence to use for generating completions.
        - temperature (float): The temperature to use for generating completions.
        - top_p (float): The top-p value to use for generating completions.
        - frequency_penalty (float): The frequency penalty to apply to the completion.
        - presence_penalty (float): The presence penalty to apply to the completion.
        - seed (int): The seed to use for generating completions.
        - best_of (int): The number of completions to generate and return the best of.
        - echo (bool): Whether to echo the prompt in the completion.
        - logit_bias (Dict[str, float]): The logit bias to apply to the completion.
        - logprobs (int): The number of logprobs to return.
        - n (int): The number of completions to generate.
        - suffix (str): The suffix to use for generating completions.
        - user (str): The user ID to use for generating completions.

        These additional arguments can also be passed with `bind()`, e.g.
        `NVIDIA().bind(max_tokens=512)`, or pass directly to `invoke()` or `stream()`,
        e.g. `NVIDIA().invoke("prompt", max_tokens=512)`.

dglogo · 2024-08-19T00:35:18Z

libs/ai-endpoints/docs/llms/nvidia_ai_endpoints.ipynb

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py

dglogo

Thanks for putting this together!

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py

libs/ai-endpoints/tests/unit_tests/test_completions_models.py

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py

libs/ai-endpoints/docs/llms/nvidia_ai_endpoints.ipynb

mattf requested review from dglogo and raspawar August 16, 2024 11:05

mattf self-assigned this Aug 16, 2024

add Completions API support

fdae5ab

mattf force-pushed the mattf/add-completions-support branch from d9096cf to fdae5ab Compare August 16, 2024 11:12

dglogo reviewed Aug 19, 2024

View reviewed changes

libs/ai-endpoints/docs/llms/nvidia_ai_endpoints.ipynb

Copy link

Collaborator

dglogo Aug 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

dglogo reviewed Aug 19, 2024

View reviewed changes

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py Outdated Show resolved Hide resolved

dglogo reviewed Aug 19, 2024

View reviewed changes

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py Outdated Show resolved Hide resolved

fix spelling of completions and NVIDIA

e877733

mattf requested a review from dglogo August 19, 2024 16:53

trim param docs to include only known functional params

386a42b

dglogo approved these changes Aug 20, 2024

View reviewed changes

raspawar reviewed Aug 27, 2024

View reviewed changes

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py Show resolved Hide resolved

raspawar reviewed Aug 27, 2024

View reviewed changes

libs/ai-endpoints/tests/unit_tests/test_completions_models.py Show resolved Hide resolved

raspawar reviewed Aug 27, 2024

View reviewed changes

libs/ai-endpoints/langchain_nvidia_ai_endpoints/llm.py Show resolved Hide resolved

raspawar reviewed Aug 27, 2024

View reviewed changes

libs/ai-endpoints/docs/llms/nvidia_ai_endpoints.ipynb Show resolved Hide resolved

mattf added 3 commits August 27, 2024 07:21

fix ChatNVIDA -> ChatNVIDIA

6ff6bb7

add Completions example to README

e3b290e

set default model to nvidia/mistral-nemo-minitron-8b-base

ac5d18a

mattf force-pushed the mattf/add-completions-support branch from 0b9f521 to ac5d18a Compare August 27, 2024 21:09

dglogo and others added 3 commits August 27, 2024 19:45

updated llm nb

4c351cd

add _identifying_params

a785670

add ainvoke / astream basic tests

a42e389

mattf requested a review from raspawar August 28, 2024 10:56

fix lint

f21f394

mattf merged commit 9f9b762 into main Aug 28, 2024
12 checks passed

mattf deleted the mattf/add-completions-support branch August 28, 2024 16:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add Completions API support #88

add Completions API support #88

mattf commented Aug 16, 2024

dglogo Aug 19, 2024

dglogo left a comment

add Completions API support #88

add Completions API support #88

Conversation

mattf commented Aug 16, 2024

dglogo Aug 19, 2024

Choose a reason for hiding this comment

dglogo left a comment

Choose a reason for hiding this comment