Commit

Add max_new_tokens to every generate call in genai-guide
pavel-esir committed Jul 24, 2024
1 parent 82dbefd commit ad0b59c
Showing 1 changed file with 6 additions and 6 deletions.
@@ -44,7 +44,7 @@ will not work with these instructions, make sure to
import openvino_genai as ov_genai
pipe = ov_genai.LLMPipeline(model_path, "CPU")
-print(pipe.generate("The Sun is yellow because"))
+print(pipe.generate("The Sun is yellow because", max_new_tokens=100))
.. tab-item:: C++
:sync: cpp
@@ -57,7 +57,7 @@ will not work with these instructions, make sure to
int main(int argc, char* argv[]) {
std::string model_path = argv[1];
ov::genai::LLMPipeline pipe(model_path, "CPU");
-std::cout << pipe.generate("The Sun is yellow because");
+std::cout << pipe.generate("The Sun is yellow because", ov::genai::max_new_tokens(100));
}
The `LLMPipeline` is the main object used for decoding. You can construct it directly from the
@@ -85,7 +85,7 @@ below, where a lambda function outputs words to the console immediately upon gen
pipe = ov_genai.LLMPipeline(model_path, "CPU")
streamer = lambda x: print(x, end='', flush=True)
-pipe.generate("The Sun is yellow because", streamer=streamer)
+pipe.generate("The Sun is yellow because", streamer=streamer, max_new_tokens=100)
.. tab-item:: C++

@@ -104,7 +104,7 @@ below, where a lambda function outputs words to the console immediately upon gen
// false means continue generation.
return false;
};
-pipe.generate("The Sun is yellow because", ov::genai::streamer(streamer));
+pipe.generate("The Sun is yellow because", ov::genai::streamer(streamer), ov::genai::max_new_tokens(100));
}
You can also create your custom streamer for more sophisticated processing:
@@ -132,7 +132,7 @@ You can also create your custom streamer for more sophisticated processing:
# Decode tokens and process them.
pipe = ov_genai.LLMPipeline(model_path, "CPU")
-pipe.generate("The Sun is yellow because", streamer=CustomStreamer())
+pipe.generate("The Sun is yellow because", streamer=CustomStreamer(), max_new_tokens=100)
.. tab-item:: C++
@@ -164,7 +164,7 @@ You can also create your custom streamer for more sophisticated processing:
std::string model_path = argv[1];
ov::genai::LLMPipeline pipe(model_path, "CPU");
-pipe.generate("The Sun is yellow because", ov::genai::streamer(custom_streamer));
+pipe.generate("The Sun is yellow because", ov::genai::streamer(custom_streamer), ov::genai::max_new_tokens(100));
}
Using GenAI in Chat Scenario
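
Note on the change: every `generate` call in the guide now passes an explicit `max_new_tokens` limit, including the calls that use a streamer. The sketch below illustrates how such a limit and a streamer callback can interact. It is a pure-Python simulation, not the `openvino_genai` implementation: `fake_generate` and `collect` are hypothetical names invented for this illustration, and the only behavior taken from the diff is that a streamer returning a false value means "continue generation".

```python
from typing import Callable, Iterable, List, Optional


def fake_generate(
    tokens: Iterable[str],
    max_new_tokens: int,
    streamer: Optional[Callable[[str], Optional[bool]]] = None,
) -> List[str]:
    """Hypothetical stand-in for a generate call (NOT the openvino_genai API).

    Emits tokens one by one and stops when either the token budget is
    exhausted or the streamer returns a true value.
    """
    produced: List[str] = []
    for token in tokens:
        # Stop once the budget of newly generated tokens is used up.
        if len(produced) >= max_new_tokens:
            break
        produced.append(token)
        # A falsy return (False or None) means "continue generation".
        if streamer is not None and streamer(token):
            break
    return produced


# Pretend these words are what a model would generate.
words = "The Sun is yellow because of how air scatters light".split()

collected: List[str] = []


def collect(token: str) -> bool:
    """Hypothetical streamer: record each token and never request a stop."""
    collected.append(token)
    return False


# The budget, not the streamer, ends this run after four tokens.
capped = fake_generate(words, max_new_tokens=4, streamer=collect)

# Here the streamer requests a stop before the budget is reached.
stopped = fake_generate(words, max_new_tokens=100, streamer=lambda t: t == "is")
```

In this simulation the two stopping conditions are independent, so an explicit limit bounds the output length even when the streamer never asks to stop.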