Skip to content

Commit

Permalink
[Doc]Add asynchronous engine arguments to documentation. (vllm-projec…
Browse files Browse the repository at this point in the history
…t#3810)

Co-authored-by: Simon Mo <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
  • Loading branch information
3 people authored Apr 5, 2024
1 parent 94ec8b0 commit 68c595a
Showing 1 changed file with 16 additions and 0 deletions.
16 changes: 16 additions & 0 deletions docs/source/models/engine_args.rst
Original file line number Diff line number Diff line change
Expand Up @@ -118,3 +118,19 @@ Below, you can find an explanation of every engine argument for vLLM:
.. option:: --quantization (-q) {awq,squeezellm,None}

Method used to quantize the weights.

Async Engine Arguments
----------------------
Below are the additional arguments related to the asynchronous engine:

.. option:: --engine-use-ray

Use Ray to start the LLM engine in a separate process as the server process.

.. option:: --disable-log-requests

Disable logging requests.

.. option:: --max-log-len

Max number of prompt characters or prompt ID numbers being printed in log. Defaults to unlimited.

0 comments on commit 68c595a

Please sign in to comment.