vllm-project · ywang96 · Apr 5, 2024 · Apr 3, 2024 · Apr 3, 2024 · Apr 4, 2024
diff --git a/docs/source/models/engine_args.rst b/docs/source/models/engine_args.rst
@@ -118,3 +118,19 @@ Below, you can find an explanation of every engine argument for vLLM:
 .. option:: --quantization (-q) {awq,squeezellm,None}
 
     Method used to quantize the weights.
+
+Async Engine Arguments
+----------------------
+Below are the arguments specific to the asynchronous engine:
+
+.. option:: --engine-use-ray
+
+    Use Ray to start the LLM engine in a process separate from the server process.
+
+.. option:: --disable-log-requests
+
+    Disable request logging.
+
+.. option:: --max-log-len
+
+    Maximum number of prompt characters or prompt token IDs printed in the log. Defaults to unlimited.