Skip to content

[Continious Batching] Speculative decoding based on paged attention #1163

[Continious Batching] Speculative decoding based on paged attention

[Continious Batching] Speculative decoding based on paged attention #1163

Triggered via pull request July 31, 2024 12:53
Status Cancelled
Total duration 5m 22s
Artifacts

genai_package.yml

on: pull_request
Matrix: macos_genai_package
Matrix: ubuntu_genai_package
Matrix: windows_genai_package
Fit to window
Zoom out
Zoom in

Annotations

12 errors
windows_genai_package (Debug)
Canceling since a higher priority waiting request for 'genai_package-speculative_decoding' exists
windows_genai_package (Debug)
The operation was canceled.
ubuntu_genai_package (Release)
Canceling since a higher priority waiting request for 'genai_package-speculative_decoding' exists
ubuntu_genai_package (Release)
The operation was canceled.
ubuntu_genai_package (Debug)
Canceling since a higher priority waiting request for 'genai_package-speculative_decoding' exists
ubuntu_genai_package (Debug)
The operation was canceled.
windows_genai_package (Release)
Canceling since a higher priority waiting request for 'genai_package-speculative_decoding' exists
windows_genai_package (Release)
The operation was canceled.
macos_genai_package (Release)
Canceling since a higher priority waiting request for 'genai_package-speculative_decoding' exists
macos_genai_package (Release)
The operation was canceled.
macos_genai_package (Debug)
Canceling since a higher priority waiting request for 'genai_package-speculative_decoding' exists
macos_genai_package (Debug)
The operation was canceled.