Back to pull request #4538

CUDA: Faster Mixtral prompt processing #6962

Sign in to view logs

Triggered via pull request December 20, 2023 13:53

JohannesGaessler

synchronize #4538

JohannesGaessler:cuda-mixtral-pp-2

Status Success

Total duration 46m 12s

Artifacts –

build.yml

on: pull_request

Matrix: windows-latest-cmake-cublas

Matrix: windows-latest-cmake

ubuntu-focal-make

ubuntu-latest-cmake

macOS-latest-make

macOS-latest-cmake

macOS-latest-cmake-ios

macOS-latest-cmake-tvos

ios-xcode-build

Matrix: macOS-latest-swift

Matrix: ubuntu-latest-cmake-mpi

Matrix: ubuntu-latest-cmake-sanitizer

Annotations

1 error

windows-latest-cmake (avx512, -DLLAMA_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DLLAMA_AVX512=ON -DBUIL...

Process completed with exit code 1.