CUDA: Faster Mixtral prompt processing #6962
Triggered via pull request
December 20, 2023 13:53
Status
Success
Total duration
46m 12s
Artifacts
–
build.yml
on: pull_request
Matrix: windows-latest-cmake-cublas
Matrix: windows-latest-cmake
ubuntu-focal-make
1m 34s
ubuntu-latest-cmake
1m 48s
macOS-latest-make
3m 22s
macOS-latest-cmake
5m 45s
macOS-latest-cmake-ios
1m 38s
macOS-latest-cmake-tvos
1m 15s
ios-xcode-build
2m 8s
Matrix: macOS-latest-swift
Matrix: ubuntu-latest-cmake-mpi
Matrix: ubuntu-latest-cmake-sanitizer
release
0s
Annotations
1 error
windows-latest-cmake (avx512, -DLLAMA_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DLLAMA_AVX512=ON -DBUIL...
Process completed with exit code 1.
|