Thank you @fabse
Package Details: llama.cpp-cuda b9305-1
| Git Clone URL: | https://aur.archlinux.org/llama.cpp-cuda.git (read-only) |
|---|---|
| Package Base: | llama.cpp-cuda |
| Description: | Port of Facebook's LLaMA model in C/C++ (with NVIDIA CUDA optimizations) |
| Upstream URL: | https://github.com/ggml-org/llama.cpp |
| Licenses: | MIT |
| Conflicts: | ggml, libggml, llama.cpp |
| Provides: | llama.cpp |
| Submitter: | txtsd |
| Maintainer: | fabse |
| Last Packager: | fabse |
| Votes: | 17 |
| Popularity: | 1.77 |
| First Submitted: | 2024-10-26 20:17 (UTC) |
| Last Updated: | 2026-05-24 12:33 (UTC) |
Dependencies (18)
- cuda (cuda11.1 [AUR], cuda-12.2 [AUR], cuda12.0 [AUR], cuda11.4 [AUR], cuda-12.5 [AUR], cuda-12.9 [AUR], cuda-12.8 [AUR], cuda-pascal [AUR])
- curl (curl-git [AUR], curl-c-ares [AUR])
- gcc-libs (gcc-libs-git [AUR], gccrs-libs-git [AUR], gcc-libs-snapshot [AUR])
- glibc (glibc-git [AUR], glibc-eac [AUR], glibc-git-native-pgo [AUR])
- nvidia-utils (nvidia-410xx-utils [AUR], nvidia-440xx-utils [AUR], nvidia-430xx-utils [AUR], nvidia-340xx-utils [AUR], nvidia-510xx-utils [AUR], nvidia-utils-tesla [AUR], nvidia-575xx-utils [AUR], nvidia-340xx-utils-macbook [AUR], nvidia-535xx-utils [AUR], nvidia-utils-beta [AUR], nvidia-470xx-utils [AUR], nvidia-390xx-utils [AUR], nvidia-550xx-utils [AUR], nvidia-580xx-utils [AUR], nvidia-vulkan-utils [AUR], nvidia-525xx-utils [AUR])
- python
- cmake (cmake3 [AUR], cmake-git [AUR]) (make)
- cudnn (cudnn9.10-cuda12.9 [AUR], cudnn-pascal [AUR]) (make)
- git (git-git [AUR], git-gl [AUR], git-wd40 [AUR]) (make)
- ninja (ninja-git [AUR], ninja-mem [AUR], ninja-noemacs-git [AUR], ninja-kitware [AUR], ninja-fuchsia-git [AUR], n2-ninja-symlink [AUR]) (make)
- shaderc (shaderc-git [AUR]) (make)
- nccl (nccl-cuda12.9 [AUR], nccl-git [AUR]) (optional) – needed for multi-GPU parallelism
- python-gguf [AUR] (python-gguf-git [AUR]) (optional) – needed for convert_hf_to_gguf.py
- python-numpy (python-numpy-git [AUR], python-numpy-mkl-bin [AUR], python-numpy1 [AUR], python-numpy-mkl-tbb [AUR], python-numpy-mkl [AUR]) (optional) – needed for convert_hf_to_gguf.py
- python-pytorch (python-pytorch-cuda12.9 [AUR], python-pytorch-opt-cuda12.9 [AUR], python-pytorch-cuda, python-pytorch-opt, python-pytorch-opt-cuda, python-pytorch-opt-rocm, python-pytorch-rocm) (optional) – needed for convert_hf_to_gguf.py
- python-safetensors (optional) – needed for convert_hf_to_gguf.py
- python-sentencepiece [AUR] (python-sentencepiece-git [AUR], python-sentencepiece-bin [AUR]) (optional) – needed for convert_hf_to_gguf.py
- python-transformers [AUR] (python-transformers-git [AUR]) (optional) – needed for convert_hf_to_gguf.py
Required by (5)
- llamaman-bin (requires llama.cpp) (optional)
- scmd-bin (requires llama.cpp)
- voxd (requires llama.cpp) (optional)
- voxd-bin (requires llama.cpp) (optional)
- voxd-git (requires llama.cpp) (optional)
Sources (3)
tuananh commented on 2026-05-24 15:01 (UTC)
fabse commented on 2026-05-24 12:37 (UTC) (edited on 2026-05-24 13:06 (UTC) by fabse)
also, I noticed in the git version of this package that CUDA apparently isn't compiled natively by default? It can be toggled, though, like in this commit
I'm not using this package myself, but do y'all want that enabled?
fabse commented on 2026-05-24 12:33 (UTC)
sorry about the delay, I was waiting for this PR to be merged, which simplified building the web UI, but now it should be working! lemme know if it isn't :)
shafish commented on 2026-05-23 10:43 (UTC)
If you don't want to fiddle with the UI issues, just use llama.cpp-cuda-git.
tuananh commented on 2026-05-22 15:26 (UTC)
web ui is broken. gotta add -DLLAMA_BUILD_WEBUI=ON
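For anyone patching the PKGBUILD locally, here is a rough sketch of where such a flag would go. The flag `-DLLAMA_BUILD_WEBUI=ON` is taken from the comment above as-is and has not been checked against upstream's CMake options; the function name `llama_cmake_flags` and the other two flags are assumptions for illustration, not the package's actual build configuration.

```shell
# Hypothetical helper: print the CMake flags one per line so they are easy to review.
# Only -DLLAMA_BUILD_WEBUI=ON comes from the comment; the rest is an assumed baseline.
llama_cmake_flags() {
  printf '%s\n' \
    -DCMAKE_BUILD_TYPE=Release \
    -DGGML_CUDA=ON \
    -DLLAMA_BUILD_WEBUI=ON
}
```

Inside a `build()` function this could be used as `cmake -S llama.cpp -B build -G Ninja $(llama_cmake_flags)`, where the source and build directory names are likewise placeholders.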
doaxan commented on 2026-05-20 12:30 (UTC)
Web UI is broken for b9245-1: https://github.com/ggml-org/llama.cpp/issues/23379 https://github.com/ggml-org/llama.cpp/issues/23105
fabse commented on 2026-05-07 18:51 (UTC)
oh right the config should be kept between installs, it will be included in the next release
alegru commented on 2026-05-07 14:47 (UTC)
Can you add backup=("etc/conf.d/llama.cpp") to PKGBUILD?
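As a sketch of what is being requested: the `backup` array in a PKGBUILD lists files that pacman should preserve across upgrades and removal (a locally modified file is kept, and the packaged version is written next to it as `.pacnew`). Entries are paths without a leading slash. Only the array line itself comes from the comment; where it sits in this package's PKGBUILD is an assumption.

```shell
# PKGBUILD fragment (PKGBUILDs are bash). Paths in backup=() omit the leading slash.
# Keeps the user's edited /etc/conf.d/llama.cpp instead of overwriting it on upgrade.
backup=("etc/conf.d/llama.cpp")
```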
fabse commented on 2026-05-06 20:09 (UTC)
oh nice, thanks for letting me know, I can confirm the workaround is not needed anymore and is now removed (will be pushed here on the next release)
svenstaro commented on 2026-05-06 17:54 (UTC)
FYI gcc15 is now in [extra] and after re-sourcing your profile.d, $NVCC_CCBIN should correctly point to /usr/bin/g++-15 and all the CUDA stuff should just work.
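A quick way to confirm this before building is to check the variable in the shell that will run makepkg. The sketch below is only an illustration: `check_nvcc_ccbin` is a hypothetical helper, not something shipped by the cuda package, and it only verifies that the variable is set and points at an executable file.

```shell
# Hypothetical helper: succeed only if NVCC_CCBIN is set and names an executable file.
check_nvcc_ccbin() {
  if [ -z "${NVCC_CCBIN:-}" ]; then
    echo "NVCC_CCBIN is not set; re-source your profile.d scripts or log in again" >&2
    return 1
  fi
  if [ ! -f "$NVCC_CCBIN" ] || [ ! -x "$NVCC_CCBIN" ]; then
    echo "NVCC_CCBIN=$NVCC_CCBIN is not an executable file" >&2
    return 1
  fi
  echo "NVCC_CCBIN=$NVCC_CCBIN"
}
```

After re-sourcing the profile scripts, running `check_nvcc_ccbin` should print the compiler path mentioned above if everything is set up as described.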
Pinned Comments
txtsd commented on 2024-10-26 20:17 (UTC) (edited on 2024-12-06 14:15 (UTC) by txtsd)
Alternate versions
llama.cpp
llama.cpp-vulkan
llama.cpp-sycl-fp16
llama.cpp-sycl-fp32
llama.cpp-cuda
llama.cpp-cuda-f16
llama.cpp-hip