Package Details: python-vllm-cuda 0.20.1-1

Git Clone URL: https://aur.archlinux.org/python-vllm-cuda.git (read-only, click to copy)
Package Base: python-vllm-cuda
Description: high-throughput and memory-efficient inference and serving engine for LLMs
Upstream URL: https://github.com/vllm-project/vllm
Licenses: Apache-2.0
Conflicts: python-vllm
Provides: python-vllm
Submitter: envolution
Maintainer: ncihnegn
Last Packager: ncihnegn
Votes: 2
Popularity: 0.68
First Submitted: 2024-12-01 16:12 (UTC)
Last Updated: 2026-05-04 07:10 (UTC)

Pinned Comments

Latest Comments

ganelon commented on 2026-05-10 02:07 (UTC)

@Strykar It got past that part when I manually cloned it rather than using yay. However, it is not yet done building... we'll see

Strykar commented on 2026-04-30 23:50 (UTC)

Build fails at:

/home/strykar/.cache/aurutils/sync/python-vllm-cuda/PKGBUILD: line 94: cd: .deps/vllm-flash-attn-src: No such file or directory
==> ERROR: A failure occurred in prepare().
    Aborting...
FAILED: [code=4] python-vllm-cuda.stamp 

Is this even required anymore?

envolution commented on 2025-02-12 21:30 (UTC)

@Sherlock-Holo thanks for your report, it's been added to makedepends

Sherlock-Holo commented on 2025-02-11 09:28 (UTC) (edited on 2025-02-11 09:30 (UTC) by Sherlock-Holo)

when build this package, it says

Traceback (most recent call last):
  File "/home/sherlock/.cache/yay/python-vllm-cuda/src/vllm/setup.py", line 18, in <module>
    from setuptools_scm import get_version
ModuleNotFoundError: No module named 'setuptools_scm'

if add the miss makedepends python-setuptools-scm, it will fail with

Traceback (most recent call last):
  File "/home/sherlock/.cache/yay/python-vllm-cuda/src/vllm/setup.py", line 633, in <module>
    version=get_vllm_version(),
            ~~~~~~~~~~~~~~~~^^
  File "/home/sherlock/.cache/yay/python-vllm-cuda/src/vllm/setup.py", line 527, in get_vllm_version
    raise RuntimeError("Unknown runtime environment")
RuntimeError: Unknown runtime environment

envolution commented on 2024-12-28 04:47 (UTC) (edited on 2024-12-28 04:51 (UTC) by envolution)

Not working currently due to lack of python 3.13 support in vllm-flash-attention. Try python-vllm-bin or the cpu version python-vllm