Search Criteria
Package Details: python-vllm-rocm 0.21.0-1
Package Actions
| Git Clone URL: | https://aur.archlinux.org/python-vllm-rocm.git (read-only, click to copy) |
|---|---|
| Package Base: | python-vllm-rocm |
| Description: | high-throughput and memory-efficient inference and serving engine for LLMs (ROCm support) |
| Upstream URL: | https://github.com/vllm-project/vllm |
| Licenses: | Apache-2.0 |
| Provides: | python-vllm |
| Submitter: | davispuh |
| Maintainer: | davispuh |
| Last Packager: | davispuh |
| Votes: | 3 |
| Popularity: | 0.88 |
| First Submitted: | 2026-02-24 22:16 (UTC) |
| Last Updated: | 2026-05-23 21:08 (UTC) |
Dependencies (58)
- amdsmi (opencl-amdAUR, rocm-binAUR, rocm-gfx101x-binAUR, rocm-gfx103x-binAUR, rocm-gfx110x-binAUR, rocm-gfx120x-binAUR, rocm-gfx1150-binAUR, rocm-gfx1151-binAUR, rocm-gfx1152-binAUR, rocm-nightly-gfx120x-all-binAUR, rocm-nightly-gfx1151-binAUR, rocm-nightly-gfx110x-binAUR)
- numactl (numactl-gitAUR)
- python-aiohttp
- python-blake3AUR
- python-cachetools
- python-cbor2
- python-cloudpickle
- python-diskcacheAUR
- python-einopsAUR
- python-fastapi
- python-ggufAUR (python-gguf-gitAUR)
- python-huggingface-hub (python-huggingface-hub-gitAUR)
- python-ijson
- python-importlib-metadata
- python-mistral-commonAUR (python-mistral-common-gitAUR)
- python-msgspec
- python-openai
- python-opencv (python-opencv-cuda)
- python-partial-json-parserAUR (python-partial-json-parser-gitAUR)
- python-prometheus-fastapi-instrumentatorAUR
- Show 38 more dependencies...
Required by (6)
- python-pydantic-ai (requires python-vllm) (optional)
- python-pydantic-ai-slim (requires python-vllm) (optional)
- python-vllm-omni (requires python-vllm)
- python-vllm-omni (optional)
- python-vllm-omni-git (requires python-vllm)
- python-vllm-omni-git (optional)
Latest Comments
« First ‹ Previous 1 2
cmhacks commented on 2026-03-26 10:52 (UTC)
Hi, thanks for maintaining this package!
Currently, PYTORCH_ROCM_ARCH is set to compile for all 8 GPU architectures (gfx906, gfx908, gfx90a, gfx942, gfx1100, gfx1101, gfx1200, gfx1201), which results in extremely long build times — often over an hour — since every HIP kernel is compiled once per target.
Most users only have one GPU and only need a single architecture. Would it be possible to either:
Split into per-architecture packages (e.g., python-vllm-rocm-gfx906, python-vllm-rocm-gfx1201, etc.) so users can install only the one matching their hardware, or Auto-detect the system GPU at build time using rocminfo or AMDGPU_TARGETS to compile only for the installed hardware?
This would drastically reduce build times (up to ~8x faster) and resource usage for end users.
Thanks for considering this!
« First ‹ Previous 1 2