Search Criteria
Package Details: python-flash-attn 2.8.3-1
Package Actions
| Git Clone URL: | https://aur.archlinux.org/python-flash-attn.git (read-only, click to copy) |
|---|---|
| Package Base: | python-flash-attn |
| Description: | Fast and memory-efficient exact attention |
| Upstream URL: | https://github.com/Dao-AILab/flash-attention |
| Licenses: | BSD-3-Clause |
| Provides: | python-flash-attention |
| Submitter: | Smoolak |
| Maintainer: | Smoolak |
| Last Packager: | Smoolak |
| Votes: | 0 |
| Popularity: | 0.000000 |
| First Submitted: | 2025-12-11 01:40 (UTC) |
| Last Updated: | 2025-12-11 01:40 (UTC) |
Dependencies (11)
- python-einopsAUR
- python-pytorch-cuda (python-pytorch-cuda12.9AUR, python-pytorch-opt-cuda12.9AUR, python-pytorch-opt-cuda)
- cuda (cuda11.1AUR, cuda-12.2AUR, cuda12.0AUR, cuda11.4AUR, cuda-12.5AUR, cuda-12.9AUR, cuda-12.8AUR, cuda-pascalAUR) (make)
- git (git-gitAUR, git-glAUR, git-wd40AUR) (make)
- ninja (ninja-gitAUR, ninja-memAUR, ninja-noemacs-gitAUR, ninja-kitwareAUR, ninja-fuchsia-gitAUR, n2-ninja-symlinkAUR) (make)
- python-build (make)
- python-installer (make)
- python-packaging (make)
- python-psutil (make)
- python-setuptools (make)
- python-wheel (make)
Required by (1)
- python-nanotron (optional)
Latest Comments
graysky commented on 2026-03-28 15:48 (UTC)
You need swap. I have 96G of RAM and it also consumed all of it. I created a 256G swap file which peaked at 40-50G.
You can also save building all supported arches by adding:
Where x = (compute capability x 10) of your specific card. For example, the CC on a 4090 is 8.9 so 8.9x10 = 89.
See: https://developer.nvidia.com/cuda-gpus for the other architectures.
zwastik commented on 2026-02-14 04:13 (UTC)
I can't compile this package, it swallows all my 64GB of ram and crashes. I tried putting
in the build() section but I did not made a difference.