@luciddream Yes, it’s a test I made to compare and test the performance of OpenCL. As soon as I solve some issues, I’ll publish it for the community. :)
Search Criteria
Package Details: opencl-amd-dev 1:7.1.1-1
Package Actions
| Git Clone URL: | https://aur.archlinux.org/opencl-amd-dev.git (read-only, click to copy) |
|---|---|
| Package Base: | opencl-amd-dev |
| Description: | ROCm extra runtime and developer packages. This package needs 14.03GB of disk space. |
| Upstream URL: | http://www.amd.com |
| Licenses: | custom:AMD |
| Conflicts: | composablekernel-dev, hip-dev, hip-doc, hip-samples, hipblas, hipblas-common, hipblas-common-dev, hipblas-dev, hipblaslt, hipblaslt-dev, hipcc, hipcub, hipcub-dev, hipfft, hipfft-dev, hipfort, hipfort-dev, hipify-clang, hiprand, hiprand-dev, hipsolver, hipsolver-dev, hipsparse, hipsparse-dev, hipsparselt, hipsparselt-dev, hiptensor, hiptensor-dev, migraphx, migraphx-dev, miopen, miopen-hip, miopen-hip-dev, mivisionx, mivisionx-dev, openmp-extras-dev, rccl, rccl-dev, rocalution, rocalution-dev, rocblas, rocblas-dev, rocdecode, rocdecode-dev, rocfft, rocfft-dev, rocjpeg, rocjpeg-dev, rocm-developer-tools, rocm-hip-libraries, rocm-hip-runtime-dev, rocm-hip-sdk, rocm-llvm, rocm-ml-libraries, rocm-ml-sdk, rocm-opencl-sdk, rocprim, rocprim-dev, rocprofiler-compute, rocprofiler-debug, rocprofiler-sdk, rocprofiler-sdk-rocpd, rocprofiler-sdk-roctx, rocprofiler-systems, rocrand, rocrand-dev, rocshmem, rocshmem-dev, rocsolver, rocsolver-dev, rocsparse, rocsparse-dev, rocthrust, rocthrust-dev, rocwmma-dev, rpp, rpp-dev |
| Provides: | composablekernel-dev, half, hip-dev, hip-doc, hip-samples, hipblas, hipblas-common, hipblas-common-dev, hipblas-dev, hipblaslt, hipblaslt-dev, hipcc, hipcub, hipcub-dev, hipfft, hipfft-dev, hipfort, hipfort-dev, hipify-clang, hiprand, hiprand-dev, hipsolver, hipsolver-dev, hipsparse, hipsparse-dev, hipsparselt, hipsparselt-dev, hiptensor, hiptensor-dev, migraphx, migraphx-dev, miopen, miopen-hip, miopen-hip-dev, mivisionx, mivisionx-dev, openmp-extras-dev, rccl, rccl-dev, rocalution, rocalution-dev, rocblas, rocblas-dev, rocdecode, rocdecode-dev, rocfft, rocfft-dev, rocjpeg, rocjpeg-dev, rocm-developer-tools, rocm-hip-libraries, rocm-hip-runtime-dev, rocm-hip-sdk, rocm-llvm, rocm-ml-libraries, rocm-ml-sdk, rocm-opencl-sdk, rocprim, rocprim-dev, rocprofiler-compute, rocprofiler-debug, rocprofiler-sdk, rocprofiler-sdk-rocpd, rocprofiler-sdk-roctx, rocprofiler-systems, rocrand, rocrand-dev, rocshmem, rocshmem-dev, rocsolver, rocsolver-dev, rocsparse, rocsparse-dev, rocthrust, rocthrust-dev, rocwmma-dev, rpp, rpp-dev |
| Submitter: | luciddream |
| Maintainer: | luciddream |
| Last Packager: | luciddream |
| Votes: | 11 |
| Popularity: | 0.97 |
| First Submitted: | 2021-12-26 15:01 (UTC) |
| Last Updated: | 2025-11-27 16:26 (UTC) |
Dependencies (5)
- blas (blis-cblas-openmpAUR, blis-cblasAUR, aocl-blis-aoccAUR, atlas-lapackAUR, blas-aocl-gccAUR, blas-aocl-aoccAUR, blas-mklAUR, blas-openblas-gitAUR, aocl-blisAUR, openblas-lapackAUR, blas-gitAUR, blas-openblas)
- lapack (aocl-libflame-aoccAUR, atlas-lapackAUR, blas-aocl-gccAUR, blas-aocl-aoccAUR, blas-mklAUR, aocl-libflameAUR, blas-openblas-gitAUR, openblas-lapackAUR, lapack-gitAUR, blas-openblas)
- opencl-amdAUR (amdapp-sdkAUR)
- suitesparse
- wget (wget-gitAUR) (make)
Required by (173)
- adaptivecpp (requires rocm-llvm) (make)
- adaptivecpp-git (requires rocm-llvm) (make)
- darknet-hankai-git (requires rocm-hip-sdk) (optional)
- ggml-hipblas-git (requires hipblas)
- ggml-hipblas-git (requires rocm-hip-sdk) (make)
- ginkgo-hpc-cuda-git (requires hipsparse) (make)
- ginkgo-hpc-cuda-git (requires hipblas) (make)
- ginkgo-hpc-cuda-git (requires rocthrust) (make)
- ginkgo-hpc-cuda-git (requires hipfft) (make)
- ginkgo-hpc-cuda-git (requires hiprand) (make)
- ginkgo-hpc-docs-git (requires rocthrust) (make)
- ginkgo-hpc-docs-git (requires hipfft) (make)
- ginkgo-hpc-docs-git (requires hiprand) (make)
- ginkgo-hpc-docs-git (requires hipsparse) (make)
- ginkgo-hpc-docs-git (requires hipblas) (make)
- ginkgo-hpc-git (requires hipsparse) (make)
- ginkgo-hpc-git (requires hipblas) (make)
- ginkgo-hpc-git (requires rocthrust) (make)
- ginkgo-hpc-git (requires hipfft) (make)
- ginkgo-hpc-git (requires hiprand) (make)
- Show 153 more...
Sources (68)
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/c/composablekernel-dev/composablekernel-dev_1.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/half/half_1.12.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hip-dev/hip-dev_7.1.52802.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hip-doc/hip-doc_7.1.52802.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hip-samples/hip-samples_7.1.52802.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipblas-common-dev/hipblas-common-dev_1.3.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipblas-dev/hipblas-dev_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipblas/hipblas_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipblaslt-dev/hipblaslt-dev_1.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipblaslt/hipblaslt_1.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipcc/hipcc_1.1.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipcub-dev/hipcub-dev_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipfft-dev/hipfft-dev_1.0.21.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipfft/hipfft_1.0.21.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipfort-dev/hipfort-dev_0.7.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipify-clang/hipify-clang_20.0.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hiprand-dev/hiprand-dev_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hiprand/hiprand_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipsolver-dev/hipsolver-dev_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipsolver/hipsolver_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipsparse-dev/hipsparse-dev_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipsparse/hipsparse_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipsparselt-dev/hipsparselt-dev_0.2.5.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hipsparselt/hipsparselt_0.2.5.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hiptensor-dev/hiptensor-dev_2.0.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/h/hiptensor/hiptensor_2.0.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/m/migraphx-dev/migraphx-dev_2.14.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/m/migraphx/migraphx_2.14.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/m/miopen-hip-dev/miopen-hip-dev_3.5.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/m/miopen-hip/miopen-hip_3.5.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/m/mivisionx-dev/mivisionx-dev_3.4.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/m/mivisionx/mivisionx_3.4.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/o/openmp-extras-dev/openmp-extras-dev_20.70.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/o/openmp-extras-runtime/openmp-extras-runtime_20.70.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rccl-dev/rccl-dev_2.27.7.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rccl/rccl_2.27.7.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocalution-dev/rocalution-dev_4.0.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocalution/rocalution_4.0.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocblas-dev/rocblas-dev_5.1.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocblas/rocblas_5.1.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocdecode-dev/rocdecode-dev_1.4.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocdecode/rocdecode_1.4.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocfft-dev/rocfft-dev_1.0.35.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocfft/rocfft_1.0.35.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocjpeg-dev/rocjpeg-dev_1.2.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocjpeg/rocjpeg_1.2.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocm-cmake/rocm-cmake_0.14.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocm-llvm/rocm-llvm_20.0.0.25444.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocprim-dev/rocprim-dev_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocprofiler-compute/rocprofiler-compute_3.3.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocprofiler-sdk-rocpd/rocprofiler-sdk-rocpd_1.0.0-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocprofiler-sdk-roctx/rocprofiler-sdk-roctx_1.0.0-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocprofiler-sdk/rocprofiler-sdk_1.0.0-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocprofiler-systems/rocprofiler-systems_1.2.1.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocrand-dev/rocrand-dev_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocrand/rocrand_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocshmem-dev/rocshmem-dev_3.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocsolver-dev/rocsolver-dev_3.31.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocsolver/rocsolver_3.31.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocsparse-dev/rocsparse-dev_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocsparse/rocsparse_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocthrust-dev/rocthrust-dev_4.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/roctracer-dev/roctracer-dev_4.1.70101.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/roctracer/roctracer_4.1.70101.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rocwmma-dev/rocwmma-dev_2.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rpp-dev/rpp-dev_2.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/r/rpp/rpp_2.1.0.70101-38~24.04_amd64.deb
- https://repo.radeon.com/rocm/apt/7.1.1/pool/main/t/transferbench-dev/transferbench-dev_1.64.00.70101-38~24.04_amd64.deb
cmhacks commented on 2025-11-29 09:28 (UTC)
luciddream commented on 2025-11-29 09:12 (UTC)
@cmhacks cool, is that benchmark something you have made?
cmhacks commented on 2025-11-28 22:45 (UTC) (edited on 2025-11-28 22:47 (UTC) by cmhacks)
@luciddream All working like a charm. Many thanks for your hard work dude :)
❯ ./rocm-9070xt-test
================================================================================
ULTIMATE GPU BENCHMARK SUITE
================================================================================
ROCm 7.1.1 Optimizations Applied:
• HSA_OVERRIDE_GFX_VERSION=12.0.1 (RDNA4 Fix)
• ROCM_FORCE_WAVE32=1 (MFMA Boost)
• HIP_ENABLE_GRAPHS=1 (Lower Overhead)
TIP: For maximum stability/performance, run these commands manually:
echo "manual" | sudo tee /sys/class/drm/card0/device/power_dpm_force_performance_level
echo "5" | sudo tee /sys/class/drm/card0/device/pp_power_profile_mode
--------------------------------------------------------------------------------
Detected ICDs:
• ROCm: /etc/OpenCL/vendors/AA_amdocl64.icd
• Pocl: /etc/OpenCL/vendors/pocl.icd
• Rusticl: /etc/OpenCL/vendors/ZZ_rusticl.icd
--------------------------------------------------------------------------------
--- Detected OpenCL Devices ---
[a] Run All (ROCm + Rusticl + HIP + Matrix) - Default
Select device [a]:
Running benchmarks with warmup and sustained load (min 1s)... Please wait.
=== BENCHMARK EXECUTION ===
Running ROCm (OpenCL)... Done
Running Pocl (OpenCL)... Done
Running Rusticl (OpenCL)... Done
Running HIP Vector (Shaders)... Done
Running Matrix Cores (AI)... Done
=== PERFORMANCE REPORT ===
+----------------------+----------------+----------------+----------------+----------------+----------------+
| METRIC | ROCm | Pocl | Rusticl | HIP | MATRIX (AI) |
+----------------------+----------------+----------------+----------------+----------------+----------------+
| FP64 (TFLOPS) | 0.60 | 0.05 | 0.00 | - | - |
| FP32 (TFLOPS) | 4.68 | 0.34 | 4.66 | 4.77 | - |
| FP16 (TFLOPS) | 0.54 | 0.05 | 0.57 | - | 164.00 |
| BF16 (TFLOPS) | - | - | - | 0.60 | 161.34 |
| INT64 (TOPS) | 0.08 | 0.01 | 0.07 | 0.08 | - |
| INT32 (TOPS) | 0.15 | 0.01 | 0.15 | 0.15 | - |
| INT16 (TOPS) | 0.29 | 0.03 | 0.28 | 0.28 | - |
| INT8 (TOPS) | 0.44 | 0.07 | 0.53 | 0.45 | 86.68 |
| FP8 (TFLOPS) | - | - | - | 0.58 | 83.81 |
| INT4 (TOPS) | 1.20 | 0.10 | 1.18 | 1.20 | 0.00 |
| FP4 (TFLOPS) | 1.21 | 0.10 | 1.18 | 1.21 | 0.00 |
| FP6 (TFLOPS) | - | - | - | - | 0.00 |
+----------------------+----------------+----------------+----------------+----------------+----------------+
Note: All values in TFLOPS/TOPS. 'Matrix' uses hipBLASLt.
Benchmark Complete.
::BENCHMARK_COMPLETE::
=== SYSTEM RECOMMENDATION ===
Based on the benchmark results:
• ROCm Wins: 7
• Pocl Wins: 0
• Rusticl Wins: 2
Recommendation:
✅ Use ROCm as your default OpenCL driver.
It performed better in 7 tests.
::WINNER::ROCm
(Note: For AI/ML workloads, always use HIP/ROCm regardless of OpenCL choice)
Report saved to: ./reports/rocm_benchmark_report_1764369720.txt
luciddream commented on 2025-11-27 16:29 (UTC)
hey @cmhacks, I just made a new release. It's very rushed but hopefully I didn't forget anything
cmhacks commented on 2025-11-27 14:58 (UTC)
Good day @luciddream
Eta for 7.1.1 ?
Thank you very much for your hard work! :)
LeonardoCarvalho commented on 2025-10-28 18:20 (UTC)
@luciddream I just got lost in the infinite web of the everchanging dependencies on ROCm right now, your observation is on point.
Let me try to get time and mental bandwidth to try something better for my specific scenarios.
Thanks anyway!
luciddream commented on 2025-10-19 16:42 (UTC) (edited on 2025-10-19 16:44 (UTC) by luciddream)
@LeonardoCarvalho
But what I understand from the MIOpen HIP documentation is that, these are pre-compiled kernels and will only help with the startup performance for the first run only, it doesn't affect the ability to run MIOpen, correct? There should (probably) only be warnings about them.
I'm not opposed to including anything, just trying to understand what is missing at the moment and the best way to move forward.
LeonardoCarvalho commented on 2025-10-19 16:23 (UTC)
For some specific projects (like vLLM and others that use CK) some runtimes need specific bindings.
The gfx1030 family is interesting because they are just below gfx11**, and they can do almost everything needed to serve models.
One fine example is https://rocm.docs.amd.com/projects/MIOpen/en/latest/index.html. It has specifics for gfx1030 gfx900 gfx906 gfx908 gfx90 and gfx942 GPU families.
I think that a separate AUR for each of the fast growing LLM projects AMD is maturing would only add overhead, and maybe a flag for specific chipsets would be easier to keep centralized in this one..
luciddream commented on 2025-10-18 23:06 (UTC)
Hi Leonardo! Glad to know this package is useful for you.
Do you mind explaining what these extra packages are needed for? I did a little research and I saw that usually these are not necessary to run models, but maybe they are useful.
But still, if they are necessary for a certain software there might be better ways to include them (for example making separate extra packages for each GPU model)
LeonardoCarvalho commented on 2025-10-18 14:00 (UTC)
One question:
I have a card that needs some specific packages to be added: - miopen-hip-gfx1030kdb7.0.2_3.5.0.70002-56~24.04_amd64.deb - miopen-hip-gfx1030kdb_3.5.0.70002-56~24.04_amd64.deb - miopen-hip-gfx1030kdb-rpath7.0.2_3.5.0.70002-56~24.04_amd64.deb
There are a couple more for other models, how can a flag to add them be added to the PKGBUILD?
Pinned Comments
luciddream commented on 2022-01-12 16:47 (UTC) (edited on 2025-11-27 16:27 (UTC) by luciddream)
Latest release: 7.1.1. It uses 14.61GB of disk.
Issues with current release:
rocgdbto function.