aboutsummarylogtreecommitdiffstats
path: root/next.txt
blob: 2b538911e12a3def33507806ac3565c90287c1d6 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
Make tensorflow-amd-bin package.

# Maybe
https://wiki.archlinux.org/title/Arch_packaging_standards -> reproducible
make test.py actually fail if something goes wrong
set up "auto-build" system. One script to run, log, then test. etc.
    systemd timer, stuff like that
    check latests builds and the thing from official tensorflow pointing to official rocm build
more complete testing of tensorflow. What other functions are there? can I do gpt-2?
    there are official tests... somehow...
documentation
    what environment variables can you use to suppress verbose logging?
    export TF_CPP_MAX_VLOG_LEVEL=-1 # optional; see https://github.com/RadeonOpenCompute/ROCm/issues/1594
    what about the other thing? NUMA returned -1 or something
    document other ways to install
publicity
recruit someone with a different gfx architecture to test
    actually, just post to the aur and see if there's even any interest. The Docker container works well enough.

# Notes
Working builds:
Download and run the rocm/tensorflow docker image
    https://github.com/mpeschel10/test-tensorflow-rocm

Download and install the wheel
    # For python older than 3.11, pip has builds from 3.7 to 3.10
    # find rocm last release via https://github.com/tensorflow/build#community-supported-tensorflow-builds
    # to http://ml-ci.amd.com:21096/job/tensorflow/job/release-rocmfork-r212-rocm-enhanced/job/release-build-whl/lastSuccessfulBuild
    # to http://ml-ci.amd.com:21096/job/tensorflow/job/release-rocmfork-r212-rocm-enhanced/job/release-build-whl/lastSuccessfulBuild/artifact/packages-3.11/tensorflow_rocm-2.12.0.550-cp311-cp311-manylinux2014_x86_64.whl
    curl http://ml-ci.amd.com:21096/job/tensorflow/job/release-rocmfork-r212-rocm-enhanced/job/release-build-whl/lastSuccessfulBuild/artifact/packages-3.11/tensorflow_rocm-2.12.0.550-cp311-cp311-manylinux2014_x86_64.whl > ~/Downloads/tensorflow_rocm.whl
    pip install tensorflow_rocm.whl
    
Compile it:
    Last functional build on jenkins was http://ml-ci.amd.com:21096/job/tensorflow/job/release-rocmfork-r212-rocm-enhanced/job/release-build-whl/lastSuccessfulBuild/
    Possibly from https://github.com/ROCmSoftwarePlatform/tensorflow-upstream, branch r2.12-rocm-enhanced
    revision 18ddd5aa0329993f581bdb433b999b85c15f69e3 
    copy most of the stuff from the docker build file
    makepkg
    it works!!!

test with gfx803
    does not work

that is not dead which may eternal lie,
and with strange aeons, even death may die