Setting up OpenMythos on Linux Mint 22.3

1. Overview

Claude Mythos has been getting a lot of attention. I came across news that OpenMythos, developed from the published Claude Mythos paper, has been released as open source, so I immediately tried setting up an OpenMythos environment.

2. Details

OpenMythos is PyTorch-based. I built an environment with PyTorch and CUDA 12.6 and confirmed that the import works in python3.

The environment is as follows.

HW: AMD 3200G, Memory 16GB, SSD 256GB, NVIDIA GTX 1660 SUPER
SW: Linux Mint 22.3, NVIDIA-driver-595-open, CUDA-12.6, PyTorch (CUDA 12.6 build)

(1) Disable nouveau

Create /etc/modprobe.d/blacklist-nouveau.conf with the following content:

# nvidia(nouveau)
blacklist nouveau
options nouveau modeset=0

Apply the change:

$ sudo update-initramfs -u

(2) Install the NVIDIA driver

$ ubuntu-drivers devices
$ sudo apt -y install nvidia-driver-595-open

(3) Install CUDA

$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb
$ sudo dpkg -i cuda-keyring_1.1-1_all.deb
$ sudo apt update
$ sudo apt install cuda-toolkit-12-6

Add the path settings to ~/.bashrc ( vi ~/.bashrc ):

# CUDA Toolkit
export PATH="/usr/local/cuda/bin:$PATH"
export LD_LIBRARY_PATH="/usr/local/cuda/lib64:$LD_LIBRARY_PATH"

(4) Verify operation

$ nvidia-smi
$ nvcc -V...
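
The python3 import check mentioned in the details could look roughly like the sketch below. It is only an illustration: `check_module` is a hypothetical helper name, and because the OpenMythos module name does not appear in this excerpt, the usage example checks `torch` only. `torch.cuda.is_available()` is the standard PyTorch call for asking whether a CUDA device is usable.

```python
import importlib
import importlib.util


def check_module(name: str) -> dict:
    """Report whether `name` can be imported, plus CUDA status if it exposes one.

    Hypothetical helper for illustration; not part of PyTorch or OpenMythos.
    """
    report = {
        "module": name,
        "importable": False,
        "version": None,
        "cuda_available": None,
    }
    try:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is None:
            return report
        module = importlib.import_module(name)
    except Exception:
        # A broken install (missing shared libraries, etc.) counts as not importable.
        return report

    report["importable"] = True
    report["version"] = getattr(module, "__version__", None)

    # For torch, module.cuda.is_available() reports whether a CUDA device can be used.
    cuda = getattr(module, "cuda", None)
    if cuda is not None and hasattr(cuda, "is_available"):
        report["cuda_available"] = bool(cuda.is_available())
    return report
```

Calling `check_module("torch")` after the setup steps should show `importable` as True if the install succeeded. A `cuda_available` value of False would suggest checking the driver and the CUDA paths in ~/.bashrc.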

The AMD 3200G blackout is fixed with kernel-6.17.0-22

1. Overview

kernel-6.17.0-22-generic was released today, and I applied it to my AMD 3200G PC.
I rebooted twice, and the problem has not recurred.

2. Details

I had been holding the kernel at kernel-6.14.0-37-generic, so I installed kernel-6.17.0-22-generic.
The amdgpu entries in the dmesg log do not appear to contain errors.
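
Scanning the amdgpu entries by eye is easy to get wrong, so a small filter can help. The sketch below is an assumption of mine rather than the method used in this post: `find_suspect_lines` and `ERROR_PATTERN` are hypothetical names, and the keyword list is a rough heuristic. The sample lines are copied from the attached log. Run on that log, this filter would flag the `LOAD_TA` and `vendor infoframe` lines. Whether those two messages are harmless is not established here, but they did not prevent the display from coming up.

```python
import re

# Heuristic keywords; "panic" is deliberately excluded because the normal
# message "Registered 4 planes with drm panic" would be a false positive.
ERROR_PATTERN = re.compile(r"\b(error|fail(?:ed|ure)?|timeout|hang)\b", re.IGNORECASE)

# A few lines copied from the attached dmesg output.
SAMPLE = """\
[    9.714275] [drm] amdgpu kernel modesetting enabled.
[    9.853083] amdgpu 0000:07:00.0: amdgpu: RAS: optional ras ta ucode is not available
[    9.861240] amdgpu 0000:07:00.0: amdgpu: psp gfx command LOAD_TA(0x1) failed and response status is (0x7)
[    9.979684] amdgpu 0000:07:00.0: amdgpu: [drm] Failed to setup vendor infoframe on connector HDMI-A-3: -22
"""


def find_suspect_lines(dmesg_text: str, driver: str = "amdgpu") -> list[str]:
    """Return lines that mention `driver` and contain an error-like keyword."""
    return [
        line
        for line in dmesg_text.splitlines()
        if driver in line and ERROR_PATTERN.search(line)
    ]
```

To apply it to a live system, pass the output of `dmesg` (which may require root on some configurations) to `find_suspect_lines` as a single string.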

References
[See also on this blog]
[Solved] AMD 3200G blacks out on kernel-6.17

Attachments

$ uname -a
Linux asrock2 6.17.0-22-generic #22~24.04.1-Ubuntu SMP PREEMPT_DYNAMIC Thu Mar 26 15:25:54 UTC 2 x86_64 x86_64 x86_64 GNU/Linux

$ ls -l /boot/initrd.img-6*
-rw-r--r-- 1 root root 85167754  4月 19 05:47 /boot/initrd.img-6.14.0-37-generic
-rw-r--r-- 1 root root 85786485  4月 21 02:47 /boot/initrd.img-6.17.0-22-generic

$ dmesg | grep amdgpu
[    9.714275] [drm] amdgpu kernel modesetting enabled.
[    9.714419] amdgpu: Virtual CRAT table created for CPU
[    9.714430] amdgpu: Topology: Add CPU node
[    9.714655] amdgpu 0000:07:00.0: amdgpu: initializing kernel modesetting (RAVEN 0x1002:0x15D8 0x1002:0x15D8 0xC9).
[    9.714671] amdgpu 0000:07:00.0: amdgpu: register mmio base: 0xF7500000
[    9.714674] amdgpu 0000:07:00.0: amdgpu: register mmio size: 524288
[    9.716545] amdgpu 0000:07:00.0: amdgpu: detected ip block number 0 <soc15_common>
[    9.716549] amdgpu 0000:07:00.0: amdgpu: detected ip block number 1 <gmc_v9_0>
[    9.716551] amdgpu 0000:07:00.0: amdgpu: detected ip block number 2 <vega10_ih>
[    9.716553] amdgpu 0000:07:00.0: amdgpu: detected ip block number 3 <psp>
[    9.716555] amdgpu 0000:07:00.0: amdgpu: detected ip block number 4 <powerplay>
[    9.716557] amdgpu 0000:07:00.0: amdgpu: detected ip block number 5 <dm>
[    9.716559] amdgpu 0000:07:00.0: amdgpu: detected ip block number 6 <gfx_v9_0>
[    9.716561] amdgpu 0000:07:00.0: amdgpu: detected ip block number 7 <sdma_v4_0>
[    9.716562] amdgpu 0000:07:00.0: amdgpu: detected ip block number 8 <vcn_v1_0>
[    9.740394] amdgpu 0000:07:00.0: amdgpu: Fetched VBIOS from ROM BAR
[    9.740403] amdgpu: ATOM BIOS: 113-PICASSO-115
[    9.755219] amdgpu 0000:07:00.0: vgaarb: deactivate vga console
[    9.756351] amdgpu 0000:07:00.0: amdgpu: Trusted Memory Zone (TMZ) feature enabled
[    9.756416] amdgpu 0000:07:00.0: amdgpu: vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
[    9.756425] amdgpu 0000:07:00.0: amdgpu: VRAM: 512M 0x000000F400000000 - 0x000000F41FFFFFFF (512M used)
[    9.756427] amdgpu 0000:07:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
[    9.756642] amdgpu 0000:07:00.0: amdgpu: amdgpu: 512M of VRAM memory ready
[    9.756646] amdgpu 0000:07:00.0: amdgpu: amdgpu: 7707M of GTT memory ready.
[    9.765118] amdgpu: hwmgr_sw_init smu backed is smu10_smu
[    9.767630] amdgpu 0000:07:00.0: amdgpu: Found VCN firmware Version ENC: 1.15 DEC: 3 VEP: 0 Revision: 0
[    9.788543] amdgpu 0000:07:00.0: amdgpu: reserve 0x400000 from 0xf41f800000 for PSP TMR
[    9.853083] amdgpu 0000:07:00.0: amdgpu: RAS: optional ras ta ucode is not available
[    9.858052] amdgpu 0000:07:00.0: amdgpu: RAP: optional rap ta ucode is not available
[    9.861240] amdgpu 0000:07:00.0: amdgpu: psp gfx command LOAD_TA(0x1) failed and response status is (0x7)
[    9.862889] amdgpu 0000:07:00.0: amdgpu: [drm] Display Core v3.2.340 initialized on DCN 1.0
[    9.911719] snd_hda_intel 0000:07:00.1: bound 0000:07:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
[    9.953459] amdgpu 0000:07:00.0: amdgpu: kiq ring mec 2 pipe 1 q 0
[    9.967968] kfd kfd: amdgpu: Allocated 3969056 bytes on gart
[    9.967988] kfd kfd: amdgpu: Total number of KFD nodes to be created: 1
[    9.968143] amdgpu: Virtual CRAT table created for GPU
[    9.968219] amdgpu: Topology: Add dGPU node [0x15d8:0x1002]
[    9.968222] kfd kfd: amdgpu: added device 1002:15d8
[    9.968236] amdgpu 0000:07:00.0: amdgpu: SE 1, SH per SE 1, CU per SH 11, active_cu_number 8
[    9.968241] amdgpu 0000:07:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
[    9.968243] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[    9.968245] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[    9.968247] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[    9.968249] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[    9.968250] amdgpu 0000:07:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[    9.968252] amdgpu 0000:07:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[    9.968254] amdgpu 0000:07:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[    9.968255] amdgpu 0000:07:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[    9.968257] amdgpu 0000:07:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0
[    9.968259] amdgpu 0000:07:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 8
[    9.968261] amdgpu 0000:07:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 8
[    9.968262] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 8
[    9.968264] amdgpu 0000:07:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 8
[    9.968266] amdgpu 0000:07:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 8
[    9.974475] amdgpu: pp_dpm_get_sclk_od was not implemented.
[    9.974477] amdgpu: pp_dpm_get_mclk_od was not implemented.
[    9.974605] amdgpu 0000:07:00.0: amdgpu: Runtime PM not available
[    9.974977] amdgpu 0000:07:00.0: [drm] Registered 4 planes with drm panic
[    9.974979] [drm] Initialized amdgpu 3.64.0 for 0000:07:00.0 on minor 1
[    9.979684] amdgpu 0000:07:00.0: amdgpu: [drm] Failed to setup vendor infoframe on connector HDMI-A-3: -22 
[    9.982717] fbcon: amdgpudrmfb (fb0) is primary device
[   10.081045] amdgpu 0000:07:00.0: [drm] fb0: amdgpudrmfb frame buffer device
[   16.229330] amdgpu 0000:07:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=io+mem
