Improve multi-gpu handling at the limit

Still not complete. The prediction needs refinement to account for the free space on each discrete GPU, so we can see how many layers fit on each one. Since one layer can't be split across multiple GPUs, we can't treat the free space as one logical block.
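
To illustrate why pooled free space overestimates capacity, here is a minimal sketch, not the code from this commit. It assumes every layer has the same size, and the helper names `layersPerGPU` and `pooledLayers` are hypothetical, introduced only for this example.

```go
package main

import "fmt"

// layersPerGPU is a hypothetical helper: it returns how many whole layers
// fit on each GPU, assuming every layer needs layerSize bytes.
func layersPerGPU(freeBytes []uint64, layerSize uint64) []uint64 {
	counts := make([]uint64, len(freeBytes))
	if layerSize == 0 {
		return counts
	}
	for i, free := range freeBytes {
		// Integer division drops the remainder, which is the space
		// a partial layer would need but cannot use.
		counts[i] = free / layerSize
	}
	return counts
}

// pooledLayers is the naive estimate that treats all free space as one
// logical block.
func pooledLayers(freeBytes []uint64, layerSize uint64) uint64 {
	if layerSize == 0 {
		return 0
	}
	var total uint64
	for _, free := range freeBytes {
		total += free
	}
	return total / layerSize
}

func main() {
	free := []uint64{1000, 1000} // illustrative free bytes per GPU
	layerSize := uint64(600)     // illustrative size of one layer

	var fit uint64
	for _, n := range layersPerGPU(free, layerSize) {
		fit += n
	}
	fmt.Println("per-GPU total:", fit)
	fmt.Println("pooled estimate:", pooledLayers(free, layerSize))
}
```

With two GPUs that each have 1000 bytes free and 600-byte layers, only one layer fits per GPU (two in total), while the pooled estimate claims three.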
2025-05-11 18:36:41 +02:00 · 2024-05-18 12:34:31 -07:00 · 2024-05-18 12:34:31 -07:00 · 6fd04ca922
commit 6fd04ca922
parent 206797bda4
11 changed files with 390 additions and 90 deletions
--- a/gpu/amd_linux.go
+++ b/gpu/amd_linux.go
@@ -27,7 +27,7 @@ const (
 	GPUTotalMemoryFileGlob = "mem_banks/*/properties" // size_in_bytes line

 	// Direct Rendering Manager sysfs location
-	DRMDeviceDirGlob   = "/sys/class/drm/card[0-9]/device"
+	DRMDeviceDirGlob   = "/sys/class/drm/card*/device"
 	DRMTotalMemoryFile = "mem_info_vram_total"
 	DRMUsedMemoryFile  = "mem_info_vram_used"