Add GPU container with RTX 5070 passthrough #55

Merged
AlexCaswen merged 3 commits from 55-ml-container-gpu into main 2026-05-10 05:02:41 +00:00
AlexCaswen commented 2026-05-10 04:56:00 +00:00 (Migrated from gitlab.com)

Changes

New gpu container for GPU compute workloads (MoLE research, rvLLM, trading inference):

  • incus_storage_volume.gpu_data on m3-trade-pool, mounted at /var/lib/models
  • incus_instance.gpu at 10.121.25.32 (workload range)
  • RTX 5070 physical GPU passthrough via gpu device
  • Uses default + nixos-container profiles
  • Boot priority 50 (workload tier)
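
For readers without the diff at hand, the resources listed above might look roughly like the following Terraform sketch. It is not the merged code. It assumes the Incus Terraform provider's `device` block layout, and the volume name, image alias, network name, device names other than `gpu`, and the mapping of "boot priority" to `boot.autostart.priority` are all guesses made for illustration.

```hcl
# Illustrative sketch only; values marked "assumed" do not come from the PR.

resource "incus_storage_volume" "gpu_data" {
  name = "gpu-data"      # assumed volume name
  pool = "m3-trade-pool" # pool named in the PR description
}

resource "incus_instance" "gpu" {
  name     = "gpu"
  image    = "images:nixos/unstable" # assumed image alias
  type     = "container"
  profiles = ["default", "nixos-container"]

  config = {
    # Assumed to be the key behind "Boot priority 50 (workload tier)".
    "boot.autostart.priority" = "50"
  }

  # Static address in the workload range.
  device {
    name = "eth0" # assumed device name
    type = "nic"
    properties = {
      network        = "incusbr0" # assumed managed network name
      "ipv4.address" = "10.121.25.32"
    }
  }

  # Custom volume for model storage.
  device {
    name = "models" # assumed device name
    type = "disk"
    properties = {
      pool   = incus_storage_volume.gpu_data.pool
      source = incus_storage_volume.gpu_data.name
      path   = "/var/lib/models"
    }
  }

  # Physical passthrough of the RTX 5070.
  device {
    name = "gpu"
    type = "gpu"
    properties = {
      gputype = "physical"
      # On a host with more than one GPU, a selector such as `pci` would
      # likely be needed here; the PR does not state one.
    }
  }
}
```

Referencing `incus_storage_volume.gpu_data` from the disk device lets Terraform create the volume before the instance without an explicit `depends_on`.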

Why workload range, not AI range

The GPU container is a compute tool, not an AI agent. It sits alongside dev (.30) and trade (.31) in the workload range. The .4x range is reserved for autonomous agents (claude-code, codex).

Post-merge

Container boots as bare NixOS with GPU access. Follow-up: configuration.nix with CUDA/nvidia packages, rvLLM, and a deploy job.

Closes #55

AlexCaswen commented 2026-05-10 05:00:31 +00:00 (Migrated from gitlab.com)

added 1 commit

  • 74690c32 - Rename ml -> gpu, move to workload range .32

AlexCaswen commented 2026-05-10 05:00:42 +00:00 (Migrated from gitlab.com)

added 1 commit

  • e97260d9 - Add gpu container with RTX 5070 passthrough at .32

AlexCaswen commented 2026-05-10 05:00:52 +00:00 (Migrated from gitlab.com)

changed title from Add ML container with RTX 5070 GPU passthrough to Add GPU container with RTX 5070 passthrough
AlexCaswen commented 2026-05-10 05:00:52 +00:00 (Migrated from gitlab.com)

changed the description
AlexCaswen commented 2026-05-10 05:02:10 +00:00 (Migrated from gitlab.com)

assigned to @AlexCaswen
AlexCaswen (Migrated from gitlab.com) merged commit 7aae4a82d0 into main 2026-05-10 05:02:41 +00:00
AlexCaswen commented 2026-05-10 05:02:42 +00:00 (Migrated from gitlab.com)

mentioned in commit 7aae4a82d0b333fd83c13feea200b134ffc47272
Reference
midwitmoneymgmt/m3-infra!55