VLM · local, aws-gpu

Gemma 4

Gemma 4 is the perception/tool VLM path for robot camera streams, sim frames, and multimodal workflow glue.

Runtime Target

Start with VLM frame analysis on MuJoCo or real robot camera frames; move to cloud when local memory is not enough.
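The local-versus-cloud decision above can be sketched as a simple memory check. This is a rough illustration, not the catalog's actual planner: `choose_runtime`, the overhead factor, and the example sizes are all hypothetical, and the estimate covers weights only (activations and image tokens would need extra headroom).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RuntimeChoice:
    target: str          # "local" or "cloud"
    required_gib: float  # estimated memory needed for model weights


def choose_runtime(params_billion: float, bytes_per_param: float,
                   free_gib: float, overhead: float = 1.2) -> RuntimeChoice:
    """Pick a runtime from a weights-only memory estimate.

    All inputs are assumptions supplied by the caller; `overhead` is a
    hypothetical safety factor, not a measured value.
    """
    required_gib = params_billion * 1e9 * bytes_per_param * overhead / 2**30
    target = "local" if required_gib <= free_gib else "cloud"
    return RuntimeChoice(target=target, required_gib=required_gib)


if __name__ == "__main__":
    # Hypothetical sizes, for illustration only.
    print(choose_runtime(params_billion=4, bytes_per_param=2, free_gib=24))
    print(choose_runtime(params_billion=27, bytes_per_param=2, free_gib=24))
```

A real planner would likely read free memory from the device rather than take it as an argument; passing it in keeps the sketch easy to test.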

PRO

aws-batch

nvidia-l4-or-a10g

mujoco-camera-stream

robot camera stream

local-or-cloud

AWS Batch runs a Gemma 4 GPU smoke test on an L4 using the Hugging Face model id. Catalog metadata, paid gating, autoscaling, and session planning are in place alongside it.
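As a hedged sketch of what submitting such a smoke job might look like, the function below builds the keyword arguments for boto3's Batch `submit_job` call. The queue name, job definition name, and `MODEL_ID` environment variable are hypothetical placeholders, and the Hugging Face model id is left to the caller because this page does not state it.

```python
def build_smoke_job_request(model_id: str,
                            job_queue: str = "example-gpu-queue",
                            job_definition: str = "example-vlm-smoke") -> dict:
    """Build kwargs for boto3 Batch `submit_job`.

    `job_queue`, `job_definition`, and the MODEL_ID variable name are
    hypothetical; substitute the values used by the real deployment.
    """
    return {
        "jobName": "gemma4-gpu-smoke",
        "jobQueue": job_queue,
        "jobDefinition": job_definition,
        "containerOverrides": {
            # Pass the model id to the worker image via an environment variable.
            "environment": [{"name": "MODEL_ID", "value": model_id}],
            # Request a single GPU for the smoke run.
            "resourceRequirements": [{"type": "GPU", "value": "1"}],
        },
    }


def submit_smoke_job(model_id: str) -> str:
    """Submit the job and return its id (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the request builder works without it

    response = boto3.client("batch").submit_job(**build_smoke_job_request(model_id))
    return response["jobId"]
```

Keeping the request builder separate from the network call means the payload can be checked in unit tests without AWS access.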

Scale-to-zero AWS Batch infrastructure, ECR worker image wiring, and cloud-session control-plane responses are deployed as the first production path.

Production camera-stream serving, longer VLM evaluations, and Nebius lifecycle dispatch still need deployment hardening.