[matt-strix] gemma4:26b short — prefill=1275.71 tok/s decode= 53.83 tok/s
[matt-strix] gemma4:26b long — prefill=14326.07 tok/s decode= 52.42 tok/s
[matt-strix] gemma4:31b short — prefill= 291.74 tok/s decode= 10.64 tok/s
[matt-strix] gemma4:31b long — prefill= 3277.8 tok/s decode= 10.42 tok/s
[matt-strix] gemma4:26b-q8 short — model not available on host
[matt-strix] gemma4:26b-q8 long — model not available on host