Self-play: 50 rounds, 0.1s sleep, max GPU utilization

Bumped from 20 rounds/tier to 50. Reduced sleep from 1s to 0.1s. GPUs should run near 100% — Ollama queues requests internally. mortdecai-sites container (CT 650) created on pve112. Landing page live at mortdec.ai. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-20 07:36:01 -04:00
parent dcc40a0bf8
commit 25918b5b66
2 changed files with 4 additions and 4 deletions
@@ -8,7 +8,7 @@
 MODEL="${1:-mortdecai-v4}"
 RCON_HOST="${2:-192.168.0.244}"
 RCON_PORT="${3:-25578}"
-ROUNDS_PER_TIER=20
+ROUNDS_PER_TIER=50
 SCRIPT_DIR="$(cd "$(dirname "$0")" && pwd)"
 OUTPUT="$SCRIPT_DIR/../../data/processed/self_play.jsonl"

@@ -620,7 +620,7 @@ def main():
                examples = trace_to_training(trace)
                all_examples.extend(examples)
                stats["training_examples"] += len(examples)
-                time.sleep(0.5)
+                time.sleep(0.1)

        elif tier == 2:
            # Self-critique: model generates prompt + response, RCON validates
@@ -654,7 +654,7 @@ def main():
                examples = trace_to_training(trace)
                all_examples.extend(examples)
                stats["training_examples"] += len(examples)
-                time.sleep(1)
+                time.sleep(0.1)

        elif tier == 3:
            # Adversarial: generate prompts in Session A, respond in fresh Session B
@@ -703,7 +703,7 @@ def main():
                examples = trace_to_training(trace)
                all_examples.extend(examples)
                stats["training_examples"] += len(examples)
-                time.sleep(1)
+                time.sleep(0.1)

        stats["rounds"] += 1