Mortdecai

Files

Seth de14f4a1c8 3-GPU overnight self-play: 3090 Ti + 2080 Ti + RTX 4000

Round-robin load balancing across three Ollama instances:
- 141:11434 (RTX 3090 Ti 24GB)
- 141:11435 (RTX 2080 Ti 11GB) — new second instance
- 179:11434 (RTX 4000 16GB)

Each tier cycles to a different GPU. 3x throughput overnight.
Cycles: Tier 1 drills → Tier 2 self-critique → Tier 3 adversarial → repeat

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-20 00:54:29 -04:00
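
The round-robin scheme described in the commit above could be sketched roughly as follows. This is an illustrative sketch only, not the contents of overnight_selfplay.sh: the tier names and the `schedule` helper are hypothetical, and the endpoint strings are copied as written in the commit message ("141" and "179" are the commit's shorthand, not full host addresses).

```python
from itertools import cycle, islice

# Endpoint labels as written in the commit message (shorthand, not full addresses).
ENDPOINTS = ["141:11434", "141:11435", "179:11434"]

# Hypothetical tier names for the Tier 1 -> Tier 2 -> Tier 3 -> repeat cycle.
TIERS = ["tier1_drills", "tier2_self_critique", "tier3_adversarial"]


def schedule(num_steps):
    """Pair each successive tier with the next endpoint in round-robin order."""
    pairs = zip(cycle(TIERS), cycle(ENDPOINTS))
    return list(islice(pairs, num_steps))


if __name__ == "__main__":
    for tier, endpoint in schedule(6):
        print(f"{tier} -> {endpoint}")
```

With three tiers and three endpoints, this simple rotation pins each tier to one GPU. Whether the actual script does that or rotates tiers across GPUs between cycles is not stated in the commit message.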

.gitkeep

Initial project scaffold: dataset schema, 31 seed training examples, Mineflayer bot framework, and 7-phase roadmap

2026-03-18 01:51:28 -04:00

distill.py

Swarm bots, RCON validation, Haiku distillation complete

2026-03-18 19:18:19 -04:00

generate_tool_training.py

Tool-calling training: 1,159 multi-turn examples with error correction

2026-03-19 18:49:08 -04:00

overnight_selfplay.sh

3-GPU overnight self-play: 3090 Ti + 2080 Ti + RTX 4000