Mortdecai

Files

T

Seth 48b627d498 Add LoRA training scripts and fix bake-off token budget

- training/scripts/train_lora.py: Unsloth QLoRA trainer for qwen3:8b
- training/scripts/train_lora.sh: Launch script for steel141 RTX 3090 Ti
- eval/bakeoff.py: Fixed token budget (400->1500) that caused qwen3
  models to exhaust tokens on thinking, added --no-think flag
- agent/serve.py: Default model changed to gemma3n:e4b

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-18 10:40:18 -04:00

.gitkeep

Initial project scaffold: dataset schema, 31 seed training examples, Mineflayer bot framework, and 7-phase roadmap

2026-03-18 01:51:28 -04:00

train_lora.py

Add LoRA training scripts and fix bake-off token budget

2026-03-18 10:40:18 -04:00

train_lora.sh

Add LoRA training scripts and fix bake-off token budget

2026-03-18 10:40:18 -04:00