Files

T

Seth bd65f4a84c Add LICENSE, MODEL_CARD, requirements, CONTRIBUTING

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-20 21:43:21 -04:00

Model Card: Mortdecai

Model Details

Field	Value
Name	Mortdecai
Version	0.4.0
Base Model	Qwen3.5-9B (Apache 2.0)
Adaptation	QLoRA (4-bit base + LoRA adapters in FP16)
Parameters	9.4B total, 29M trainable (0.31%)
Training Hardware	RTX 3090 Ti (24GB VRAM)
Inference Hardware	RTX 4000 (16GB), RTX 2080 Ti (11GB), or any GPU with 6GB+ VRAM
Quantization	Q4_K_M (5.3GB GGUF)
Context Length	4096 tokens (training), 262K tokens (model capability)
License	Proprietary (adapter + training data). Base model: Apache 2.0

Mortdecai is designed for Minecraft Java Edition 1.21.x server operations:

Not intended for:

Source	Count	Description
Hand-curated examples	966	Command syntax, recipes, enchantments, entities, effects
Player interactions	654	Real prayers from live server players
Sudo translations	525	Natural language → command pairs
Tool-calling sequences	1,159	Multi-turn RCON execution with error correction
Self-play	5,000+	Model-generated prompts validated via RCON
API distillation	344	Claude Haiku gold-standard responses
Error corrections	150+	Wrong → right command pairs

Total: ~8,400+ examples

Manual curation — Minecraft Wiki, command reference, recipe databases
Live server logs — Real player interactions on Paper 1.21.x servers
Bot collection — Mineflayer bots with Gemini/Dolphin prompt generation
API distillation — Claude Haiku and Gemini Flash responses
Self-play — Model generates edge cases, attempts via RCON, learns from results
RCON validation — Every command tested against a live Minecraft server

Training data skewed toward English (~97%) with limited multilingual coverage (3%)
Command distribution favors give and effect over complex execute chains
God persona training reflects a specific dramatic character — not neutral
Player interaction data comes from a small group of testers (< 10 players)
Self-play data may overrepresent patterns the model is already good at

The model uses a 5-level risk hierarchy:

Additional safety layers: