Mortdecai

Author	SHA1	Message	Date
Seth	b85b1a6725	40 risk hierarchy examples: L0 blocked, L1 permanent, L2 temporary, injections Risk hierarchy baked into training data: - L0 BLOCKED (15): ban, kick, stop, op, deop, whitelist, pardon, ban-ip - L1 REFUSE (9): permanent gamerules, gamemode @a, default gamemode, difficulty - L2 WARN (8): temporary gamerules with reversal intent, time-limited changes - L3 NORMAL (8): time/weather, tick speed, sleep %, chat cleanup - Prompt injection (5): fake admin claims, permission override attempts Key principle: permanence determines risk level. gamerule keepInventory true (permanent) = L1 gamerule doMobSpawning false for 5 min (temporary) = L2 randomTickSpeed 50 (easily reversed) = L3 Seed dataset: 2,306 examples Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 23:30:46 -04:00
Seth	fbf6974af3	49 gamerule + invincibility training examples Covers all major gamerules with natural language variants: - Mob spawning/griefing, keepInventory, daylightCycle, weatherCycle - Fire tick, insomnia/phantoms, instant respawn, natural regen - randomTickSpeed (crop growth), sleep percentage, TNT, fall/fire/drowning damage - Command feedback, advancement announcements, death messages - God mode / invincibility via resistance 5 effect - "disable mobs" and "invincibility me" — prompted by prod failures Seed dataset: 2,266 examples Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 23:27:26 -04:00
Seth	7a31e500e4	Qwen3.5-9B on prod, Gemini 2.5 Flash for dev, error correction, branding Prod deployment: - paper-ai and shrink-world switched from gemma3n:e4b to qwen3.5:9b - Error correction: detects RCON errors (<--[HERE]), asks model to fix, retries - Broadened error patterns: Unknown game mode, Unknown enchantment, etc. - Fixed fire fallback matching "firework" as fire intent - Fixed command format examples (WRONG vs RIGHT in prompt) - max_tokens bumped to 600 for command calls - Removed template workflow commands from sudo prompt Dev server: - Gemini 2.5 Flash ($0.15/$0.60 per M tokens) replaces Flash Lite - 10 bots for ~$1-1.5/hr training data generation - Dynamic pricing by model name in cost tracker Branding: - Rajdhani Bold as official Mortdecai font - Logo variants: mortdecai + mortdec.ai in 6 fonts - Whitelist page updated with Mortdecai branding + mortdec.ai domain Whitelist UUID fix: - Looks up real Mojang UUID via api.mojang.com - Patches all whitelist.json files directly - No more offline-mode UUID mismatches WorldEdit schematics: - 77 schematics installed (villages, bridges, lighthouses, parks, etc.) Mortdecai v4 training in progress: 63% complete on steel141 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 23:09:27 -04:00
Seth	b75a737c11	7 enchantment syntax error examples: count order, typos, old NBT Common errors seen in prod: - Count before brackets: sword 1[enchantments=...] → sword[enchantments=...] 1 - Typo: enchanments → enchantments - Singular: enchantment → enchantments - Old NBT: {Enchantments:[...]} → [enchantments={...}] - Old abbreviated: {ench:[...]} → [enchantments={...}] Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 22:20:33 -04:00
Seth	a3d139e04f	Mortdecai v4 pre-training: /no_think, dedup, 3,369 examples - /no_think prepended to all system prompts (seed + tool training) - Deduplicated seed dataset (435 dupes removed) - Training script updated for Qwen3.5-9B + /no_think - 2,210 seed + 1,159 tool-calling = 3,369 total examples Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 20:15:00 -04:00
Seth	d31cdb21fd	1,833 training examples: entities, execute chains, multiplayer, advanced, redstone, biomes, errors New knowledge (291 examples): - Entity/mob commands (60): summon, kill, NBT, spawn eggs, passengers, named mobs - Execute chains (45): as/at/positioned/if/unless/store, dimension switching - Multiplayer targeting (45): selectors, teams, scoreboards, bossbars, tags - Advanced commands (45): tellraw, loot, clone, data, attributes, ride, forceload - Redstone knowledge (28): repeaters, comparators, pistons, observers, hoppers - Biome/dimension (28): nether/end tp, locate structure/biome, dimension awareness - Error correction (40): item ID fixes, enchant abbreviations, syntax mistakes Total seed dataset: 1,833 examples Tool-calling dataset: 1,159 examples Combined for v4 training: ~3,000 examples Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 19:22:32 -04:00
Seth	750cf15c79	1,542 seed + 1,159 tool-calling examples, async processing, validator tracking New knowledge baked in: - Enchantments (60): all 1.21 enchants, mutual exclusions, max levels, component syntax - WorldEdit (45): //set, //replace, //sphere, //stack, selection, brushes - Paper server (55): gamerules, permissions, plugins, scoreboard, moderation - Cosmetics/XP (42): title, tellraw, playsound, particle, xp, effect mechanics - Quantity boundaries (32): item tier caps, greedy→stingy, humble→generous Training infrastructure: - train_lora.py updated for multi-turn tool conversations + seed data - Async prayer/sudo processing (ThreadPoolExecutor, 3 workers) - Validator hit-rate tracking to /var/log/mc_validator_stats.json Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 19:03:30 -04:00
Seth	ee764cd22a	Tool-calling training: 1,159 multi-turn examples with error correction Tool schemas (agent/tools/tool_schemas.py): - rcon.execute: execute commands, get success/error results - minecraft.wiki_lookup: look up syntax and item info - world.player_info: player health, position, inventory - world.server_state: time, weather, online players - 10 RCON error patterns with corrections - 12 common error scenarios for training Training data generator (training/scripts/generate_tool_training.py): - Converts seed dataset to multi-turn tool conversations - Error correction: model tries wrong command → gets error → self-corrects - Wiki/player/server lookups for uncertainty scenarios - Qwen3 native tool-calling format with <tool_call> tags 1,159 examples: 1043 success, 79 error correction, 24 error scenarios, 13 tool lookups. Ready for v4 training. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 18:49:08 -04:00
Seth	4e83da39fd	Quantity boundaries: item tier caps, tone-based scaling, 32 training examples God Soul updated with quantity rules: - Common (dirt/wood): max 320, Uncommon (iron/gold): max 128 - Rare (diamond/emerald): max 32, Very rare (netherite/elytra): max 4 - Forbidden (bedrock/command_block): never give - Greedy → scaled back, Humble → generous within cap, Absurd → comedic 32 training examples: greedy(6), casual(6), humble(4), explicit(6), forbidden(5), absurd(3), enchanted(2) Dataset: 1,340 examples total Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 18:22:26 -04:00
Seth	e780aef8c6	v3 model trained (1,308 examples, loss 0.55), API cascade, context update v3 training: - 1,308 examples: curated + Claude-distilled + bot audit + recipes + command ref - 1 epoch, rank 16, LR 1e-4, loss 0.55 (sweet spot) - GGUF Q4_K_M exported, loaded in Ollama as qwen3-8b-mc-lora-v3 - Correct commands, no Chinese, proper safety refusals, dramatic God persona API cascade for dev server: - Stage 1: Claude Haiku ($20 budget, ~$11 spent) - Stage 2: Gemini 2.5 Flash Lite ($20 budget) - Stage 3: qwen3-8b-mc-lora-v3 (free, local) - Gemini call function with persistent cost tracking - Full status report printed at each $1 milestone Data collection: 2,677 dev audit entries and growing Bot status printer budget display fix Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 04:52:04 -04:00
Seth	234f2722db	v3 training dataset: 1,308 examples with risk_level + distilled data Merged: 964 curated + 344 Claude-distilled = 1,308 total All examples tagged with risk_level (0-4) Model outputs risk classification in training target Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 22:51:17 -04:00
Seth	e28836106f	Risk_level in all 644 examples + model outputs risk classification - All 644 examples tagged: 0=blocked(15), 1=refuse(33), 2=warn(24), 3=normal(498), 4=generous(74) - Training output now includes risk_level field for decision transparency - Model learns to classify risk before generating commands - Validator can sanity-check: risk 0-1 should have empty commands Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 22:35:50 -04:00
Seth	0473eb0b50	Minecraft knowledge corpus, recipe trees, GitHub scraper, 644 examples Knowledge corpus (knowledge/mc-data/): - 1505 items, 886 crafting recipes, 1166 blocks from minecraft-data 1.21.11 - Recipe dependency tree builder (knowledge/build_recipe_tree.py) - Crafting chain training: "give me everything to make X from scratch" - Smelting recipes, version awareness examples Training data (644 examples total): - 107 command syntax reference examples (every command + common errors) - 176 recipe/crafting chain examples (63 crafting, 103 material-giving, 11 smelting) - 344 Claude-distilled examples (222 sudo + 122 god via Haiku) - Live bot audit data ingested (128 examples from dev server) Swarm bots: - Swimming/water escape logic - Door opening - Context-aware prayers (inventory, health, time, depth) - Prefix enforcement on all Gemini/Dolphin prompts GitHub log scraper (data/scrape_server_logs.py): - Searches GitHub for Minecraft server logs with commands - Strict 1.20.5+ version filter - Extracts command pairs, converts to training format Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 20:33:09 -04:00
Seth	65ee146043	Swarm bots, RCON validation, Haiku distillation complete Swarm bots (ingame/swarm_bots.js): - 10 survival bots with generated names (SwiftWolf, DarkWolf, etc.) - All bots wander, take damage, auto-respawn, pray when hurt - Gemini + Dolphin(5%) + Multilingual(3%) prompt generation - 20-60s interaction interval per bot Distillation results: - 222 sudo examples via Haiku ($0.28) - 122 god examples via Haiku ($0.37) — with God Soul personality - Total: 344 distilled, $0.65 spent of $5 budget - RCON validation: 74.7% fully valid, 30 real errors out of ~1000 commands validate_distilled.py: - Executes distilled commands on live server via RCON - Distinguishes real errors from benign (no player online) - Tags each example with validation status Dev server switched to Claude Haiku via Anthropic API: - llm_provider: anthropic with $5 budget cap - Auto-fallback to Ollama when budget exhausted - Cost tracking with logging Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 19:18:19 -04:00
Seth	62419976e5	361 training examples, default to 1 epoch Ingested 128 new examples from bot-driven data collection. Dropped: 86 duplicates, 19 language mismatches, 10 prompt leaks, 19 empty. Changed default epochs from 3 to 1 (previous run overfit at loss 0.10). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 18:03:33 -04:00
Seth	13debc8a59	Add audit log ingestion pipeline with language/leak filtering data/ingest_audit.py: - Pulls training audit logs from CT 644 (dev + prod) - Filters: language mismatch (Chinese output for English input), system prompt leaks, empty responses, duplicates - Keeps multilingual examples where input/output languages match - Converts to dataset schema, appends to seed_dataset.jsonl - --dry-run to preview, --source dev/prod/both Tested: 237 entries → 112 kept (16 lang mismatch, 10 prompt leak, 86 dupe, 13 empty dropped) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 17:58:52 -04:00
Seth	78031d16c0	Risk gradient (0-5), updated system prompts, 233 examples Risk gradient system: - All 233 training examples tagged with risk_level (0-5) - 0=blocked(15), 1=refuse(9), 2=warn(17), 3=normal(169), 4=generous(23) - Schema updated with risk_level and scoring_mode fields - Eval harness uses risk_level for safety scoring System prompts rewritten: - Shared syntax rules and risk gradient reference across all modes - Sudo: permission level 4, do what admin asks, only refuse level 0-1 - God: permission level 2-4 (mood-dependent), character-driven decisions - God_system: permission level 3, 80% benevolent / 15% mischievous / 5% wrathful Data: - 20 new live playtest examples from training audit log (233 total) - 43 wrong→right pairs (17 from validator repairs) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 16:14:54 -04:00
Seth	9d789d2524	Three-tier constraint model, mode-aware eval, boundary examples, playtest tooling Eval harness: - Mode-aware scoring: sudo=strict (exact match), pray/god=soft (category match, in-character, appropriate intensity) - New metrics: cmd_category_match, appropriate_intensity, scoring_mode breakdown - Eval defaults to steel141 (192.168.0.141) — prod GPU reserved for serving Dataset (213 examples): - Added 31 boundary/adversarial examples (safety edges, abstention, near-boundary) - Updated pray example reasoning: character-driven logic, not prescriptive outputs - Tagged pray examples with scoring_mode=soft Playtest tooling: - whitelist.sh: add/remove/list across all 3 servers - FRIENDS_INVITE.md + Discord version: playtester recruitment docs - Server addresses and implementation details for both training servers PLAN.md: - Three-tier constraint model documented (sudo/pray/god_system) - Success criteria split by scoring mode - All session decisions logged Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 15:57:01 -04:00
Seth	38b9a02e45	Phase 2: eval harness, 182 examples, live bake-off, playtest infrastructure - Expanded dataset from 31 to 182 examples (45 manual + 106 extracted from server logs) - Built eval/harness.py with per-category breakdowns and baseline tracking - Built eval/live_bakeoff.py for RCON-verified model comparison on live server - Extracted training data from prayer logs, sudo logs, and bug reports on CT 644 - Added Reddit post draft and modmail for playtester recruitment - Updated server context: all servers now online-mode=false + whitelist - Updated PLAN.md with Phase 2 progress Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 13:38:12 -04:00
Seth	827850b8d7	Initial project scaffold: dataset schema, 31 seed training examples, Mineflayer bot framework, and 7-phase roadmap - IDEA.md: project scope (Minecraft ops AI assistant via qwen3-coder LoRA/SFT) - PLAN.md: complete roadmap with prior art analysis, architecture, phased plan, dev server docs - data/schema.json: training example JSON Schema with negative_output support - data/processed/seed_dataset.jsonl: 31 validated examples from repair code, prayer logs, session history - data/validate_dataset.py: schema validator with summary statistics - ingame/: Mineflayer bot framework (test_connect, spawn_bots, aware_bots with full event logging) - Directory structure for knowledge/, eval/, training/, agent/ (Phase 1.3+ work)	2026-03-18 01:51:28 -04:00

20 Commits