data: first operator run — 4 issues found, 3 notes, 2 escalations
Noob playtest: 4/7 passed, 3 blocked (perms). Found: context bleed via Mind's Eye, opus slot misconfigured, player_message tag leaking, test players need perms. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -0,0 +1,5 @@
|
||||
# context-bleed
|
||||
|
||||
## 2026-03-28 19:30
|
||||
|
||||
Cross-session context bleed observed: ask mode queried slingshooter08 (online player) instead of TestNoob after pray mode discovered that player. Session IDs are separate (TestNoob:ask vs TestNoob:pray) but Mind's Eye world context injection includes online player names, which the model then targets instead of the requesting player. This could cause confused responses in production.
|
||||
@@ -0,0 +1,5 @@
|
||||
# playtest-permissions
|
||||
|
||||
## 2026-03-28 19:30
|
||||
|
||||
Bot test players (TestNoob, TestBuilder, etc.) need permissions granted for all modes before playtesting. Use /raw to grant: perms_manage grant TestNoob sudo,pray,ask,raw. Without this, 3/7 noob commands are blocked by the permission check.
|
||||
@@ -0,0 +1,5 @@
|
||||
# tag-leaking
|
||||
|
||||
## 2026-03-28 19:30
|
||||
|
||||
`<player_message>` XML tags sometimes leak into response_text for /ask mode responses. The gateway's tag stripping (_parse_player_message) should catch these but isn't in all cases. Observed on ask mode test: "what does redstone do".
|
||||
Reference in New Issue
Block a user