TigerPOMDP · replay
event-log playback
Episode:
#2 — clean win
#34 — the growls lied
#51 — long deliberation
Two doors — listen, believe, open
◀ tiger LEFT · belief b = P(tiger-left | growls) · tiger RIGHT ▶
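The belief b = P(tiger-left | growls) shown on the axis above can be tracked with a Bayes update after each growl. The following is a minimal sketch, not the page's own code: the function names `update_belief` and `expected_open_value` are hypothetical, and the growl accuracy of 0.85 is an assumption taken from the classic Tiger problem, since this page does not state it. The rewards (+10 treasure, −100 tiger) are the ones listed in the legend.

```python
TREASURE = 10.0   # reward for opening the treasure door (from the page legend)
TIGER = -100.0    # reward for opening the tiger door (from the page legend)


def update_belief(b_left: float, heard_left: bool, accuracy: float = 0.85) -> float:
    """Bayes update of b = P(tiger-left) after one growl.

    accuracy is P(growl heard on the tiger's true side); 0.85 is an
    assumed value, not something this page specifies.
    """
    # Likelihood of the observation under each hidden state.
    like_if_left = accuracy if heard_left else 1.0 - accuracy
    like_if_right = 1.0 - accuracy if heard_left else accuracy
    numerator = like_if_left * b_left
    return numerator / (numerator + like_if_right * (1.0 - b_left))


def expected_open_value(b_left: float, open_left: bool) -> float:
    """Expected immediate reward of opening a door under belief b_left."""
    p_tiger_behind_door = b_left if open_left else 1.0 - b_left
    return p_tiger_behind_door * TIGER + (1.0 - p_tiger_behind_door) * TREASURE


if __name__ == "__main__":
    b = 0.5  # uniform prior: no growls heard yet
    for heard_left in (True, True, False):
        b = update_belief(b, heard_left)
        print(f"heard {'left' if heard_left else 'right'}: b = {b:.4f}, "
              f"open-right value = {expected_open_value(b, open_left=False):.2f}")
```

Under these assumptions a single growl is not enough to make opening a door worthwhile: at b = 0.85 the expected value of opening the right door is still negative, because the −100 penalty dominates the +10 treasure.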
Return distribution — partial observability is bimodal
treasure +10
tiger −100
growl / belief left
growl / belief right
space: play · ←/→: step