Thermostat
A continuous-state HVAC MDP — real-valued temperature, threshold control, noisy dynamics.
A continuous-state Markov decision process for HVAC control. A room's temperature is a floating-point measure, not a discrete state. Each cycle the controller heats if the temperature is at or below the threshold and coasts otherwise; the heater adds a normally-distributed gain and the weather subtracts a normally-distributed loss, so the trajectory drifts continuously. Comfort and energy are tracked as separate reward channels.
Shows the engine handling continuous state and distribution-valued updates, not just discrete transitions. The two independent reward channels let you study the comfort-versus-energy trade-off offline, directly from the exported room rows.
Linked tables with guaranteed referential integrity.
Generated REST endpoints. Also exposed as MCP tools.
OSI-compatible definition, emitted with the dataset.
# thermostat.osi.yaml — emitted automatically semantic_model: name: "thermostat" source: "duckdb://thermostat.db" entities: - name: room primary_key: id dimensions: - name: state type: categorical - name: t type: time measures: - name: row_count agg: count - name: active agg: sum filter: "state = 'ACTIVE'"
More worlds.
Game of Life
Conway's automaton as a perfectly observable, deterministic grid world.
London Underground
A live tube graph — eleven lines, hundreds of trains, platforms held as a mutex.
Pac-Man
A self-playing arcade game — ghosts chase a flood-filled distance field.