A critical challenge in FlashAdventure is the observation-behavior gap: the time lag between when an agent observes a piece of information and when it can act on it. Unlike prior benchmarks, which focus on short-term objectives or cover only short story arcs, FlashAdventure emphasizes completing full story arcs that involve long-term objectives.

Adventure games require agents to manage the long-term dependencies that are crucial for solving full story arcs. Tolman's theory of latent learning suggests that humans can retrieve and apply clues after a long delay; the same setting can be used to assess whether agents exhibit similar emergent behavior.
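
To make the observation-behavior gap concrete, one possible way to quantify it is to count the steps between the first time a clue is observed and the first time it is acted upon. The sketch below is purely illustrative and is not the benchmark's own metric: the `Step` record, the clue names, and the `observation_behavior_gaps` function are all hypothetical, and it assumes that each step of a trajectory has already been annotated with the clues observed and used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One annotated step of a gameplay trajectory (hypothetical format)."""
    index: int                        # step number within the episode
    observed: frozenset = frozenset() # clues first visible to the agent here
    used: frozenset = frozenset()     # clues the agent's action relies on here


def observation_behavior_gaps(trajectory):
    """Return {clue: steps between first observation and first use}."""
    first_seen = {}
    gaps = {}
    for step in trajectory:
        # Record only the earliest step at which each clue was observed.
        for clue in step.observed:
            first_seen.setdefault(clue, step.index)
        # Record the gap the first time an already-observed clue is used.
        for clue in step.used:
            if clue in first_seen and clue not in gaps:
                gaps[clue] = step.index - first_seen[clue]
    return gaps


# Illustrative trajectory: one clue is used right away, the other much later.
trajectory = [
    Step(0, observed=frozenset({"safe_code"})),
    Step(1),
    Step(2, observed=frozenset({"key_location"})),
    Step(3, used=frozenset({"key_location"})),
    Step(40, used=frozenset({"safe_code"})),
]
gaps = observation_behavior_gaps(trajectory)
```

Under these assumptions, `key_location` has a gap of 1 step and `safe_code` a gap of 40 steps; the latter is the kind of long-delay dependency that full story arcs demand.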