mock: attempt realistic state transitions #860

hawkw · 2025-02-18T19:49:36Z

Presently, when the mock propolis-server receives a put-state request
for Stopped, it sets self.state to Stopped and shuts down the mock
serial console, here:

propolis/bin/mock-server/src/lib/lib.rs

Lines 107 to 111 in c849bab

    
           api::InstanceStateRequested::Stop => { 
        
               self.state = api::InstanceState::Stopped; 
        
               self.serial_task.shutdown().await; 
        
               Ok(()) 
        
           }

However, it never actually sets the state_watcher state, which is
what's used by the instance_state_monitor endpoint, to Stopping,
the way we do for other state transitions:

propolis/bin/mock-server/src/lib/lib.rs

Lines 92 to 106 in c849bab

    
           api::InstanceStateRequested::Run 
        
           | api::InstanceStateRequested::Reboot => { 
        
               self.generation += 1; 
        
               self.state = api::InstanceState::Running; 
        
               self.state_watcher_tx 
        
                   .send(api::InstanceStateMonitorResponse { 
        
                       gen: self.generation, 
        
                       state: self.state, 
        
                       migration: api::InstanceMigrateStatusResponse { 
        
                           migration_in: None, 
        
                           migration_out: None, 
        
                       }, 
        
                   }) 
        
                   .map_err(|_| Error::TransitionSendFail) 
        
           }

If we want to actually use the mock server to test sled-agent behavior
around instance stop, we need to make it behave realistically here.

In real life, stopping an instance will cause it to go through multiple
state transitions: first to Stopping and then to Stopped. This also
Currently, the mock doesn't have a way to cause multiple state transitions
to be observed by the instance_state_monitor client. Therefore, I've
changed the implementation to support this, using a map of states by
generation number. Now, when the state monitor requests the next state
transition from a given generation, we will return the state at
gen + 1 in that map if one exists, or wait until more states are
added to the map. Transitions that cause the instance to go through
multiple states will now add all of those states to the queue of states
to simulate.

The state used by instance_get and for determining what state
transitions are updated is now represented by a variable tracking the
current state generation. This is updated only once we expose a new
state to the instance_state_monitor client, so the understanding of
the instance's state used to determine what requested transitions
are valid is kept in sync with what we've claimed to be from the state
monitor's perspective.

Testing: I've pointed the omicron repo's propolis-mock-server dep
at commit 28d81cb and run
cargo nextest run -p omicron-sled-agent.¹ All the tests still pass.

Fixes #857

I believe only the sled-agent test suite uses
propolis-mock-server? ↩

hawkw added 4 commits February 18, 2025 11:40

mock: attempt realistic state transitions

332ec46

only allow Rebooting in Running

33cdcdf

rustfmt/line wrap

739d45c

don't change the advertised state until observed

28d81cb

hawkw requested a review from gjcolombo February 19, 2025 21:44

hawkw marked this pull request as ready for review February 19, 2025 21:44

gjcolombo approved these changes Feb 19, 2025

View reviewed changes

hawkw merged commit 98d0823 into master Feb 20, 2025
11 checks passed

hawkw deleted the eliza/more-realistic-mock branch February 20, 2025 18:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mock: attempt realistic state transitions #860

mock: attempt realistic state transitions #860

hawkw commented Feb 18, 2025 •

edited

Loading

	api::InstanceStateRequested::Stop => {
	self.state = api::InstanceState::Stopped;
	self.serial_task.shutdown().await;
	Ok(())
	}

	api::InstanceStateRequested::Run
	\| api::InstanceStateRequested::Reboot => {
	self.generation += 1;
	self.state = api::InstanceState::Running;
	self.state_watcher_tx
	.send(api::InstanceStateMonitorResponse {
	gen: self.generation,
	state: self.state,
	migration: api::InstanceMigrateStatusResponse {
	migration_in: None,
	migration_out: None,
	},
	})
	.map_err(\|_\| Error::TransitionSendFail)
	}

mock: attempt realistic state transitions #860

mock: attempt realistic state transitions #860

Conversation

hawkw commented Feb 18, 2025 • edited Loading

Footnotes

hawkw commented Feb 18, 2025 •

edited

Loading