Scaling out in Kubernetes happens by increasing the number of pods, so this sentence needs some rewording. A Deployment spins up a batch of new pods (25% of the total by default, though configurable) and terminates the same number of pods from the previous deploy. Rinse and repeat in an A/B fashion until all of the pods have been replaced.
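For reference, the rolling-update behaviour described above corresponds to the standard `apps/v1` Deployment strategy fields; a minimal sketch (the name `mediawiki` and replica count are hypothetical, and the 25% values shown are the Kubernetes defaults):

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: mediawiki            # hypothetical name, for illustration only
spec:
  replicas: 8
  selector:
    matchLabels:
      app: mediawiki
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 25%          # how many extra new pods may be created at once
      maxUnavailable: 25%    # how many old pods may be killed before replacements are Ready
  template:
    metadata:
      labels:
        app: mediawiki
    spec:
      containers:
        - name: mediawiki
          image: example/mediawiki:latest   # placeholder image
```

Both `maxSurge` and `maxUnavailable` accept either a percentage or an absolute pod count, which is where the "configurable" part comes in.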
Topic on Talk:Wikimedia Release Engineering Team/MW-in-Containers thoughts
Ah, yes, will re-word.
Is this change sufficient?
Yes, I think so. I am still a bit unclear on the "How do we tell the controller (?) which deployment state to route a given request to?" part, but judging from the question mark after "controller", we probably want to define that first.
That's wrapped up in the decision about how we want to do the A/B split of traffic to the new deploy: do we split uniformly at random across all request types (standard k8s behaviour), or do we do something closer to what we do now (roll out by request type, sharded by target wiki), or something else? I've left that open, as I don't think it's been discussed.
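To make the two options concrete: with a service mesh such as Istio (just one possible implementation, not a decision) the uniform split is a weighted route, while the "shard by target wiki" option becomes a host-based match routed ahead of the weighted default. A hedged sketch, with all names (`mediawiki`, the subsets, the wiki hostname) hypothetical:

```yaml
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: mediawiki
spec:
  hosts:
    - mediawiki.svc.cluster.local
  http:
    # Option B: shard by target wiki — requests for one wiki go to the new deploy.
    - match:
        - headers:
            host:
              exact: test.wikipedia.org   # hypothetical pilot wiki
      route:
        - destination:
            host: mediawiki
            subset: new                   # pods from the new deploy
    # Option A: uniform random split across everything else.
    - route:
        - destination:
            host: mediawiki
            subset: old                   # pods from the previous deploy
          weight: 90
        - destination:
            host: mediawiki
            subset: new
          weight: 10
```

Whichever way we go, this is the piece that would answer the "which controller?" question above.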