Wikimedia Release Engineering Team/Checkin archive/20160725
2016-07-25
editVacations/Important dates
editHow to do it: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Time_off
- July 25 - August 15: Željko vacation. Will have laptop with me. Reachable via phone.
- July 30 - August 21: Antoine vacation. At home 1st week.
- August 1st - 5th: Mukunda - vacation: Concert & relaxation
...
- January 9-11: Dev Summit
- January 12-13: All Hands
Team Business
editRotating positions and absences
editManiphest query for deployment blocker tasks: https://phabricator.wikimedia.org/u/blockers
weeks of July 25 and Aug 1
edit- Train: Tyler
- SoS: Mukunda / Tyler
- Out:
- Zeljko: July 25 - Aug 15
- Antoine: July 30 - Aug 21
- Mukunda: Aug 1-5
weeks of Aug 8 and Aug 15
edit- Train: Mukunda
- SoS: Chad
- Out:
- Zeljko: July 25 - Aug 15
- Antoine: July 30 - Aug 21
Time spent spreadsheet
edit
Actions from last meeting
edit- TODO: file task re upgrading MW-Vagrant guests to Jessie
- done by bryan :)
- https://phabricator.wikimedia.org/T136429
Scrum of Scrums
edit- https://phabricator.wikimedia.org/project/board/64/
- Blocked on us: https://phabricator.wikimedia.org/maniphest/query/h7YTCBTJsepS/#R
This week
edit- Blocking
- Blocked
- Updates
- Labs update (with CI distruption)
- Proposed 2016-08-02. What time is best?
- Andrew still wondering about https://phabricator.wikimedia.org/T139771
- If everything is fine w/CI response then it's all good :)
- Labs update (with CI distruption)
Last week
editBlocking
- Android to differential
- Blocked
- None
- Updates
- Zuul upgraded this week, should address a bunch of issues
- New SWAT deploy process going ok, reminder to install
https://wikitech.wikimedia.org/wiki/X-Wikimedia-Debug if you're putting things up for SWAT
Other Team Business
editProject tech leads
edit- https://phabricator.wikimedia.org/T139540#2485589
- tl;dr: how about we just add an explicit "Lead" for each quarterly goal?
Offsite
edit- agenda/purpose :)
- https://phabricator.wikimedia.org/T138437
- Will be meeting with Kristen Lans from TPG re potential TPG support in 2 hours :)
- Last offsite writeup with lessons learned: https://docs.google.com/document/d/17C6x_Sys21DcEZ_HxgLA7FkCYeiTUAzZX1XbEcZTNfw/edit#
Q1 goal/project check-in
editPhase out Ubuntu Precise
editReplace primary production Continuous Integration host (gallium
) - task T95757
edit
- NEXT: https://phabricator.wikimedia.org/T139771 - "Identify metric (or metrics) that gives a useful indication of user-perceived (Wikimedia developer) service of CI"
- Tyler and Hashar reply to Faidon's comment, keeping focused on getting off of gallium for now
Upgrade Phabricator database servers to Maria10/Jessie - task T138460
edit- Done
Upgrade Beta Cluster database servers to Maria10/Jessie - task T138778
editwaiting on Jaime to priority- Priority is "this quarter" (not "this month" or "next week")
- up to us to schedule, should be no more than an hour for Jaime
- NEXT: Needs an owner
- DAN!
Reduce Technical Debt
editPerform a technical debt analysis of software and services maintained by WMF Release Engineering - task T138225
- Original mega sheet: https://docs.google.com/spreadsheets/d/1Kxj9p4fKVNo2h23yAQVoOGg77dZ4FLxeXuYrH-1CrPA/edit#gid=0
- Already is tracking specific 'things' which need to be addressed
- Redux: https://docs.google.com/spreadsheets/d/1Ncbgbg-ZPSSScOaGswQSJRtreuJOlizRCFln4KyfMWI/edit#gid=0
- Simply severity+importance.
- Redux Redux: https://docs.google.com/spreadsheets/d/1btVdLuV59GZkQax8Hk0jkWDeyF5O_M5HkylWL0WxHxo/edit#gid=0
- Just severity
- Next steps?
- Fill out Redux Redux
- Based on Redux Redux identify the one thing to focus on
- then plan accordingly in Phabricator
Streamline deployments (long-lived branches)
editkeyresult task:
- Convert our production deployment strategy to use long-lived branches - task T89945
project view: https://phabricator.wikimedia.org/project/view/2117/
Non-Quarterly goal work
editSWAT deploy changes
edit- European SWAT deploys next steps (task T137970
- NEXT: stalled pending finding people to do the SWAT window while Antoine and Zeljko are on vacation
- Week of 20th August lets gogogo
CI Scaling/Nodepool
edit- Wait time for Nodepool instances https://grafana.wikimedia.org/dashboard/db/releng-kpis
- Zuul repackaged with latest upstream. Will upgrade all fleet early this week.
- debian-glue job enhanced
- TODO: Zuul packaging tutorial
- TODO: android job. Move to Jessie, hacked over the week-end has to be polished https://phabricator.wikimedia.org/T139137
- MySQL on CI slaves either shutdown randomly / or does not start on boot :(
Browser tests
editDifferential migration
editDifferential weekly (https://etherpad.wikimedia.org/p/diffuerential-weekly ) TODOs:
- Mukunda had questiosn for antione re puppet (keys into the private store, production or other? for CI image builder)
- Update documentation on creating/renaming of repos in Diffusion
- Update task with discussion about ACLs?
- Announce plan to migrate MW-Vagrant to Differential
- https://phabricator.wikimedia.org/T131419#2439362
- outstanding patches should be either merged, abandoned or migrated to differential revisions.
Beta Cluster
edit- "deployment-fluorine becomes unresponsive frequently" - https://phabricator.wikimedia.org/T140313
- TODO: Submit patch ( https://gerrit.wikimedia.org/r/#/c/299672/ ) for PuppetSWAT?
Other
edit- Figure out how to help Jaime with the DB schema inconsistencies issue:
- https://phabricator.wikimedia.org/T132416 and https://phabricator.wikimedia.org/T104459 (see also: https://www.mediawiki.org/wiki/Development_policy#Database_patches )
- What can we do in CI to help prevent, mostly?
- Chad will lick this cookie :)
Scap querying logstash now for the canaries:
- Email to review: https://etherpad.wikimedia.org/p/scap-announce-2016-07-25
People status updates
editAntoine
editLast week
edit- Gerrit upgrade / Zuul upgrade
- Target host to replace gallium
- Sync up with Tyler for CI / gallium phase out
- Moaar maintenance
- Offsite site/date
This week
edit- Zuul upgrade to latest upstream
- Zuul packaging doc
- Vacations backup plan
Chad
editLast week
edit- Moar Gerrit. Train. Choo choo.
This week
edit- Gerrit. Remove precise remnants from puppet, tune cache stuff, CSS tweaks for crap UI. Triaging old bugs to see which are fixed / invalid / still fixable.
- DB consistencies thingie for Jaime. I owe him one.
Dan
editLast week
edit- Getting back
This week
edit- Start poking at MW-Vagrant jessie base image https://phabricator.wikimedia.org/T136429
- Figure out where we're at with Malu
Mukunda
editLast week
edit- Get the merge-wmf-branch script cleaned up and shared with the team for feedback
- Brainstorm improvements / other ideas around branch merging / cherry-picking
This week
edit- T141278: Decide how ReleaseTaggerBot fits into the brave new world of long-lived-branches https://phabricator.wikimedia.org/T141278
Tyler
editLast week
edit- MW Canary work
This week
editŽeljko
editLast week
edit- trying to do the first SWAT (depending on https://phabricator.wikimedia.org/T140264 MediaWiki deployment shell access request for zfilipin)
- Analyze (and share analysis of) the browser testing feedback survey https://phabricator.wikimedia.org/T139247
- Run language screenshots script for VisualEditor in Jenkins https://phabricator.wikimedia.org/T139613
This week
editVacation