Wikimedia Release Engineering Team/Checkin archive/20180122
Pad for the Release Engineering Team weekly meeting Archive at: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Checkin_archive
2018-01-22
editVacations/Important dates
edit- Feb 2nd: Mukunda vacation day
- Feb 9th: thcipriani half-day
- Feb 19 (Mon): President's Day (All US Staff)
- Mar 30 (Fri): WMF Holiday
- April 14 (Fri): WMF Holiday
- May 15?/16/17: Team offsite in Barcelona
- May 18-20: Wikimedia Hackathon in Barcelona
- May 21 (Mon): Tech-Mgt F2F
Team Business
editRotating positions and absences
editManiphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R
Jan 15 and Jan 22
edit- Train: Tyler
- SoS: Mukunda
- Out
- Jan 15 (Mon): Martin Luther King Day (All US Staff)
- Jan 22/23: Dev Summit
- Jan 24: Tech Management F2F
- Jan 25/26: WMF All Hands
Jan 29 and Feb 05
edit
Feb 12 and Feb 19
editActions from last meeting
editScrum of Scrums
edit- Greg to copy to etherpad after meeting: https://etherpad.wikimedia.org/p/Scrum-of-Scrums
This week
editRelease Engineering
edit- Blocking
- None?
- Blocked
- Ops: "Stack overflow when Redis is down" - https://phabricator.wikimedia.org/T185055
- Would like some guidance from Ops on the documentation bits Gergo identifies
- Ops: "Stack overflow when Redis is down" - https://phabricator.wikimedia.org/T185055
- Updates
Last week
edit- Blocking
- None?
- Blocked
- "Stack overflow when Redis is down" - https://phabricator.wikimedia.org/T185055
- Need help from Operations and/or Performance
- "Stack overflow when Redis is down" - https://phabricator.wikimedia.org/T185055
- Updates
- Catching up the train this week and rolling out the last version before DevSummit/All Hands and RelEng team offsite weeks. [wiki[email]]
- We moved Wednesday morning’s SWAT window 1 hour earlier (to 10am) to give us an hour break before the new MW version rolls to second set of wikis (all non-wikipedias) which was a follow-up from a recent post-mortem. [wiki][email]
- We broke git-fat deploy repos in scap (old config no longer valid), workaround/fix available in all relevant repos.
- https://phabricator.wikimedia.org/T184882#3899710
- (Yes, we’re re-doing how the CI for scap is done, see: https://phabricator.wikimedia.org/T184628 )
- Updated the Debian packaging for Zuul (CI task scheduler) and released 2.5.0-8-gcbc7f62-wmf6, unblocking an upgrade of Gerrit.
- Converted our home-grown docker image builder to `docker-pkg` from Giuseppe
- Getting started with the basics of planning our team offsite pre Barcelona Hackathon. Submitt
Puppet SWAT
edit- list of patches you want to submit to Puppet SWAT
Logspam \ Last week's train updates
edit
Other Team Business
edit- Offsite schedule:
Q3 goal/project check-in
edit- All of it in table form: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q3
Quarterly Goals
editProgram 1: Outcome 5: Milestone 1: Develop and migrate to a JavaScript-based browser testing stack
edit- Due: End of this quarter
- What: Specific improvements to the now canonical framework, see: task T182421, notably:
- Upgrade webdriverIO to version 4.9
- Investigate replacing nodemw with mwbot
- Video recording for Selenium tests in Node.js
- Task: task T182421
- T175179 Create selenium-CirrusSearch-jessie daily Jenkins job
- Done.
Program 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure
edit- Goals
- Draft requirements for a Kubernetes based solution for CI - task T183513
- Migrate MediaWiki PHPUnit tests to Shipyard (docker-based CI) (~40% of Nodepool usage) - task T183512
- Unify production and CI docker image build process - task T177276
- Done 01/15
- Lack of progress due to Antoine oddities in personal schedule. Requires MySQL backend which is not so trivial to handle.
Program 3: Outcome 1: Objective 2: Identify and find stewards for high-priority/high use code segment orphans
edit- Due: End of quarter
- task T174091
- Meeting with Toby and Victoria scheduled from Feb1
Program 3: Outcome 2: Objective 2: Define and implement a process to regularly address technical debt across the Foundation
edit- Due: End of quarter
- task T174095
- started investigating approaches to syncing up the various tech debt activities across WMF.
Program 3: Outcome 2: Objective 3: Promote and surface important technical debt topics at large gatherings of Wikimedia developers (e.g., DevSummit and Hackathon(s))
edit- Due: End of next quarter
- task T174096
- no progress.
Program 6: Outcome 2: Objective 2: Set up a continuous integration and deployment pipeline
edit- Due: End of this quarter
- Keyword: SSD
- phab project: https://phabricator.wikimedia.org/project/view/2453/
- Goal:
- Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
- Functional PoC within integration in the deployment-pipeline
- Deploy to isolated k8s
- Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
- Helm test is working with service-checker
- Minikube packaging still in process: https://github.com/thcipriani/releng-debs-minikube-stuff
Quaterly non-goal "Work"
editProgram 1: Outcome 1: Objective 1: Scap (Tech Debt Sprint FY201718-Q2)
editProgram 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure
editProgram 1: Outcome 6: Milestone 1: Maintain Gerrit
editProgram 1: Outcome 6: Milestone 2: Maintain Phabricator
edit- Streamline logspam workflows by adding some integration with phabricator
- Store git-lfs (and other phab uploads) in swift: task T182085
- Phabricator upgrade happened without complications. Security bug fixed:
- We now have translatewiki translations in production Phabricator and a currently manual process for updating the strings:
Program 1: Outcome 5: Objective 1: MW Nightlies server
editOther work
edit*Legacy Encoding Train Blocker port mortem meeting scheduled. *Three stewardship review items have been submitted. Greg and I to review tomorrow.
Grooming
editTeam Kanban Board Review and Triage
edit- closed and touched in the 7 days
- No update for 4 weeks
- No update for 3 weeks
- No update for 2 weeks
- No update for 1 week
- All Open
- Review To Triage column of #releng
Once / month-ish review of backlog(s)
edit- releng Review To Triage column of #releng
- releng-kanban Review unassigned in kanban
- releng-kanban Review 'backlog' colum of -kanban
- releng-next - Review for things we need to put on our kanban backlog
- releng-backlog - oh my, the huge backlog of things...