Tidy replacement: With latest HTML5Depurate and visual diff test run, ~93.4% of test pages render with pixel perfect accuracy. Working through remaining diff scenarios (a lot of the diffs are harmless minor pixel shifts) to identify real issues and having wikitext be fixed or fix core parser as required.
Tim working on a HTML5 parser to fix some core parser issues around doBlockLevels.
Will attempt switching Parsoid to service-runner sometime next week (Arlo & Services)
Arlo close to getting html2html endpoint done (last remaining blocker to enable Parsoid new HTML version with separate data-mw)
VE & CX: Please start planning how to handle this new version.
Kunal working on using the new Linker code for files/images.
Loaded 3 months of pageview data into Druid and querying it is very fast
Cleaning up limn-flow-data, limn-edit-data, and limn-languge-data a little bit, deploying today
Working on processing data from the mediawiki databases and turning it into an analytics-friendly schema (one example is slowly changing dimensions such as page_title recorded as (page_id, page_title, valid_from, valid_to))
pageview requests that miss the cache are rate-limited to 10 req/s. More req/s than that will throw 429