Open main menu


Joined 6 June 2008

I am one of those dreaded "ops devs" doing "devops", whatever that means. But basically I write code. I work primarily on the XML dumps infrastructure. I'm very interested in anything that touches on translation of content or interfaces, and anything that impacts the multilingual reach or the Wikimedia projects or facilitates communication between the various language communities of the projects, but this is out of the scope of my WMF work.

If you want to reach me quickly, look for me on irc in #wikitech-l with the user name atglenn or apergos. Timezone: EET. If you want to reach me the slow way send an email to user name ariel with domain name (Grrr, spammers!)

All things dump-related that I'd love to see move forward:

  • Excerpts of the dumps in various formats from specific projects. Wiktionary is a popular request.
  • Repackaging the dumps as multiple bz2 files contatenated togather with a few pages per file ("multistream bz2").
  • Re-use of the above multi-stream files with some clever scripting for off-line viewing.
  • Process equivalent to the current rsync of our image server, after images move to Swift.
  • Maintained cross-platform easy-to-use tool for converting XML dumps to MySQL for import (eg mwimport).
  • ...? Other cool dump-related ideas?




Committed identity: 87e597e49a6ac3f091f72b7659baf56c16d8db46bcc6680387c90ab4e8fdaf751868517e21b1f34cf2093d1c6e7e14687cb3ecd7e8104b1a5cddeeabf5d6e11a is a SHA-512 commitment to this user's real-life identity.