Wikimedia Release Engineering Team/DataDataData Sync Up/2019-05-21

2019-05-21 edit

Phab task edit

Last time edit

  • Previous meeting was long ago...

Today's Agenda edit

    • Vacation and not much movement
    • Review requirements
    • Go over email draft

What data we have currently or are planning to collect edit

  • Schema
  • Data samples

How we might want to query that data edit

  • Our data is highly structured (see schemas)
    • Is Hadoop or ES more appropriate for that? Would we lose structure by putting it in Hadoop?
    • How much do we have to know about how data's structure before we put it in ES?
      • Can relationships/schema be changed after data is stored?

TODOs (by next meeting) edit

  • Dan to send email to Analytics and set up meeting