Wikimedia meetings/Tech/Data availability/2010-10-18

XML Snapshots edit

Attendees: Ariel, Tomasz

Last:

  • Did job descriptions

Today:

Low level data dumps work

  • Need it for resuming current jobs that have been interrupted
  • Gives the ability to cat and rejoin jobs
  • Requirement for parallelization of jobs

Tomasz todo:

  • Check in on mail about parrelization

Open Issues edit

  • Need someone to find out why we can't use pigz
  • Create a central auth dump 25579
  • Oddness on fetching really early revisions