Here we describe the Broad GDAC standardized data runs, which aim to produce version-stamped packages representing a frozen snapshot of all TCGA analysis data at a given time. These packages are designed to:
- Be cast in a form amenable to immediate algorithmic analysis (no additional data preparation required).
- Minimize redundant effort across centers and groups in downloading and preparing data for further analysis.
- Enhance provenance and reproducibility.
The standardized data packages may be accessed as described here. More background information on this effort is available in this presentation from the April 2011 TCGA meeting. That material was refined in subsequent presentations on May 12th and May 19th, at the 2011 NCI TSM meeting, and through ongoing discussions with TCGA collaborators.