Child pages
  • Methylation Preprocessor

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.


For a given gene, we select the probe with the maximum standard deviation across all beta values. Then we discard any probes with a standard deviation below a specified cutoff. The default cutoff is .2, but it can be tuned based on the desired output file size.

Correlation with


expression data

We start by filtering the methylation and expression data even further. We make a list of genes and individuals appearing in each file, and keep only the beta values for genes and individuals that appear in both. This leaves us with multiple methylation probes per gene, and a single measured expression level per gene.


- * : calculating the mean signal values among each gene
- * : picking the probe with the min correlation with clinical mRNAseq data for each gene
- * : picking the probe with the min correlation with mRNAseq data for each gene without probeset id which is used for correlation between clinical and methylation.