Countless statistical methods have been described for the analysis of DNA microarrays, and each yields distinct results. This raises the question whether DNA microarrays are robust diagnostic tools.

In order to address this issue, we compared five formally similar statistical tests for gene selection on a single data set derived from acute leukemia patients. Inter-test agreement of gene selection, of sample classification and with standard clinical diagnosis was calculated using Cohen's κ-score.

The inter-test agreement scores were 0.15 < κ < 0.68 for gene selection, and 0.60 < κ < 0.89 for sample classification.... |