By surveying all validated SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6 or 8 bp apart are more frequent than SNPs 3, 5, 7 or 9 bp apart. This holds even when we correct for nucleotide frequencies and site dependencies in nucleotide usage in the genome. The observed pattern is not restricted to any of the genomic regions that might give rise to sequencing or alignment errors, namely transposable elements (SINE, LINE and LTR), tandem repeats and large duplicated regions. However, we can define periodic DNA, which captures virtually the entire pattern. Periodic DNA is defined as short DNA sequences (16.9 bp average length) with a high degree of periodicity in nucleotide usage. Periodic DNA is widely distributed in the genome, underrepresented in...