A palindromic sequence is a nucleic acid sequence in a double-stranded DNA or RNA molecule whereby reading in a certain direction (e.g. 5' to 3') on one strand is identical to the sequence in the same direction (e.g. 5' to 3') on the complementary strand. This definition of palindrome thus depends on complementary strands being palindromic of each other.
The meaning of palindrome in the context of genetics is slightly different from the definition used for words and sentences. Since a double helix is formed by two paired antiparallel strands of nucleotides that run in opposite directions, and the nucleotides always pair in the same way (adenine (A) with thymine (T) in DNA or uracil (U) in RNA; cytosine (C) with guanine (G)), a (single-stranded) nucleotide sequence is said to be a palindrome if it is equal to its reverse complement. For example, the DNA sequence ACCTAGGT
is palindromic with its nucleotide-by-nucleotide complement TGGATCCA
because reversing the order of the nucleotides in the complement gives the original sequence.
A palindromic nucleotide sequence is capable of forming a hairpin. The stem portion of the hairpin is a pseudo-double stranded portion since the entire hairpin is a part of same (single) strand of nucleic acid. Palindromic motifs are found in most genomes or sets of genetic instructions. They have been specially researched in bacterial chromosomes and in the so-called Bacterial Interspersed Mosaic Elements (BIMEs) scattered over them. In 2008, a genome sequencing project discovered that large portions of the human X and Y chromosomes are arranged as palindromes.[1] A palindromic structure allows the Y chromosome to repair itself by bending over at the middle if one side is damaged.
Palindromes also appear to be found frequently in the peptide sequences that make up proteins,[2][3] but their role in protein function is not clearly known. It has been suggested that the existence of palindromes in peptides might be related to the prevalence of low-complexity regions in proteins, as palindromes are frequently associated with low-complexity sequences. Their prevalence may also be related to the propensity of such sequences to form alpha helices[4] or protein/protein complexes.[5]