DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts to repress gene transcription. In mammals, DNA methylation is essential for normal development and is associated with a number of key processes including genomic imprinting, X-chromosome inactivation, repression of transposable elements, aging, and carcinogenesis.
As of 2016, two nucleobases have been found on which natural, enzymatic DNA methylation takes place: adenine and cytosine. The modified bases are N6-methyladenine,[1]5-methylcytosine[2] and N4-methylcytosine.[3]
Methylation of cytosine to form 5-methylcytosine occurs at the same 5 position on the pyrimidine ring where the DNA base thymine's methyl group is located; the same position distinguishes thymine from the analogous RNA base uracil, which has no methyl group. Spontaneous deamination of 5-methylcytosine converts it to thymine. This results in a T:G mismatch. Repair mechanisms then correct it back to the original C:G pair; alternatively, they may substitute A for G, turning the original C:G pair into a T:A pair, effectively changing a base and introducing a mutation. This misincorporated base will not be corrected during DNA replication as thymine is a DNA base. If the mismatch is not repaired and the cell enters the cell cycle the strand carrying the T will be complemented by an A in one of the daughter cells, such that the mutation becomes permanent. The near-universal use of thymine exclusively in DNA and uracil exclusively in RNA may have evolved as an error-control mechanism, to facilitate the removal of uracils generated by the spontaneous deamination of cytosine.[12] DNA methylation as well as many of its contemporary DNA methyltransferases have been thought to evolve from early world primitive RNA methylation activity and is supported by several lines of evidence.[13]
In plants and other organisms, DNA methylation is found in three different sequence contexts: CG (or CpG), CHG or CHH (where H correspond to A, T or C). In mammals however, DNA methylation is almost exclusively found in CpG dinucleotides, with the cytosines on both strands being usually methylated. Non-CpG methylation can however be observed in embryonic stem cells,[14][15][16] and has also been indicated in neural development.[17] Furthermore, non-CpG methylation has also been observed in hematopoietic progenitor cells, and it occurred mainly in a CpApC sequence context.[18]
^D. B. Dunn, J. D. Smith: "The occurrence of 6-methylaminopurine in deoxyribonucleic acids". In: Biochem J. 68(4), Apr 1958, S. 627–636. PMID 13522672. PMC1200409.
^B. F. Vanyushin, S. G. Tkacheva, A. N. Belozersky: "Rare bases in animal DNA". In: Nature. 225, 1970, S. 948–949. PMID4391887.
^Melanie Ehrlich, Miguel A. Gama-Sosa, Laura H. Carreira, Lars G. Ljungdahl, Kenneth C. Kuo, Charles W. Gehrke: "DNA methylation in thermophilic bacteria: N6-methylcytosine, 5-methylcytosine, and N6-methyladenine." In: Nucleic Acids Research. 13, 1985, S. 1399. PMID 4000939. PMC341080.
^Hu CW, Chen JL, Hsu YW, Yen CC, Chao MR (January 2015). "Trace analysis of methylated and hydroxymethylated cytosines in DNA by isotope-dilution LC-MS/MS: first evidence of DNA methylation in Caenorhabditis elegans". The Biochemical Journal. 465 (1): 39–47. doi:10.1042/bj20140844. PMID25299492.
^Bird A (December 2001). "Molecular biology. Methylation talk between histones and DNA". Science's Compass. Science. 294 (5549): 2113–2115. doi:10.1126/science.1066726. hdl:1842/464. PMID11739943. S2CID82947750. As a result of this process, known as repeat-induced point mutation (RIP), the wild-type Neurospora genome contains a small fraction of methylated DNA, the majority of the DNA remaining nonmethylated.
^Dodge JE, Ramsahoye BH, Wo ZG, Okano M, Li E (May 2002). "De novo methylation of MMLV provirus in embryonic stem cells: CpG versus non-CpG methylation". Gene. 289 (1–2): 41–48. doi:10.1016/S0378-1119(02)00469-9. PMID12036582.