Overlapping gene

An overlapping gene (or OLG)[1][2] is a gene whose expressible nucleotide sequence partially overlaps with the expressible nucleotide sequence of another gene.[3] In this way, a single nucleotide sequence may contribute to the function of one or more gene products. Overlapping genes are present in both cellular and viral genomes and are a fundamental feature of them.[2] The definition of an overlapping gene differs significantly between eukaryotes, prokaryotes, and viruses.[2] In prokaryotes and viruses, the overlap must be between coding sequences rather than mRNA transcripts, and two genes are considered to overlap when their coding sequences share at least one nucleotide on either the same or opposite strands. In eukaryotes, gene overlap is almost always defined in terms of mRNA transcripts: two or more genes overlap when at least one nucleotide is shared between the boundaries of their primary mRNA transcripts, such that a DNA base mutation at any point in the overlapping region would affect the transcripts of all genes involved. This definition includes 5′ and 3′ untranslated regions (UTRs) as well as introns.
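The coordinate-based definitions above can be illustrated with a minimal Python sketch. It assumes each gene is reduced to a single interval with 1-based inclusive coordinates and a strand; the `Feature` class, the function names, and all coordinates are hypothetical and chosen only for illustration. Under the prokaryotic and viral definition the intervals would be coding sequences, while under the eukaryotic definition they would be primary transcript boundaries.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """A genomic interval; 1-based inclusive coordinates (an assumption)."""
    name: str
    start: int
    end: int
    strand: str  # "+" or "-"


def shared_nucleotides(a: Feature, b: Feature) -> int:
    """Count genomic positions covered by both features (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def overlap_type(a: Feature, b: Feature) -> str:
    """Classify a pair as 'none', 'same-strand', or 'opposite-strand'."""
    if shared_nucleotides(a, b) == 0:
        return "none"
    return "same-strand" if a.strand == b.strand else "opposite-strand"


# Hypothetical coordinates, for illustration only.
gene_a = Feature("geneA", 100, 400, "+")
gene_b = Feature("geneB", 397, 700, "+")  # shares 4 positions with geneA
gene_c = Feature("geneC", 350, 500, "-")  # opposite strand, shares 51 positions
gene_d = Feature("geneD", 401, 450, "+")  # adjacent to geneA, no shared position

for other in (gene_b, gene_c, gene_d):
    print(other.name, shared_nucleotides(gene_a, other), overlap_type(gene_a, other))
```

A single shared position is enough to count as an overlap in this sketch, which mirrors the "at least one nucleotide" wording of the definitions; real annotations with multiple exons or spliced coding sequences would need a list of intervals per gene rather than one.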

Overprinting refers to a type of overlap in which all or part of the sequence of one gene is read in an alternate reading frame from another gene at the same locus.[4] The alternative open reading frames (ORFs) are thought to be created by critical nucleotide substitutions within an expressible pre-existing gene, which can be induced to express a novel protein while still preserving the function of the original gene.[5] Overprinting has been hypothesized as a mechanism for de novo emergence of new genes from existing sequences, either older genes or previously non-coding regions of the genome.[6] Most overlapping genes are believed to have evolved in part through this mechanism, suggesting that each overlap is composed of one ancestral gene and one novel gene.[7] Overprinting is therefore also believed to be a source of novel proteins, as de novo proteins coded by these novel genes usually lack remote homologs in databases.[8] Overprinted genes are particularly common features of the genomic organization of viruses, where they likely serve to greatly increase the number of potential expressible genes from a small set of viral genetic information.[9] It is likely that overprinting is responsible for the generation of numerous novel proteins by viruses over the course of their evolutionary history.
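The idea of one sequence being read in two reading frames can be shown with a small Python sketch. It scans the three forward frames of a DNA string for open reading frames that run from an ATG start codon to the first in-frame stop codon. The function names and the 18-nucleotide toy sequence are hypothetical; the sequence was constructed for illustration and is not taken from any real genome, and the reverse strand is ignored for brevity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str) -> list[tuple[int, int, int]]:
    """Return (frame, start, end) for each ATG-to-stop ORF on the forward strand.

    Coordinates are 0-based and half-open; `end` includes the stop codon.
    """
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orfs.append((frame, start, i + 3))
                start = None
    return orfs


def codons(seq: str, start: int, end: int) -> list[str]:
    """Split seq[start:end] into consecutive codons."""
    return [seq[i:i + 3] for i in range(start, end, 3)]


# Toy sequence (hypothetical): a frame-0 ORF with a shorter frame-2 ORF nested in it.
toy = "ATGGCATGCCTTAAGTGA"

for frame, start, end in find_orfs(toy):
    print(frame, start, end, codons(toy, start, end))
```

In this toy case the nucleotides at positions 5 to 13 are read as part of the codons GCA, TGC, CTT, and AAG in frame 0, and as ATG, CCT, TAA in frame 2, so a substitution in that region could alter both hypothetical products at once.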

  1. ^ Nelson CW, et al. (1 October 2020). "Dynamically evolving novel overlapping gene as a factor in the SARS-CoV-2 pandemic". eLife. 9. doi:10.7554/eLife.59633. PMC 7655111. PMID 33001029.
  2. ^ a b c Wright BW, Molloy MP, Jaschke PR (5 October 2021). "Overlapping genes in natural and engineered genomes". Nature Reviews Genetics. 23 (3): 154–168. doi:10.1038/s41576-021-00417-w. ISSN 1471-0064. PMC 8490965. PMID 34611352.
  3. ^ Fukuda Y, Tomita M, Washio T (1999). "Comparative study of overlapping genes in the genomes of Mycoplasma genitalium and Mycoplasma pneumoniae". Nucleic Acids Research. 27 (8): 1847–1853. doi:10.1093/nar/27.8.1847. PMC 148392. PMID 10101192.
  4. ^ Pavesi A (26 May 2021). "Origin, Evolution and Stability of Overlapping Genes in Viruses: A Systematic Review". Genes. 12 (6): 809. doi:10.3390/genes12060809. ISSN 2073-4425. PMC 8227390. PMID 34073395.
  5. ^ Normark S, Bergström S, Edlund T, Grundström T, Jaurin B, Lindberg FP, Olsson O (December 1983). "Overlapping Genes". Annual Review of Genetics. 17 (1): 499–525. doi:10.1146/annurev.ge.17.120183.002435. ISSN 0066-4197. PMID 6198955.
  6. ^ Keese PK, Gibbs A (1992). See reference 7.
  7. ^ Keese PK, Gibbs A (15 October 1992). "Origins of genes: "big bang" or continuous creation?". Proceedings of the National Academy of Sciences. 89 (20): 9489–9493. Bibcode:1992PNAS...89.9489K. doi:10.1073/pnas.89.20.9489. ISSN 0027-8424. PMC 50157. PMID 1329098.
  8. ^ Gibbs A, Keese PK (19 October 1995), "In search of the origins of viral genes", Molecular Basis of Virus Evolution, Cambridge University Press, pp. 76–90, doi:10.1017/cbo9780511661686.008, ISBN 978-0-521-45533-6, retrieved 3 December 2021
  9. ^ Pavesi A, Magiorkinis G, Karlin DG, Wilke CO (15 August 2013). "Viral Proteins Originated De Novo by Overprinting Can Be Identified by Codon Usage: Application to the "Gene Nursery" of Deltaretroviruses". PLOS Computational Biology. 9 (8): e1003162. Bibcode:2013PLSCB...9E3162P. doi:10.1371/journal.pcbi.1003162. PMC 3744397. PMID 23966842.