In molecular biology, the TATA box (also called the Goldberg–Hogness box)[1] is a sequence of DNA found in the core promoter region of genes in archaea and eukaryotes.[2] The bacterial homolog of the TATA box is called the Pribnow box which has a shorter consensus sequence.
The TATA box is considered a non-coding DNA sequence (also known as a cis-regulatory element). It was termed the "TATA box" as it contains a consensus sequence characterized by repeating T and A base pairs.[3] How the term "box" originated is unclear. In the 1980s, while investigating nucleotide sequences in mouse genome loci, the Hogness box sequence was found and "boxed in" at the -31 position.[4] When consensus nucleotides and alternative ones were compared, homologous regions were "boxed" by the researchers.[4] The boxing in of sequences sheds light on the origin of the term "box".
The TATA box was first identified in 1978[1] as a component of eukaryotic promoters. Transcription is initiated at the TATA box in TATA-containing genes. The TATA box is the binding site of the TATA-binding protein (TBP) and other transcription factors in some eukaryotic genes. Gene transcription by RNA polymerase II depends on the regulation of the core promoter by long-range regulatory elements such as enhancers and silencers.[5] Without proper regulation of transcription, eukaryotic organisms would not be able to properly respond to their environment.
Based on the sequence and mechanism of TATA box initiation, mutations such as insertions, deletions, and point mutations to this consensus sequence can result in phenotypic changes. These phenotypic changes can then turn into a disease phenotype. Some diseases associated with mutations in the TATA box include gastric cancer, spinocerebellar ataxia, Huntington's disease, blindness, β-thalassemia, immunosuppression, Gilbert's syndrome, and HIV-1. The TATA-binding protein (TBP) could also be targeted by viruses as a means of viral transcription.[6]
:4
was invoked but never defined (see the help page).:16
was invoked but never defined (see the help page).