Pathway analysis

Pathway resources and types of pathway analysis using databases like KEGG, Reactome and WikiPathways.[1]

Pathway is the term from molecular biology for a curated schematic representation of a well characterized segment of the molecular physiological machinery, such as a metabolic pathway describing an enzymatic process within a cell or tissue or a signaling pathway model representing a regulatory process that might, in its turn, enable a metabolic or another regulatory process downstream. A typical pathway model starts with an extracellular signaling molecule that activates a specific receptor, thus triggering a chain of molecular interactions.[2] A pathway is most often represented as a relatively small graph with gene, protein, and/or small molecule nodes connected by edges of known functional relations. While a simpler pathway might appear as a chain,[3] complex pathway topologies with loops and alternative routes are much more common. Computational analyses employ special formats of pathway representation.[4][5] In the simplest form, however, a pathway might be represented as a list of member molecules with order and relations unspecified. Such a representation, generally called Functional Gene Set (FGS), can also refer to other functionally characterised groups such as protein families, Gene Ontology (GO) and Disease Ontology (DO) terms etc. In bioinformatics, methods of pathway analysis might be used to identify key genes/ proteins within a previously known pathway in relation to a particular experiment / pathological condition or building a pathway de novo from proteins that have been identified as key affected elements. By examining changes in e.g. gene expression in a pathway, its biological activity can be explored. However most frequently, pathway analysis refers to a method of initial characterization and interpretation of an experimental (or pathological) condition that was studied with omics tools or genome-wide association study.[6] Such studies might identify long lists of altered genes. A visual inspection is then challenging and the information is hard to summarize, since the altered genes map to a broad range of pathways, processes, and molecular functions (with a large gene fraction lacking any annotation). In such situations, the most productive way of exploring the list is to identify enrichment of specific FGSs in it. The general approach of enrichment analyses is to identify FGSs, members of which were most frequently or most strongly altered in the given condition, in comparison to a gene set sampled by chance. In other words, enrichment can map canonical prior knowledge structured in the form of FGSs to the condition represented by altered genes.

  1. ^ Mubeen S, Hoyt CT, Gemünd A, Hofmann-Apitius M, Fröhlich H, Domingo-Fernández D (2019). "The Impact of Pathway Database Choice on Statistical Enrichment Analysis and Predictive Modeling". Frontiers in Genetics. 10: 1203. doi:10.3389/fgene.2019.01203. PMC 6883970. PMID 31824580.
  2. ^ Berg JM, Tymoczko JL, Stryer L (2002). Biochemistry (5th ed.). New York: W.H. Freeman. ISBN 978-0-7167-3051-4.
  3. ^ Ohlrogge J, Browse J (July 1995). "Lipid biosynthesis". The Plant Cell. 7 (7): 957–70. doi:10.1105/tpc.7.7.957. PMC 160893. PMID 7640528. S2CID 219201001.
  4. ^ "Main Page - SBML.caltech.edu". sbml.org.
  5. ^ "KGML (KEGG Markup Language)". www.genome.jp.
  6. ^ García-Campos MA, Espinal-Enríquez J, Hernández-Lemus E (2015). "Pathway Analysis: State of the Art". Frontiers in Physiology. 6: 383. doi:10.3389/fphys.2015.00383. PMC 4681784. PMID 26733877.