Algorithms for effective querying of compound graph-based pathway databases

Source Title
BMC Bioinformatics
Print ISSN
Electronic ISSN
BioMed Central Ltd.
1 - 16
Journal Title
Journal ISSN
Volume Title

Background: Graph-based pathway ontologies and databases are widely used to represent data about cellular processes. This representation makes it possible to programmatically integrate cellular networks and to investigate them using the well-understood concepts of graph theory in order to predict their structural and dynamic properties. An extension of this graph representation, namely hierarchically structured or compound graphs, in which a member of a biological network may recursively contain a sub-network of a somehow logically similar group of biological objects, provides many additional benefits for analysis of biological pathways, including reduction of complexity by decomposition into distinct components or modules. In this regard, it is essential to effectively query such integrated large compound networks to extract the sub-networks of interest with the help of efficient algorithms and software tools. Results: Towards this goal, we developed a querying framework, along with a number of graph-theoretic algorithms from simple neighborhood queries to shortest paths to feedback loops, that is applicable to all sorts of graph-based pathway databases, from PPIs (protein-protein interactions) to metabolic and signaling pathways. The framework is unique in that it can account for compound or nested structures and ubiquitous entities present in the pathway data. In addition, the queries may be related to each other through "AND" and "OR" operators, and can be recursively organized into a tree, in which the result of one query might be a source and/or target for another, to form more complex queries. The algorithms were implemented within the querying component of a new version of the software tool PATIKAweb (Pathway Analysis Tool for Integration and Knowledge Acquisition) and have proven useful for answering a number of biologically significant questions for large graph-based pathway databases. Conclusion: The PATIKA Project Web site is PATIKAweb version 2.1 is available at © 2009 Dogrusoz et al; licensee BioMed Central Ltd.

Other identifiers
Book Title
Biological networks, Biological objects, Biological pathways, Graph representation, Neighborhood queries, Protein-protein interactions, Signaling pathways, Structural and dynamic properties, Complex networks, Computer software, Database systems, Graph theory, Graphic methods, Proteins, Bioinformatics, Computer interface, Computer program, Conceptual framework, Information processing, Protein database, Biology, Computer graphics, Factual database, Methodology, Protein analysis, Algorithms, Computational biology, Protein interaction mapping, Signal transduction, Software
Published Version (Please cite this version)