An integrative framework for clinical diagnosis and knowledge discovery from exome sequencing data

buir.contributor.authorAlkan, Can
buir.contributor.orcidAlkan, Can|0000-0002-5443-0706
dc.citation.epage107810-8en_US
dc.citation.spage107810-1
dc.citation.volumeNumber169
dc.contributor.authorShojaei, Mona
dc.contributor.authorMohammadvand, Navid
dc.contributor.authorDogan, Tunca
dc.contributor.authorAlkan, Can
dc.contributor.authorAtalay, Rengül Çetin
dc.contributor.authorAcar, Aybar C.
dc.date.accessioned2024-03-15T11:45:34Z
dc.date.available2024-03-15T11:45:34Z
dc.date.issued2024-02
dc.departmentDepartment of Computer Engineering
dc.description.abstractNon-silent single nucleotide genetic variants, like nonsense changes and insertion-deletion variants, that affect protein function and length substantially are prevalent and are frequently misclassified. The low sensitivity and specificity of existing variant effect predictors for nonsense and indel variations restrict their use in clinical applications. We propose the Pathogenic Mutation Prediction (PMPred) method to predict the pathogenicity of single nucleotide variations, which impair protein function by prematurely terminating a protein's elongation during its synthesis. The prediction starts by monitoring functional effects (Gene Ontology annotation changes) of the change in sequence, using an existing ensemble machine learning model (UniGOPred). This, in turn, reveals the mutations that significantly deviate functionally from the wild-type sequence. We have identified novel harmful mutations in patient data and present them as motivating case studies. We also show that our method has increased sensitivity and specificity compared to state-of-the-art, especially in single nucleotide variations that produce large functional changes in the final protein. As further validation, we have done a comparative docking study on such a variation that is misclassified by existing methods and, using the altered binding affinities, show how PMPred can correctly predict the pathogenicity when other tools miss it. PMPred is freely accessible as a web service at https://pmpred.kansil.org/, and the related code is available at https://github.com/kansil/PMPred
dc.description.provenanceMade available in DSpace on 2024-03-15T11:45:34Z (GMT). No. of bitstreams: 1 An_integrative_framework_for_clinical_diagnosis_and_knowledge_discovery_from_exome_sequencing_data.pdf: 2466682 bytes, checksum: be419b1e6bd5e873f8f1c60d36d3e737 (MD5) Previous issue date: 2024-02en
dc.identifier.doi10.1016/j.compbiomed.2023.107810
dc.identifier.eissn1879-0534
dc.identifier.issn0010-4825
dc.identifier.urihttps://hdl.handle.net/11693/114802
dc.language.isoen
dc.relation.isversionofhttps://dx.doi.org/10.1016/j.compbiomed.2023.107810
dc.rightsCC BY 4.0 DEED (Attribution 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.source.titleComputers in Biology and Medicine
dc.subjectVariant pathogenicity prediction
dc.subjectGene ontology
dc.subjectPearson correlation
dc.subjectProtein sequence
dc.subjectMutation
dc.subjectInsertion-deletion variants
dc.titleAn integrative framework for clinical diagnosis and knowledge discovery from exome sequencing data
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
An_integrative_framework_for_clinical_diagnosis_and_knowledge_discovery_from_exome_sequencing_data.pdf
Size:
2.35 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.01 KB
Format:
Item-specific license agreed upon to submission
Description: