A novel model-based method for feature extraction from protein sequences for classification

Date
2006
Editor(s)
Advisor
Supervisor
Co-Advisor
Co-Supervisor
Instructor
Source Title
Proceedings of the IEEE 14th Signal Processing and Communications Applications Conference, SIU 2006
Print ISSN
Electronic ISSN
Publisher
IEEE
Volume
2006
Issue
Pages
Language
Turkish
Journal Title
Journal ISSN
Volume Title
Series
Abstract

Representation of amino-acid sequences constitutes the key point in classification of proteins into functional or structural classes. The representation should contain the biologically meaningful information hidden in the primary sequence of the protein. Conserved or similar subsequences are strong indicators of functional and structural similarity. In this study we present a feature mapping that takes into account the models of the subsequences of protein sequences. An expectation-maximization algorithm along with an HMM mixture model is used to cluster and learn the models of subsequences of a given set of proteins.

Course
Other identifiers
Book Title
Citation
Published Version (Please cite this version)