Prediction of protein subcellular localization based on primary sequence data

Date

2003

Authors

Özarar, M.
Atalay, V.
Atalay, R. Ç.

Source Title

Lecture Notes in Computer Science

Print ISSN

0302-9743

Publisher

Springer-Verlag Berlin

Volume

2869

Pages

611-618

Language

English

Abstract

This paper describes P2SL (prediction of protein subcellular localization), a system that predicts the subcellular localization of proteins in eukaryotic organisms from the amino acid content of their primary sequences, taking amino acid order into account. Our approach is to find the most frequent motifs for each protein class by clustering and then to use these motifs as features for classification. This makes the classification independent of sequence length. The approach also provides a means to perform reverse analysis and to extract rules. More importantly, we describe a new encoding scheme for amino acids that conserves biological function, based on the point accepted mutation (PAM) substitution matrix. We present preliminary results of our system on a two-class (dichotomy) classifier; with some modifications, it can be extended to multiple classes. © Springer-Verlag Berlin Heidelberg 2003.
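
The idea of using frequent motifs as length-independent features can be illustrated with a minimal sketch. This is not the P2SL implementation: it assumes exact-match counting of fixed-length windows in place of the clustering step, omits the PAM-based encoding entirely, and uses made-up toy sequences. The function names (`windows`, `top_motifs`, `featurize`) and the parameters `k` and `n` are hypothetical.

```python
from collections import Counter

def windows(seq, k):
    """Yield every length-k substring (candidate motif) of seq."""
    return (seq[i:i + k] for i in range(len(seq) - k + 1))

def top_motifs(sequences, k=3, n=5):
    """Return the n most frequent length-k motifs across one class.

    Assumption: plain counting stands in for the clustering step
    described in the abstract.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(windows(seq, k))
    return [motif for motif, _ in counts.most_common(n)]

def featurize(seq, motifs, k=3):
    """Fixed-length vector: fraction of windows in seq matching each motif."""
    counts = Counter(windows(seq, k))
    total = max(sum(counts.values()), 1)  # avoid division by zero
    return [counts[m] / total for m in motifs]

# Toy, made-up sequences for two hypothetical localization classes.
class_a = ["MKKLLAAKKLL", "AKKLLMKKLA"]
class_b = ["GSSGSSTTGSS", "TTGSSGSSG"]

# The feature set is the union of each class's most frequent motifs.
motifs = top_motifs(class_a, n=2) + top_motifs(class_b, n=2)

# Sequences of different lengths map to vectors of the same length.
short_vec = featurize("MKKLL", motifs)
long_vec = featurize("MKKLLGSSGSSAKKLLTTGSS", motifs)
```

Because each vector has one entry per selected motif, its size depends only on the number of motifs and not on the sequence length, which is the property the abstract highlights. Any two-class classifier could then be trained on such vectors.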
