Paper
15 October 2012 Optimization of short amino acid sequences classifier
Aleksy Barcz, Zbigniew Szymański
Author Affiliations +
Proceedings Volume 8454, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2012; 84541U (2012) https://doi.org/10.1117/12.2000243
Event: Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2012, 2012, Wilga, Poland
Abstract
This article describes processing methods used for short amino acid sequences classification. The data processed are 9-symbols string representations of amino acid sequences, divided into 49 data sets - each one containing samples labeled as reacting or not with given enzyme. The goal of the classification is to determine for a single enzyme, whether an amino acid sequence would react with it or not. Each data set is processed separately. Feature selection is performed to reduce the number of dimensions for each data set. The method used for feature selection consists of two phases. During the first phase, significant positions are selected using Classification and Regression Trees. Afterwards, symbols appearing at the selected positions are substituted with numeric values of amino acid properties taken from the AAindex database. In the second phase the new set of features is reduced using a correlation-based ranking formula and Gram-Schmidt orthogonalization. Finally, the preprocessed data is used for training LS-SVM classifiers. SPDE, an evolutionary algorithm, is used to obtain optimal hyperparameters for the LS-SVM classifier, such as error penalty parameter C and kernel-specific hyperparameters. A simple score penalty is used to adapt the SPDE algorithm to the task of selecting classifiers with best performance measures values.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Aleksy Barcz and Zbigniew Szymański "Optimization of short amino acid sequences classifier", Proc. SPIE 8454, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2012, 84541U (15 October 2012); https://doi.org/10.1117/12.2000243
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature selection

Data processing

Evolutionary algorithms

Databases

Detection and tracking algorithms

Phase measurement

Precision measurement

Back to Top