A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides

Xue Ling Li, Shu Lin Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

In this study, we compare six feature selection methods, i.e. five feature selection methods for k Nearest Neighborhood regression (kNNReg) and a rough set model based forward feature selection (FARNeM) for Support Vector Regression (SVR) for predicting the affinity of TAP binding peptides. The peptides were represented with binary, sequence associated amino acid properties, and binary plus properties of amino acids derived vectors, respectively. The weighted peptide features are input to the regression model and ranked according to the corresponding weights or the occurrence frequency, respectively. We find that SVR model performs better than kNNReg model for the prediction of the affinity of TAP transporter binding peptides.

Original languageEnglish (US)
Title of host publicationAdvanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings
Pages69-75
Number of pages7
Volume6216 LNAI
DOIs
StatePublished - 2010
Externally publishedYes
Event6th International Conference on Intelligent Computing, ICIC 2010 - Changsha, China
Duration: Aug 18 2010Aug 21 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6216 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other6th International Conference on Intelligent Computing, ICIC 2010
CountryChina
CityChangsha
Period8/18/108/21/10

Fingerprint

Peptides
Feature Selection
Comparative Study
Affine transformation
Feature extraction
Regression
Regression Model
Support Vector Regression
Amino Acids
Amino acids
Binary sequences
Binary Sequences
Rough Set
Model-based
Binary
Prediction

Keywords

  • feature selection
  • k-nearest neighborhood regression
  • neighborhood rough set model
  • peptides
  • support vector regression
  • Transporter associated with antigen processing

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Li, X. L., & Wang, S. L. (2010). A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides. In Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings (Vol. 6216 LNAI, pp. 69-75). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6216 LNAI). https://doi.org/10.1007/978-3-642-14932-0_9

A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides. / Li, Xue Ling; Wang, Shu Lin.

Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. Vol. 6216 LNAI 2010. p. 69-75 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6216 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Li, XL & Wang, SL 2010, A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides. in Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. vol. 6216 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6216 LNAI, pp. 69-75, 6th International Conference on Intelligent Computing, ICIC 2010, Changsha, China, 8/18/10. https://doi.org/10.1007/978-3-642-14932-0_9
Li XL, Wang SL. A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides. In Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. Vol. 6216 LNAI. 2010. p. 69-75. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-14932-0_9
Li, Xue Ling ; Wang, Shu Lin. / A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides. Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings. Vol. 6216 LNAI 2010. pp. 69-75 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{7d175f8a2d60419aba41d714720f8967,
title = "A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides",
abstract = "In this study, we compare six feature selection methods, i.e. five feature selection methods for k Nearest Neighborhood regression (kNNReg) and a rough set model based forward feature selection (FARNeM) for Support Vector Regression (SVR) for predicting the affinity of TAP binding peptides. The peptides were represented with binary, sequence associated amino acid properties, and binary plus properties of amino acids derived vectors, respectively. The weighted peptide features are input to the regression model and ranked according to the corresponding weights or the occurrence frequency, respectively. We find that SVR model performs better than kNNReg model for the prediction of the affinity of TAP transporter binding peptides.",
keywords = "feature selection, k-nearest neighborhood regression, neighborhood rough set model, peptides, support vector regression, Transporter associated with antigen processing",
author = "Li, {Xue Ling} and Wang, {Shu Lin}",
year = "2010",
doi = "10.1007/978-3-642-14932-0_9",
language = "English (US)",
isbn = "3642149316",
volume = "6216 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "69--75",
booktitle = "Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings",

}

TY - GEN

T1 - A comparative study on feature selection in regression for predicting the affinity of TAP binding peptides

AU - Li, Xue Ling

AU - Wang, Shu Lin

PY - 2010

Y1 - 2010

N2 - In this study, we compare six feature selection methods, i.e. five feature selection methods for k Nearest Neighborhood regression (kNNReg) and a rough set model based forward feature selection (FARNeM) for Support Vector Regression (SVR) for predicting the affinity of TAP binding peptides. The peptides were represented with binary, sequence associated amino acid properties, and binary plus properties of amino acids derived vectors, respectively. The weighted peptide features are input to the regression model and ranked according to the corresponding weights or the occurrence frequency, respectively. We find that SVR model performs better than kNNReg model for the prediction of the affinity of TAP transporter binding peptides.

AB - In this study, we compare six feature selection methods, i.e. five feature selection methods for k Nearest Neighborhood regression (kNNReg) and a rough set model based forward feature selection (FARNeM) for Support Vector Regression (SVR) for predicting the affinity of TAP binding peptides. The peptides were represented with binary, sequence associated amino acid properties, and binary plus properties of amino acids derived vectors, respectively. The weighted peptide features are input to the regression model and ranked according to the corresponding weights or the occurrence frequency, respectively. We find that SVR model performs better than kNNReg model for the prediction of the affinity of TAP transporter binding peptides.

KW - feature selection

KW - k-nearest neighborhood regression

KW - neighborhood rough set model

KW - peptides

KW - support vector regression

KW - Transporter associated with antigen processing

UR - http://www.scopus.com/inward/record.url?scp=77956165779&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77956165779&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-14932-0_9

DO - 10.1007/978-3-642-14932-0_9

M3 - Conference contribution

SN - 3642149316

SN - 9783642149313

VL - 6216 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 69

EP - 75

BT - Advanced Intelligent Computing Theories and Applications: With Aspects of Artificial Intelligence - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings

ER -