AllerTOP v.2 - a server for in silico prediction of allergens

Ivan Dimitrov; Ivan Bangov; Darren R. Flower; Irini Doytchinova

doi:10.1007/s00894-014-2278-5

AllerTOP v.2 - a server for in silico prediction of allergens

Ivan Dimitrov, Ivan Bangov, Darren R. Flower, Irini Doytchinova^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance - typically proteins - resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), naïve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP).

Original language	English
Article number	2278
Number of pages	6
Journal	Journal of Molecular Modeling
Volume	20
Issue number	6
DOIs	https://doi.org/10.1007/s00894-014-2278-5
Publication status	Published - 31 May 2014

Bibliographical note

This paper belongs to Topical Collection MIB 2013 (Modeling Interactions in Biomolecules VI).

Funding: Bulgarian Science Fund (Grants DCVNP 02-1/2009 and IO1/7)

Keywords

ACC transformation
allergen prediction
decision tree
e-descriptors
k nearest neighbours
logistic regression
multilayer perceptrone
naïve bayes
random forest

Access to Document

10.1007/s00894-014-2278-5

Cite this

@article{19129216a64a435d9051d5b4cb01c7dd,

title = "AllerTOP v.2 - a server for in silico prediction of allergens",

abstract = "Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance - typically proteins - resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), na{\"i}ve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP).",

keywords = "ACC transformation, allergen prediction, decision tree, e-descriptors, k nearest neighbours, logistic regression, multilayer perceptrone, na{\"i}ve bayes, random forest",

author = "Ivan Dimitrov and Ivan Bangov and Flower, {Darren R.} and Irini Doytchinova",

note = "This paper belongs to Topical Collection MIB 2013 (Modeling Interactions in Biomolecules VI). Funding: Bulgarian Science Fund (Grants DCVNP 02-1/2009 and IO1/7)",

year = "2014",

month = may,

day = "31",

doi = "10.1007/s00894-014-2278-5",

language = "English",

volume = "20",

journal = "Journal of Molecular Modeling",

issn = "1610-2940",

publisher = "Springer",

number = "6",

}

TY - JOUR

T1 - AllerTOP v.2 - a server for in silico prediction of allergens

AU - Dimitrov, Ivan

AU - Bangov, Ivan

AU - Flower, Darren R.

AU - Doytchinova, Irini

N1 - This paper belongs to Topical Collection MIB 2013 (Modeling Interactions in Biomolecules VI). Funding: Bulgarian Science Fund (Grants DCVNP 02-1/2009 and IO1/7)

PY - 2014/5/31

Y1 - 2014/5/31

N2 - Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance - typically proteins - resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), naïve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP).

AB - Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance - typically proteins - resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), naïve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP).

KW - ACC transformation

KW - allergen prediction

KW - decision tree

KW - e-descriptors

KW - k nearest neighbours

KW - logistic regression

KW - multilayer perceptrone

KW - naïve bayes

KW - random forest

UR - http://www.scopus.com/inward/record.url?scp=84901583210&partnerID=8YFLogxK

U2 - 10.1007/s00894-014-2278-5

DO - 10.1007/s00894-014-2278-5

M3 - Article

AN - SCOPUS:84901583210

SN - 1610-2940

VL - 20

JO - Journal of Molecular Modeling

JF - Journal of Molecular Modeling

IS - 6

M1 - 2278

ER -

AllerTOP v.2 - a server for in silico prediction of allergens

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this