TY - JOUR
AU - Breimann, Stephan
AU - Kamp, Frits
AU - Steiner, Harald
AU - Frishman, Dmitrij
TI - AAontology: An Ontology of Amino Acid Scales for Interpretable Machine Learning
JO - Journal of molecular biology
VL - 436
IS - 19
SN - 0022-2836
CY - Amsterdam [u.a.]
PB - Elsevier
M1 - DZNE-2024-01015
SP - 168717
PY - 2024
AB - Amino acid scales are crucial for protein prediction tasks, many of them being curated in the AAindex database. Despite various clustering attempts to organize them and to better understand their relationships, these approaches lack the fine-grained classification necessary for satisfactory interpretability in many protein prediction problems. To address this issue, we developed AAontology-a two-level classification for 586 amino acid scales (mainly from AAindex) together with an in-depth analysis of their relations-using bag-of-word-based classification, clustering, and manual refinement over multiple iterations. AAontology organizes physicochemical scales into 8 categories and 67 subcategories, enhancing the interpretability of scale-based machine learning methods in protein bioinformatics. Thereby it enables researchers to gain a deeper biological insight. We anticipate that AAontology will be a building block to link amino acid properties with protein function and dysfunctions as well as aid informed decision-making in mutation analysis or protein drug design.
KW - Machine Learning
KW - Amino Acids: chemistry
KW - Computational Biology: methods
KW - Proteins: chemistry
KW - Proteins: metabolism
KW - Databases, Protein
KW - Cluster Analysis
LB - PUB:(DE-HGF)16
C6 - pmid:39053689
DO - DOI:10.1016/j.jmb.2024.168717
UR - https://pub.dzne.de/record/271147
ER -