シンポジウムセッション
第2日 6月11日(火) 9:55~10:15 B会場(中ホール200)
- 2B-S-0955
タンデムマススペクトル機械学習による脂質クラス推定
(1農工大院生命工・ 2理研IMS・ 3慶大院薬・ 4横市大院生命医)
o坂本七海1・ 岡昂輝1・ 松沢佑紀1・ 西田孝三1・ 堀あや2・ 有田誠2,3,4・ 津川裕司1,2,4
Untargeted lipidomics using collision-induced dissociation based tandem mass spectrometry (CID-MS/MS) is an essential technique in biology and clinical application. However, the annotation confidence is still guaranteed by the manual curation of analytical chemists although various software tools have been developed for the automatic spectral processing based on the rule-based fragment annotations. In this study, we developed MS2lipid, a machine learning model, to predict lipid subclasses learning 82,455 curated spectra obtained from 82 datasets. MS2lipid achieved over 94.6% accuracy for 97 lipid subclasses. Moreover, our program outperformed the accuracy of CANOPAS ontology prediction in which ours provided 35.9% higher value of F1 score on average. Furthermore, the MS2lipid program offers over 87% accuracy on average for spectra acquired by different curators and different MS techniques. Furthermore, the function of MS2lipid was showcased by the annotation of novel esterified bile acids in combination with molecular spectrum networking. Consequently, our machine learning model provides an independent criterion for lipid subclass classification in addition to an environment for annotating lipid metabolites that have been previously unknowns.