Proper integration of feature subsets boosts GO subcellular localization predictions

  • Flavio Ezequiel Spetale Cifasis-Conicet
  • Elizabeth Tapia
  • Javier Murillo
  • Flavia Krsticevic
  • Sergio Ponce
  • Laura Angelone
  • Pilar Bulacio

Abstract

Prediction of multiple subcellular localizations in proteins brings relevant information for biologicalfunction discovery. The use of computational methods based on knowledge can be a helpful starting point forguiding the costly experimental validation. In this work, we present a multilabel classifier framework to performGene Ontology - Cellular Component prediction focused on the improvement of two design aspects: i) the proteinsequence characterization, regarding biological knowledge with experimental evidence, and ii) the error evaluation byconsidering a noise model inherent in real prediction frameworks. Our proposal is validated against sets of well-knownprotein sequences of four model organisms D. rerio, A. thaliana, S. cerevisiae and D. melanogaster
Published
2018-03-24
Section
Scientific articles

Most read articles by the same author(s)