Skip to Main Content (Press Enter)

Logo UNITO
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione

UNI-FIND
Logo UNITO

|

UNI-FIND

unito.it
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione
  1. Pubblicazioni

Peculiar genes selection: A new features selection method to improve classification performances in imbalanced data sets

Articolo
Data di Pubblicazione:
2017
Abstract:
High-Throughput technologies provide genomic and trascriptomic data that are suitable for biomarker detection for classification purposes. However, the high dimension of the output of such technologies and the characteristics of the data sets analysed represent an issue for the classification task. Here we present a new feature selection method based on three steps to detect class-specific biomarkers in case of high-dimensional data sets. The first step detects the differentially expressed genes according to the experimental conditions tested in the experimental design, the second step filters out the features with low discriminative power and the third step detects the class-specific features and defines the final biomarker as the union of the class-specific features. The proposed procedure is tested on two microarray datasets, one characterized by a strong imbalance between the size of classes and the other one where the size of classes is perfectly balanced. We show that, using the proposed feature selection procedure, the classification performances of a Support Vector Machine on the imbalanced data set reach a 82% whereas other methods do not exceed 73%. Furthermore, in case of perfectly balanced dataset, the classification performances are comparable with other methods. Finally, the Gene Ontology enrichments performed on the signatures selected with the proposed pipeline, confirm the biological relevance of our methodology. The download of the package with the implementation of Peculiar Genes Selection, ‘PGS’, is available for R users at: http://github.com/mbeccuti/PGS.
Tipologia CRIS:
03A-Articolo su Rivista
Keywords:
Computational Biology; Gene Expression Profiling; Vaccination; Algorithms; Genetics and Molecular Biology (all); Agricultural and Biological Sciences (all)
Elenco autori:
Martina, Federica; Beccuti, Marco; Balbo, Gianfranco; Cordero, Francesca
Autori di Ateneo:
BECCUTI Marco
CORDERO Francesca
Link alla scheda completa:
https://iris.unito.it/handle/2318/1652379
Link al Full Text:
https://iris.unito.it/retrieve/handle/2318/1652379/370860/Peculiar%20Genes%20Selection-%20A%20new%20features%20selection%20method%20to%20improve%20classification%20performances%20in%20imbalanced%20data%20sets.pdf
Pubblicato in:
PLOS ONE
Journal
  • Dati Generali

Dati Generali

URL

http://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0177475&type=printable
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.5.0.1