Skip to Main Content (Press Enter)

Logo UNITO
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione

UNI-FIND
Logo UNITO

|

UNI-FIND

unito.it
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione
  1. Pubblicazioni

Positive and unlabeled learning in categorical data

Articolo
Data di Pubblicazione:
2016
Abstract:
In common binary classification scenarios, the presence of both positive and negative examples in training data is needed to build an efficient classifier. Unfortunately, in many domains, this requirement is not satisfied and only one class of examples is available. To cope with this setting, classification algorithms have been introduced that learn from Positive and Unlabeled (PU) data. Originally, these approaches were exploited in the context of document classification. Only few works address the PU problem for categorical datasets. Nevertheless, the available algorithms are mainly based on Naive Bayes classifiers. In this work we present a new distance based PU learning approach for categorical data: Pulce. Our framework takes advantage of the intrinsic relationships between attribute values and exceeds the independence assumption made by Naive Bayes. Pulce, in fact, leverages on the statistical properties of the data to learn a distance metric employed during the classification task. We extensively validate our approach over real world datasets and demonstrate that our strategy obtains statistically significant improvements w.r.t. state-of-the-art competitors.
Tipologia CRIS:
03A-Articolo su Rivista
Keywords:
Positive unlabeled learning, Partially supervised learning, Distance learning, Categorical data
Elenco autori:
Ienco, Dino; Pensa, Ruggero G.
Autori di Ateneo:
PENSA Ruggero Gaetano
Link alla scheda completa:
https://iris.unito.it/handle/2318/1558958
Link al Full Text:
https://iris.unito.it/retrieve/handle/2318/1558958/150579/neurocom2016_draft_4aperto.pdf
Pubblicato in:
NEUROCOMPUTING
Journal
  • Dati Generali

Dati Generali

URL

http://www.sciencedirect.com/science/article/pii/S0925231216003118
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.6.1.0