Skip to Main Content (Press Enter)

Logo UNITO
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione

UNI-FIND
Logo UNITO

|

UNI-FIND

unito.it
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione
  1. Pubblicazioni

Take a Ramble into Solution Spaces for Classification Problems in Neural Networks

Contributo in Atti di convegno
Data di Pubblicazione:
2019
Abstract:
Solving a classification problem for a neural network means looking for a particular configuration of the internal parameters. This is commonly achieved by minimizing non-convex object functions. Hence, the same classification problem is likely to have several, different, equally valid solutions, depending on a number of factors like the initialization and the adopted optimizer. In this work, we propose an algorithm which looks for a zero-error path joining two solutions to the same classification problem. We witness that finding such a path is typically not a trivial problem; however, our heuristics is able to succeed in such a task. This is a step forward to explain why simple training heuristics (like SGD) are able to train complex neural networks: we speculate they focus on particular solutions, which belong to a connected solution sub-space. We work in two different scenarios: a synthetic, unbiased and totally-uncorrelated (hard) training problem, and MNIST. We empirically show that the algorithmically-accessible solutions space is connected, and we have hints suggesting it is a convex sub-space. © 2019, Springer Nature Switzerland AG.
Tipologia CRIS:
04A-Conference paper in volume
Elenco autori:
Tartaglione, Enzo; Grangetto, Marco
Autori di Ateneo:
GRANGETTO Marco
Link alla scheda completa:
https://iris.unito.it/handle/2318/1714235
Link al Full Text:
https://iris.unito.it/retrieve/handle/2318/1714235/538817/ICIAP19_takearamble.pdf
Titolo del libro:
International Conference on Image Analysis and Processing, ICIAP 2019
Pubblicato in:
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Journal
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Series
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.6.1.0