Overparametrized deep neural networks: Convergence and generalization properties

Polyzos Christos

URI	http://purl.tuc.gr/dl/dias/71452342-B134-4FF8-B2B5-06849C9E9FB4	-
Identifier	https://doi.org/10.26233/heallink.tuc.97293	-
Language	en	-
Extent	4.5 megabytes	en
Extent	61 pages	en
Title	Overparametrized deep neural networks: Convergence and generalization properties	en
Title	Υπερπαραμετροποιημένα νευρωνικά δίκτυα βαθείας μάθησης: Ιδιότητες σύγκλισης και γενίκευσης	el
Creator	Polyzos Christos	en
Creator	Πολυζος Χρηστος	el
Contributor [Thesis Supervisor]	Liavas Athanasios	en
Contributor [Thesis Supervisor]	Λιαβας Αθανασιος	el
Contributor [Committee Member]	Karystinos Georgios	en
Contributor [Committee Member]	Καρυστινος Γεωργιος	el
Contributor [Committee Member]	Zervakis Michail	en
Contributor [Committee Member]	Ζερβακης Μιχαηλ	el
Publisher	Πολυτεχνείο Κρήτης	el
Publisher	Technical University of Crete	en
Academic Unit	Technical University of Crete::School of Electrical and Computer Engineering	en
Academic Unit	Πολυτεχνείο Κρήτης::Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών	el
Content Summary	In this thesis, we consider deep neural networks for Machine Learning. We depict neural networks as weighted directed graphs and we represent them as parametric functions that receive an input and compute an output, or prediction, given some fixed parameters, the weights and the biases. The quintessence of a neural network is the feed-forward model, in which the underlying graph does not contain cycles (acyclic graph) and the parametric function is defined in a compositional, or hierarchical, way. Throughout our presentation, we focus on a supervised learning setting, where our neural network model, or learner, has access to a training set that contains examples of how pairs of input-output data are related. In other words, supervised learning amounts to learning from examples. Given a training set, depending whether the outputs have real or categorical values, we consider regression and logistic regression. For each setting, we provide the basic statistical framework and construct a loss function known as the empirical risk. We train our neural network by minimizing the empirical risk w.r.t. its parameters by using gradient-based optimization methods. The gradient of the loss function is computed via the back-propagation algorithm. We showcase the convergence and generalization properties of different algorithms (deep neural network models and optimization methods) using real-world data.	en
Type of Item	Διπλωματική Εργασία	el
Type of Item	Diploma Work	en
License	http://creativecommons.org/licenses/by/4.0/	en
Date of Item	2023-09-05	-
Date of Publication	2023	-
Subject	Overparameterization	en
Subject	Machine learning	en
Subject	Generalization	en
Subject	Deep neural networks	en
Subject	Deep learning	en
Subject	Convergence	en
Bibliographic Citation	Christos Polyzos, "Overparametrized deep neural networks: Convergence and generalization properties", Diploma Work, School of Electrical and Computer Engineering, Technical University of Crete, Chania, Greece, 2023	en
Bibliographic Citation	Χρήστος Πολύζος, "Υπερπαραμετροποιημένα νευρωνικά δίκτυα βαθείας μάθησης: Ιδιότητες σύγκλισης και γενίκευσης", Διπλωματική Εργασία, Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ελλάς, 2023	el

Search

Browse

My Space

Overparametrized deep neural networks: Convergence and generalization properties

Polyzos Christos

Available Files

Services

Export

Share

Statistics

Metadata & Content in a METS Package:

Metadata in Format: