Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
dc.contributor.advisor | Ioannou, Yani | |
dc.contributor.author | Jain, Rohan | |
dc.contributor.committeemember | Drew, Steve | |
dc.contributor.committeemember | Bento, Mariana | |
dc.date | 2025-11 | |
dc.date.accessioned | 2025-06-06T16:04:57Z | |
dc.date.available | 2025-06-06T16:04:57Z | |
dc.date.issued | 2025-06-05 | |
dc.description.abstract | The Lottery Ticket Hypothesis (LTH) suggests there exists a sparse LTH mask and weights that achieve the same generalization performance as the dense model while using significantly fewer parameters. LTH achieves this by iteratively sparsifying and re-training within the pruned solution basin. However, finding a LTH solution is computationally expensive, and a LTH’s sparsity mask does not generalize to other random weight initializations. Recent work has suggested that neural networks trained from random initialization find solutions within the same basin modulo permutations, and proposes a method to align trained models within the same loss basin. We hypothesize that misalignment of basins is the reason why LTH masks do not generalize to new random initializations and propose permuting the LTH mask to align with the new optimization basin when performing sparse training from a different random initialization. We empirically show a significant increase in generalization when sparse training from random initialization with the permuted mask as compared to using the non-permuted LTH mask, on multiple datasets (CIFAR-10/100 & ImageNet) and models (VGG11 & ResNet20/50). | |
dc.identifier.citation | Jain, R. (2025). Sparse training from random initialization: aligning lottery ticket masks using weight symmetry (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. | |
dc.identifier.uri | https://hdl.handle.net/1880/121904 | |
dc.language.iso | en | |
dc.publisher.faculty | Graduate Studies | |
dc.publisher.institution | University of Calgary | |
dc.rights | University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. | |
dc.subject | Lottery Ticket Hypothesis | |
dc.subject | Linear Mode Connectivity | |
dc.subject | Weight Symmetry | |
dc.subject | Deep Neural Networks | |
dc.subject | Optimization | |
dc.subject | Sparse Training | |
dc.subject.classification | Artificial Intelligence | |
dc.subject.classification | Education--Sciences | |
dc.subject.classification | Education--Technology | |
dc.subject.classification | Computer Science | |
dc.title | Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry | |
dc.type | master thesis | |
thesis.degree.discipline | Engineering – Electrical & Computer | |
thesis.degree.grantor | University of Calgary | |
thesis.degree.name | Master of Science (MSc) | |
ucalgary.thesis.accesssetbystudent | I do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible. |