Distributionally robust binary classifier under Wasserstein distance

dc.contributor.advisorWu, Jingjing
dc.contributor.advisorZhang, Qingrun
dc.contributor.authorHuang, Qian
dc.contributor.committeememberLiao, Wenyuan
dc.contributor.committeememberSwishchuk, Anatoliy
dc.date2024-11
dc.date.accessioned2024-09-10T19:48:39Z
dc.date.available2024-09-10T19:48:39Z
dc.date.issued2024-09-08
dc.description.abstractThe robustification of statistical models has been a popular topic for decades. Statistical robustification and robust optimization are the two main approaches in the literature, where the former stabilizes the model output by removing the outlier points while the latter concerns more the outlier points in making the conservative decisions. This thesis develops a novel robust optimization perspective to robustify a class of binary classifiers. Our model considers the worst-case distribution within a pre-determined uncertainty ball that centers at the given benchmark distribution with the radius calculated as per the Wasserstein distance. We derive the tractable formulation for the general problem. When focusing on the support vector machine (SVM), the general problem boils down to an easy-to-solve second- order cone programming problem. The robustified SVM is then applied to synthetic data with and without contamination, and our simulation studies show that our robustified SVM model can outperform the classical SVM and the extreme empirical loss SVM models under many circumstances.
dc.identifier.citationHuang, Q. (2024). Distributionally robust binary classifier under Wasserstein distance (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/119671
dc.identifier.urihttps://dx.doi.org/10.11575/PRISM/47282
dc.language.isoen
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subjectDistributional robustness
dc.subjectBinary classifier
dc.subjectSupport vector machine
dc.subjectWasserstein distance
dc.subject.classificationEducation--Mathematics
dc.titleDistributionally robust binary classifier under Wasserstein distance
dc.typemaster thesis
thesis.degree.disciplineMathematics & Statistics
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2024_huang_qian.pdf
Size:
1.45 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: