BERTops: Studying BERT Representations under a Topological Lens
Chauhan, Jatin and Kaul, Manohar (2022) BERTops: Studying BERT Representations under a Topological Lens. In: 2022 International Joint Conference on Neural Networks, IJCNN 2022, 18 July 2022through 23 July 2022, Padua.
Text
Neural_Networks.pdf - Published Version Available under License Creative Commons Attribution. Download (1MB) |
Abstract
Proposing scoring functions to effectively understand, analyze and learn various properties of high dimensional hidden representations of large-scale transformer models like BERT can be a challenging task. In this work, we explore a new direction by studying the topological features of BERT hidden representations using persistent homology (PH). We propose a novel scoring function named 'persistence scoring function (PSF)' which: (i) accurately captures the homology of the high-dimensional hidden representations and correlates well with the test set accuracy of a wide range of datasets and outperforms existing scoring metrics, (ii) captures interesting post fine-tuning 'per-class' level properties from both qualitative and quantitative viewpoints, (iii) is more stable to perturbations as compared to the baseline functions, which makes it a very robust proxy, and (iv) finally, also serves as a predictor of the attack success rates for a wide category of black-box and white-box adversarial attack methods. Our extensive correlation experiments demonstrate the practical utility of PSF on various NLP tasks relevant to BERT11Code is available at https://github.com/chauhanjatin10/BERTops © 2022 IEEE.
IITH Creators: |
|
||
---|---|---|---|
Item Type: | Conference or Workshop Item (Paper) | ||
Uncontrolled Keywords: | BERT; Machine Learning; Neural Networks; Persistent Homology | ||
Subjects: | Computer science | ||
Divisions: | Department of Computer Science & Engineering | ||
Depositing User: | . LibTrainee 2021 | ||
Date Deposited: | 05 Nov 2022 10:22 | ||
Last Modified: | 05 Nov 2022 10:22 | ||
URI: | http://raiithold.iith.ac.in/id/eprint/11172 | ||
Publisher URL: | http://doi.org/10.1109/IJCNN55064.2022.9891897 | ||
Related URLs: |
Actions (login required)
View Item |
Statistics for this ePrint Item |