BERTops: Studying BERT Representations under a Topological Lens

Chauhan, Jatin and Kaul, Manohar (2022) BERTops: Studying BERT Representations under a Topological Lens. In: 2022 International Joint Conference on Neural Networks, IJCNN 2022, 18 July 2022through 23 July 2022, Padua.

[img] Text
Neural_Networks.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB)

Abstract

Proposing scoring functions to effectively understand, analyze and learn various properties of high dimensional hidden representations of large-scale transformer models like BERT can be a challenging task. In this work, we explore a new direction by studying the topological features of BERT hidden representations using persistent homology (PH). We propose a novel scoring function named 'persistence scoring function (PSF)' which: (i) accurately captures the homology of the high-dimensional hidden representations and correlates well with the test set accuracy of a wide range of datasets and outperforms existing scoring metrics, (ii) captures interesting post fine-tuning 'per-class' level properties from both qualitative and quantitative viewpoints, (iii) is more stable to perturbations as compared to the baseline functions, which makes it a very robust proxy, and (iv) finally, also serves as a predictor of the attack success rates for a wide category of black-box and white-box adversarial attack methods. Our extensive correlation experiments demonstrate the practical utility of PSF on various NLP tasks relevant to BERT11Code is available at https://github.com/chauhanjatin10/BERTops © 2022 IEEE.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: BERT; Machine Learning; Neural Networks; Persistent Homology
Subjects: Computer science
Divisions: Department of Computer Science & Engineering
Depositing User: . LibTrainee 2021
Date Deposited: 05 Nov 2022 10:22
Last Modified: 05 Nov 2022 10:22
URI: http://raiithold.iith.ac.in/id/eprint/11172
Publisher URL: http://doi.org/10.1109/IJCNN55064.2022.9891897
Related URLs:

    Actions (login required)

    View Item View Item
    Statistics for RAIITH ePrint 11172 Statistics for this ePrint Item