De-noising Distances: A Data Analysis Perspective

Singh, Arpan and Jayaram, Balasubramaniam (2019) De-noising Distances: A Data Analysis Perspective. Masters thesis, Indian institute of technology Hyderabad.

[img] Text
Thesis_Msc_MA_5459.pdf - Submitted Version
Restricted to Repository staff only until June 2020.

Download (376kB) | Request a copy

Abstract

In data analysis, the use of a distance function is ubiquitous. There is an increased awareness about the non-intuitive, and often counterintuitive, behaviour of distances in high dimensions. Similarity Search (SS) algorithms find the best possible match for a given data point from a set of data points. As the title indicates, this work is an attempt at exploring distances, in the context of SS, from the following two perspectives: On the one hand, considering "De-noising" as a verb, we would like to understand the properties or characteristics of distances that play a role in an SS. On the other hand, considering "De-noising" as an adjective, we would like to study distances that de-noise well, i.e., given a data point that is corrupted by some noise as the input, we study the dependencies that may exist between the type of noise and the distance function that accurately finds the original data in an SS, also taking into account the other 3D’s, viz., Dimensionality, Distribution and Denseness of the data.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Jayaram, Balasubramaniamhttp://orcid.org/0000-0001-7370-3821
Item Type: Thesis (Masters)
Subjects: Mathematics
Divisions: Department of Mathematics
Depositing User: Team Library
Date Deposited: 12 Jun 2019 11:09
Last Modified: 26 Jun 2019 03:46
URI: http://raiithold.iith.ac.in/id/eprint/5459
Publisher URL:
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 5459 Statistics for this ePrint Item