De-noising Distances: A Data Analysis Perspective
Singh, Arpan and Jayaram, Balasubramaniam (2019) De-noising Distances: A Data Analysis Perspective. Masters thesis, Indian institute of technology Hyderabad.
Text
Thesis_Msc_MA_5459.pdf - Submitted Version Restricted to Repository staff only until June 2020. Download (376kB) | Request a copy |
Abstract
In data analysis, the use of a distance function is ubiquitous. There is an increased awareness about the non-intuitive, and often counterintuitive, behaviour of distances in high dimensions. Similarity Search (SS) algorithms find the best possible match for a given data point from a set of data points. As the title indicates, this work is an attempt at exploring distances, in the context of SS, from the following two perspectives: On the one hand, considering "De-noising" as a verb, we would like to understand the properties or characteristics of distances that play a role in an SS. On the other hand, considering "De-noising" as an adjective, we would like to study distances that de-noise well, i.e., given a data point that is corrupted by some noise as the input, we study the dependencies that may exist between the type of noise and the distance function that accurately finds the original data in an SS, also taking into account the other 3D’s, viz., Dimensionality, Distribution and Denseness of the data.
IITH Creators: |
|
||||
---|---|---|---|---|---|
Item Type: | Thesis (Masters) | ||||
Subjects: | Mathematics | ||||
Divisions: | Department of Mathematics | ||||
Depositing User: | Team Library | ||||
Date Deposited: | 12 Jun 2019 11:09 | ||||
Last Modified: | 26 Jun 2019 03:46 | ||||
URI: | http://raiithold.iith.ac.in/id/eprint/5459 | ||||
Publisher URL: | |||||
Related URLs: |
Actions (login required)
View Item |
Statistics for this ePrint Item |