Community-based Outlier Detection for Edge-attributed Graphs

P, Supriya and Balasubramanian, Vineeth N (2016) Community-based Outlier Detection for Edge-attributed Graphs. arXiv. pp. 1-9.

[img]
Preview
Text (arXiv copy)
1612.09435.pdf - Accepted Version

Download (466kB) | Preview

Abstract

The study of networks has emerged in diverse disciplines as a means of analyzing complex relationship data. Beyond graph analysis tasks like graph query processing, link analysis, influence propagation, there has recently been some work in the area of outlier detection for information network data. Although various kinds of outliers have been studied for graph data, there is not much work on anomaly detection from edge-attributed graphs. In this paper, we introduce a method that detects novel outlier graph nodes by taking into account the node data and edge data simultaneously to detect anomalies. We model the problem as a community detection task, where outliers form a separate community. We propose a method that uses a probabilistic graph model (Hidden Markov Random Field) for joint modeling of nodes and edges in the network to compute Holistic Community Outliers (HCOutliers). Thus, our model presents a natural setting for heterogeneous graphs that have multiple edges/relationships between two nodes. EM (Expectation Maximization) is used to learn model parameters, and infer hidden community labels. Experimental results on synthetic datasets and the DBLP dataset show the effectiveness of our approach for finding novel outliers from networks.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Balasubramanian, Vineeth NUNSPECIFIED
Item Type: Article
Subjects: Computer science > Big Data Analytics
Divisions: Department of Computer Science & Engineering
Depositing User: Team Library
Date Deposited: 10 Jan 2017 04:32
Last Modified: 25 Apr 2018 05:37
URI: http://raiithold.iith.ac.in/id/eprint/2977
Publisher URL: https://arxiv.org/abs/1612.09435
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 2977 Statistics for this ePrint Item