| |
Computer Science Seminar Series
Outlier Detection and Ranking: A Depth-Based Approach
Februray 27, 3:00pm
Weir Hall, Room 235
Dr. Yixin Chen
Abstract:
Statistical depth functions provide from the "deepest" point a
"center-outward ordering" of multi-dimensional data. In this sense, they can
detect observations that appear extreme relative to the rest of the
observations, i.e., outliers. Of the various statistical depths, the spatial
depth is especially appealing because of its computational efficiency and
mathematical tractability. In the first part of this talk, I introduce a
novel statistical depth, the kernelized spatial depth (KSD), which
generalizes the spatial depth via positive definite kernels. By choosing a
proper kernel, the KSD can capture the local structure of a data set while
the spatial depth fails. Based on the KSD, I propose a novel outlier
detection algorithm, by which an observation with a depth value less than a
threshold is declared as an outlier. The detector is simple in structure:
the threshold is the only one parameter for a given kernel. It can be
learned from a collection of "normal" observations or from a mixture of
normal observations and outliers with unknown labels. Upper bounds on the
false alarm probability of a depth-based detector are derived. These upper
bounds can be used to determine the threshold. In the second part of this
talk, we generalize the KSD concept to graph data. This naturally leads to a
ranking algorithm based on graph kernels.
[ Home |
Site Map ]
|
|