LIDL: Local Intrinsic Dimension estimation using Likelihood

Speaker(s)
Piotr Tempczyk
Date
March 18, 2021, 12:15
Event information
meet.google.com/yew-oubf-ngi
Seminar
Seminar "Uczenie maszynowe" (Machine Learning)

We investigate the problem of local intrinsic dimension (LID) estimation. The LID of a data point is the minimal number of coordinates needed to describe the point and its neighborhood without significant information loss. Existing methods for LID estimation do not scale well to high-dimensional data because they estimate the LID from nearest-neighbor structure, which may cause problems due to the curse of dimensionality. We propose a new method, Local Intrinsic Dimension estimation using Likelihood (LIDL), which builds on recent progress in likelihood estimation in high dimensions, such as normalizing flows (NF). We show that our method yields more accurate estimates than previous state-of-the-art algorithms for LID estimation on standard benchmarks for this problem and that, unlike other methods, it scales well to problems with thousands of dimensions. We anticipate that this new approach will open a way to accurate LID estimation for real-world, high-dimensional datasets, and we expect it to improve further with advances in the NF literature.
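
The abstract does not spell out the estimator, so the following is only a minimal sketch of one way a likelihood-based LID estimate can work, under an assumption that is not stated above: if data lying on a d-dimensional set in D-dimensional space is perturbed with Gaussian noise of scale delta, the log-density at a point grows roughly linearly in log(delta) with slope d - D for small delta. In place of a trained normalizing flow, the sketch uses a closed-form toy density (a standard Gaussian on the first d coordinates). All function names here are hypothetical and chosen for illustration.

```python
import math

def log_density_perturbed(x, d, delta):
    """Toy stand-in for a learned density model (assumption, not from the talk).

    Log-density at x of a standard Gaussian supported on the first d of
    len(x) coordinates, after adding isotropic Gaussian noise of scale delta.
    """
    total = 0.0
    for i, xi in enumerate(x):
        # On-manifold coordinates have variance 1 + delta^2, the rest delta^2.
        var = 1.0 + delta ** 2 if i < d else delta ** 2
        total += -0.5 * math.log(2.0 * math.pi * var) - xi ** 2 / (2.0 * var)
    return total

def fit_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((a - mean_x) ** 2 for a in xs)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    return sxy / sxx

def estimate_lid(x, log_density, deltas):
    """Estimate LID at x as D + slope of log-density versus log(delta)."""
    log_deltas = [math.log(delta) for delta in deltas]
    log_ps = [log_density(x, delta) for delta in deltas]
    return len(x) + fit_slope(log_deltas, log_ps)

D, d = 10, 2                           # ambient and true intrinsic dimension
x = [0.3, -0.5] + [0.0] * (D - d)      # a point on the 2-dimensional subspace
deltas = [0.01, 0.02, 0.05, 0.1]       # small noise scales
lid = estimate_lid(x, lambda p, delta: log_density_perturbed(p, d, delta), deltas)
print(round(lid, 2))
```

In a realistic setting the closed-form density would be replaced by a density model, such as a normalizing flow, fitted separately to data perturbed at each noise scale; the regression step would stay the same.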