Data Series Progressive Similarity Search with Probabilistic Quality Guarantees - Laboratoire d'Informatique PAris DEscartes - EA 2517
Conference paper Year: 2020

Data Series Progressive Similarity Search with Probabilistic Quality Guarantees

Abstract

Existing systems dealing with the increasing volume of data series cannot guarantee interactive response times, even for fundamental tasks such as similarity search. Therefore, it is necessary to develop analytic approaches that support exploration and decision making by providing progressive results before the final, exact ones have been computed. Prior works lack both efficiency and accuracy when applied to large-scale data series collections. We present and experimentally evaluate a new probabilistic learning-based method that provides quality guarantees for progressive Nearest Neighbor (NN) query answering. We provide both initial and progressive estimates of the final answer that improve as the similarity search proceeds, as well as suitable stopping criteria for the progressive queries. Experiments with synthetic and diverse real datasets demonstrate that our prediction methods constitute the first practical solution to the problem, significantly outperforming competing approaches.
Main file
progressive-sigmod.pdf (4.74 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02560760 , version 1 (02-05-2020)

Identifiers

Cite

Anna Gogolou, Theophanis Tsandilas, Karima Echihabi, Anastasia Bezerianos, Themis Palpanas. Data Series Progressive Similarity Search with Probabilistic Quality Guarantees. ACM SIGMOD International Conference on Management of Data, Jun 2020, Portland, United States. pp.1857-1873, ⟨10.1145/3318464.3389751⟩. ⟨hal-02560760⟩