Abstract
Provided is a computer-implemented method to find similar sets to a selected query set within a collection of sets, wherein each set represents a process. Each set is transformed to a representation in a vector space. Moreover, each set comprises a prefix.
The method comprises creating and storing a data structure representing an inverted metric index in a storage device. The similar sets to the selected query set are identified by a probing step followed by a candidate verifying step. In the probing step, the inverted metric index is filtered by means of a predefined subspace of the vector space. In the candidate verifying step, each set of the filtered inverted
metric index is identified as a similar set if its distance value is smaller or equal to a predefined distance threshold value.
The method comprises creating and storing a data structure representing an inverted metric index in a storage device. The similar sets to the selected query set are identified by a probing step followed by a candidate verifying step. In the probing step, the inverted metric index is filtered by means of a predefined subspace of the vector space. In the candidate verifying step, each set of the filtered inverted
metric index is identified as a similar set if its distance value is smaller or equal to a predefined distance threshold value.
| Originalsprache | Englisch |
|---|---|
| Veröffentlichungsnummer | WO2023208903A1 |
| IPC | G06F 16/22 2019.1 |
| Publikationsstatus | Veröffentlicht - 2 Nov. 2023 |
Systematik der Wissenschaftszweige 2012
- 102 Informatik
Ausstattungen/Einrichtungen
-
Database Research Cluster
Augsten, N. (Manager), Schäler, M. (Manager) & Egger, A. (System-Administrator/in)
InformatikForschungsinfrastruktur: Core Facility
Publikationen
- 1 Paper
-
A Two-Level Signature Scheme for Stable Set Similarity Joins
Schmitt, D. U., Kocher, D., Augsten, N., Mann, W. & Miller, A., Juli 2023, S. 2686-2698. 13 S.Publikation: Konferenzbeitrag › Paper › Peer-reviewed
Open Access
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver