A Universal Scheme for Partitioned Dynamic Shortest Path Index

0upvotes

By: Mengxuan Zhang, Xinjie Zhou, Lei Li, Ziyi Liu, Goce Trajcevski, Yan Huang, Xiaofang Zhou

Graph partitioning is a common solution to scale up the graph algorithms, and shortest path (SP) computation is one of them. However, the existing solutions typically have a fixed partition method with a fixed path index and fixed partition structure, so it is unclear how the partition method and path index influence the pathfinding performance. Moreover, few studies have explored the index maintenance of partitioned SP (PSP) on dynamic gra... more

DatabasesOctober 13, 2023 10:22am

Comments (0)
Views (159)

Spatio-temporal flow patterns

0upvotes

By: Chrysanthi Kosyfaki, Nikos Mamoulis, Reynold Cheng, Ben Kao

Transportation companies and organizations routinely collect huge volumes of passenger transportation data. By aggregating these data (e.g., counting the number of passengers going from a place to another in every 30 minute interval), it becomes possible to analyze the movement behavior of passengers in a metropolitan area. In this paper, we study the problem of finding important trends in passenger movements at varying granularities, which... more

DatabasesOctober 9, 2023 9:54am

Comments (0)
Views (186)

Workload-aware and Learned Z-Indexes

0upvotes

By: Sachith Pai, Michael Mathioudakis, Yanhao Wang

In this paper, a learned and workload-aware variant of a Z-index, which jointly optimizes storage layout and search structures, as a viable solution for the above challenges of spatial indexing. Specifically, we first formulate a cost function to measure the performance of a Z-index on a dataset for a range-query workload. Then, we optimize the Z-index structure by minimizing the cost function through adaptive partitioning and ordering for ... more

DatabasesOctober 9, 2023 9:04am

Comments (0)
Views (192)

Minerva: Decentralized Collaborative Query Processing over InterPlanetary File System

0upvotes

By: Zhiyi Yao, Bowen Ding, Qianlan Bai, Yuedong Xu

Data silos create barriers in accessing and utilizing data dispersed over networks. Directly sharing data easily suffers from the long downloading time, the single point failure and the untraceable data usage. In this paper, we present Minerva, a peer-to-peer cross-cluster data query system based on InterPlanetary File System (IPFS). Minerva makes use of the distributed Hash table (DHT) lookup to pinpoint the locations that store content ch... more

DatabasesOctober 9, 2023 8:45am

Comments (0)
Views (206)

Blend: A Unified Data Discovery System

0upvotes

By: Mahdi Esmailoghli, Christoph Schnell, Renée J. Miller, Ziawasch Abedjan

Data discovery is an iterative and incremental process that necessitates the execution of multiple data discovery queries to identify the desired tables from large and diverse data lakes. Current methodologies concentrate on single discovery tasks such as join, correlation, or union discovery. However, in practice, a series of these approaches and their corresponding index structures are necessary to enable the user to discover the desired ... more

DatabasesOctober 5, 2023 8:53am

Comments (0)
Views (226)

Top-k contrast order-preserving pattern mining for time series classification

0upvotes

By: Youxi Wu, Yufei Meng, Yan Li, Lei Guo, Xingquan Zhu, Philippe Fournier-Viger, Xindong Wu

Recently, order-preserving pattern (OPP) mining, a new sequential pattern mining method, has been proposed to mine frequent relative orders in a time series. Although frequent relative orders can be used as features to classify a time series, the mined patterns do not reflect the differences between two classes of time series well. To effectively discover the differences between time series, this paper addresses the top-k contrast OPP (COPP... more

DatabasesOctober 5, 2023 8:53am

Comments (0)
Views (241)

Conditional independence on semiring relations

0upvotes

By: Miika Hannula

Conditional independence plays a foundational role in database theory, probability theory, information theory, and graphical models. In databases, conditional independence appears in database normalization and is known as the (embedded) multivalued dependency. Many properties of conditional independence are shared across various domains, and to some extent these commonalities can be studied through a measure-theoretic approach. The present ... more

DatabasesOctober 4, 2023 5:26am

Comments (0)
Views (214)

Optimizing substructure search: a novel approach for efficient querying in large chemical databases

0upvotes

By: Vsevolod Vaskin, Dmitri Jakovlev, Fedor Bakharev

Substructure search in chemical compound databases is a fundamental task in cheminformatics with critical implications for fields such as drug discovery, materials science, and toxicology. However, the increasing size and complexity of chemical databases have rendered traditional search algorithms ineffective, exacerbating the need for scalable solutions. We introduce a novel approach to enhance the efficiency of substructure search, moving... more

DatabasesOctober 4, 2023 5:25am

Comments (0)
Views (237)

Rel2Graph: Automated Mapping From Relational Databases to a Unified Property Knowledge Graph

0upvotes

By: Ziyu Zhao, Wei Liu, Tim French, Michael Stewart

Although a few approaches are proposed to convert relational databases to graphs, there is a genuine lack of systematic evaluation across a wider spectrum of databases. Recognising the important issue of query mapping, this paper proposes an approach Rel2Graph, an automatic knowledge graph construction (KGC) approach from an arbitrary number of relational databases. Our approach also supports the mapping of conjunctive SQL queries into patt... more

DatabasesOctober 3, 2023 5:57am

Comments (0)
Views (216)

MaaSDB: Spatial Databases in the Era of Large Language Models (Vision Paper)

0upvotes

By: Jianzhong Qi, Zuqing Li, Egemen Tanin

Large language models (LLMs) are advancing rapidly. Such models have demonstrated strong capabilities in learning from large-scale (unstructured) text data and answering user queries. Users do not need to be experts in structured query languages to interact with systems built upon such models. This provides great opportunities to reduce the barrier of information retrieval for the general public. By introducing LLMs into spatial data manage... more

DatabasesOctober 2, 2023 9:14am