Adaptive Indexing for In-situ Visual Exploration and Analytics

Stavros Maroulis, Nikos Bikakis, George Papastefanatos, Panos Vassiliadis, Yannis Vassiliou

Summary

In-situ processing has received a great deal of attention in recent years. In in-situ scenarios, big raw data files which do not fit in main memory, must be efficiently handled using commodity hardware, without the overhead of a preprocessing phase or the loading of data into a database. In this work, we present an adaptive indexing scheme that enables efficient visual exploration and analytics over big raw data files. Beyond visual exploration and statistics, the scheme enables categorical-based analytics using group-by and filter operations. The proposed scheme combines a tile-based structure that offers efficient exploratory operations over the 2D space, with a tree-based structure that organizes a tile’s objects based on their categorical values, enabling efficient visual analytics and the support of advanced visualization methods. The index resides in main memory and is built progressively as the user explores parts of the raw file, whereas its structure and level of granularity are adjusted to the user’s exploration areas and type of analysis. We conduct experiments using real and synthetic datasets, and demonstrate that the proposed approach, is in most cases more than 40× faster compared to the existing solutions, and performs around 3 orders of magnitude less I/O operations.

Texts

Stavros Maroulis, Nikos Bikakis, George Papastefanatos, Panos Vassiliadis, Yannis Vassiliou. Adaptive Indexing for In-situ Visual Exploration and Analytics. 23rd International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2021), March 23, 2021.
Local copy of the paper (PDF)

Presentations

The slides of the talk are here (pdf)

A video presentation video at vimeo.

Other

The source code of the paper is available at github, here

The paper is part of a larger effort on interactive data exploration -- see the VisualFacts project