Research Group of Prof. Dr. M. Griebel
Institute for Numerical Simulation

Compression of Large-Scale Spatial Data

Description

Hierarchical interpolation techniques allow the efficient representation of large spatial data sets, e.g. digital elevation models, images, or simulation results from numerical computations. Due to their hierarchical structure, they allow approximations of the original data to be derived at a variable level of detail. Using adaptive refinement, they can also represent smooth parts of the data very efficiently. This results in significant compression as well as a highly reduced computational cost for data visualization and analysis algorithms.
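
As a very small illustration of the underlying idea (not the scheme used for the terrain data itself), the following C++ sketch hierarchizes samples of a 1D function on a dyadic grid and counts how many hierarchical surpluses exceed a tolerance; the test function and the tolerance are chosen arbitrarily for this example.

    #include <cmath>
    #include <cstddef>
    #include <cstdio>
    #include <vector>

    double f(double x) { return std::sin(6.0 * x) + 0.3 * x; }  // placeholder test signal

    int main() {
        const int maxLevel = 10;
        const int n = (1 << maxLevel) + 1;           // 2^L + 1 grid points on [0,1]
        const double tol = 1e-3;                     // placeholder error tolerance

        std::vector<double> u(n);
        for (int i = 0; i < n; ++i) u[i] = f(double(i) / (n - 1));

        // Hierarchization: the surplus of a node is its value minus the linear
        // interpolant of its two hierarchical parents on coarser levels.
        std::size_t kept = 2;                        // always keep the two boundary values
        for (int l = 1; l <= maxLevel; ++l) {
            int step = 1 << (maxLevel - l);          // spacing of the level-l nodes
            for (int i = step; i < n - 1; i += 2 * step) {
                double surplus = u[i] - 0.5 * (u[i - step] + u[i + step]);
                if (std::fabs(surplus) > tol) ++kept;    // adaptive criterion
            }
        }
        std::printf("kept %zu of %d coefficients\n", kept, n);
    }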

However, many algorithms assume that the data is given and stored on a regular grid. Although this allows simple data structures and implicit traversal of the hierarchy, the benefits of reduced triangle counts do not carry over to the memory consumption. On the other hand, storing the approximations as irregular triangle meshes leads to high memory requirements per triangle (or vertex) of a multiresolution mesh and is thus inefficient for small approximation errors.
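
The implicit traversal mentioned above can be illustrated by a binary refinement tree stored heap-like in an array: the children of node i are addressed as 2i and 2i+1, so no pointers are stored at all. The following C++ fragment is only a schematic sketch; the error values and the threshold are placeholders.

    #include <cstddef>
    #include <cstdio>
    #include <vector>

    // Visit the adaptive cut through an implicitly stored binary refinement tree:
    // node i has children 2*i and 2*i+1, so the hierarchy needs no pointers.
    void traverse(const std::vector<float>& err, std::size_t i, float eps) {
        if (2 * i + 1 >= err.size() || err[i] <= eps) {
            std::printf("leaf triangle %zu\n", i);   // coarse enough, stop refining
            return;
        }
        traverse(err, 2 * i, eps);                   // left child by index arithmetic
        traverse(err, 2 * i + 1, eps);               // right child by index arithmetic
    }

    int main() {
        // placeholder error values for a full tree of depth 3 (index 0 unused)
        std::vector<float> err = {0, 8, 4, 5, 1, 3, 2, 0.5f,
                                  0, 0, 0, 0, 0, 0, 0, 0};
        traverse(err, 1, 2.0f);                      // adaptive cut for tolerance 2
    }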

We developed a compression scheme based on space-filling curves and relative branch pointers which allows efficient storage of large digital elevation models with full multiresolution functionality. Most importantly, it allows working on the compressed data itself. The total memory requirement of the data structures is less than 7 bytes per vertex, independent of the maximum level of the hierarchy. The latter property is particularly important for very large-scale or compound models consisting of several layers of terrain data.
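
The following C++ fragment sketches the general flavor of such an encoding under simplifying assumptions: the bisection hierarchy is traversed depth-first, which corresponds to a space-filling-curve ordering of the triangles, and every visited node contributes one refinement bit. The Node type and the encode routine are illustrative only; the actual data structure additionally stores the relative branch pointers and the vertex data needed for random access.

    #include <cstddef>
    #include <cstdio>
    #include <vector>

    struct Node {                       // hypothetical node of the adaptive hierarchy
        Node* child[2] = {nullptr, nullptr};
        bool refined() const { return child[0] != nullptr; }
    };

    // Depth-first traversal (space-filling-curve order): one bit per visited node.
    void encode(const Node* n, std::vector<bool>& bits) {
        bits.push_back(n->refined());
        if (n->refined()) {
            encode(n->child[0], bits);
            encode(n->child[1], bits);
        }
    }

    int main() {
        // tiny example hierarchy: root refined once, its left child refined once more
        Node leaves[3], inner, root;
        inner.child[0] = &leaves[0]; inner.child[1] = &leaves[1];
        root.child[0] = &inner;      root.child[1] = &leaves[2];

        std::vector<bool> bits;
        encode(&root, bits);
        std::size_t numLeaves = 0;
        for (bool b : bits) if (!b) ++numLeaves;
        std::printf("%zu leaf triangles, %zu nodes, %zu bits\n",
                    numLeaves, bits.size(), bits.size());
    }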

Examples


Here you see the global GTOPO30 dataset (courtesy of USGS) in several approximations with decreasing approximation error from left to right. The storage requirement ranges from several KBytes for 3,000 triangles (left image) to a few MBytes for one million triangles (right image). The relative error of the rightmost image is about 3.6%. In contrast, the original data set is several hundred MBytes in size. The lower row shows the color-shaded DEM, the upper row the corresponding adaptive grids.

In order to show the real benefits of the method, we added two further data sets to the global model. The first covers the lower Rhine valley (courtesy of SFB350) with a resolution of 50 m and is nested in the global data set. The second covers the city of Bonn (courtesy of DLR) with a resolution of about 2 m and is nested in the former data set. The four images are taken from an interactive application zooming from outer space into Bonn at several frames per second. The adaptive triangulations shown in the upper row contain roughly the same number of triangles (50,000 to 100,000), which makes the rendering performance scale-invariant. The whole compressed data set requires less than 1 GByte, with all data residing in main memory.



Thomas Gerstner