Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Practical notes on lossy compression of scientific data

View through CrossRef
<p>Lossy compression methods are extremely efficient in terms of space and performance and allow for reduction of network bandwidth and disk space needed to store data arrays without sacrificing the number of stored values.  Lossy compression involves an irreversible transformation of data that reduces the information content of the data.  The transformation introduces a distortion that is normally measured in terms of absolute or relative error. The error is higher for higher compression ratios.  A good choice of lossy compression parameters maximizes the compression ratio while keeping the introduced error within acceptable margins.  Negligence or failure to chose a right compression method or its parameters leads to poor compression ratio, or loss of the data.</p><p>A good strategy for lossy compression would involve sepcification of the acceptible error margin and choice of compression parameters and storage format. We will discuss specific techniques of lossy compression, and illustrate pitfalls in choice of the error margins and tools for lossy/lossless compression. The following specific topics will be covered:</p><p>1. Packing of floating-point data to integers in NetCDF is sub-optimal in most cases,   and for some quantities leads to severe errors.<br>2. Keeping relative vs absolute precision: false alternative.<br>3. Acceptible error margin depends on both the origin and the intended application of data.<br>4. Smart algorithms to decide on compression parameters have limited area of applicability,   which has to be considered in each individual case.<br>5. Choice of a format for compressed data (NetCDF, GRIB2, Zarr): tradeoff between size, speed and precision.<br>6. What "number_of_significant_digits" and "least_significant_digit" mean in terms of relative/absolute error.<br>7. Bit-Shuffle is not always beneficial.</p>
Copernicus GmbH
Title: Practical notes on lossy compression of scientific data
Description:
<p>Lossy compression methods are extremely efficient in terms of space and performance and allow for reduction of network bandwidth and disk space needed to store data arrays without sacrificing the number of stored values.
 Lossy compression involves an irreversible transformation of data that reduces the information content of the data.
 The transformation introduces a distortion that is normally measured in terms of absolute or relative error.
The error is higher for higher compression ratios.
 A good choice of lossy compression parameters maximizes the compression ratio while keeping the introduced error within acceptable margins.
 Negligence or failure to chose a right compression method or its parameters leads to poor compression ratio, or loss of the data.
</p><p>A good strategy for lossy compression would involve sepcification of the acceptible error margin and choice of compression parameters and storage format.
We will discuss specific techniques of lossy compression, and illustrate pitfalls in choice of the error margins and tools for lossy/lossless compression.
The following specific topics will be covered:</p><p>1.
Packing of floating-point data to integers in NetCDF is sub-optimal in most cases,   and for some quantities leads to severe errors.
<br>2.
Keeping relative vs absolute precision: false alternative.
<br>3.
Acceptible error margin depends on both the origin and the intended application of data.
<br>4.
Smart algorithms to decide on compression parameters have limited area of applicability,   which has to be considered in each individual case.
<br>5.
Choice of a format for compressed data (NetCDF, GRIB2, Zarr): tradeoff between size, speed and precision.
<br>6.
What "number_of_significant_digits" and "least_significant_digit" mean in terms of relative/absolute error.
<br>7.
Bit-Shuffle is not always beneficial.
</p>.

Related Results

Survey on Various Image Compression Techniques Used in Image Processing to Improve the Quality of Image
Survey on Various Image Compression Techniques Used in Image Processing to Improve the Quality of Image
This paper presents study of assorted lossy compression techniques. the 2 techniques are Wavelet Difference Reduction (WDR) based compression and Singular Value Decomposition (SVD)...
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Abstract Thoracic outlet syndrome (TOS) is a complex and often overlooked condition caused by the compression of neurovascular structures as they pass through the thoracic outlet. ...
A note on precision-preserving compression of scientific data
A note on precision-preserving compression of scientific data
Abstract. Lossy compression of scientific data arrays is a powerful tool to save network bandwidth and storage space. Properly applied lossy compression can reduce the size of a da...
Lossy Scientific Data Compression With SPERR
Lossy Scientific Data Compression With SPERR
<p>Much of the research in lossy data compression has focused on minimizing the average error for a given storage budget. For scientific applications, the maximum poi...
Continuous Leakage Resilient Lossy Trapdoor Functions
Continuous Leakage Resilient Lossy Trapdoor Functions
Lossy trapdoor functions (LTFs) were first introduced by Peikert and Waters (STOC’08). Since their introduction, lossy trapdoor functions have found numerous applications. They can...
Improving the performance of 3D image model compression based on optimized DEFLATE algorithm
Improving the performance of 3D image model compression based on optimized DEFLATE algorithm
AbstractThis study focuses on optimizing and designing the Delayed-Fix-Later Awaiting Transmission Encoding (DEFLATE) algorithm to enhance its compression performance and reduce th...
Lossless Compression Method for Medical Image Sequences Using Super-Spatial Structure Prediction and Inter-frame Coding
Lossless Compression Method for Medical Image Sequences Using Super-Spatial Structure Prediction and Inter-frame Coding
Space research organizations, hospitals and military air surveillance activities, among others, produce a huge amount of data in the form of images hence a large storage space is r...
Millimetre‐wave broadband waveguide‐based power combiner using lossy planar lines
Millimetre‐wave broadband waveguide‐based power combiner using lossy planar lines
A lossy waveguide‐based power combiner is proposed for broadband low‐loss symmetrically combining with good port matching and isolation at millimetre‐wave frequencies. In the combi...

Back to Top