This research advances neural volume compression techniques for state-of-the-art 3D scene representation networks.
By combining advanced pruning algorithms with wavelet transformations, this work achieves competitive compression ratios while maintaining high reconstruction quality for volumetric data.
Traditional volume compression methods struggle to balance compression efficiency with reconstruction quality.
This work addresses the critical need for memory-efficient neural networks capable of representing complex 3D volumes with minimal storage requirements, which is essential for applications in medical imaging, scientific visualization, and computer graphics.
- Multi-Algorithm Pruning Framework: Implementation of three complementary pruning strategies - binary masking, Smallify, and Variational Dropout
- Wavelet-Enhanced Compression: Integration of wavelet transformations that concentrate feature information into fewer coefficients, dramatically improving pruning effectiveness
- Latent Feature Grid Optimization: Identifying and preserving the most critical network parameters while eliminating redundant ones
- End-to-End Training Pipeline: Complete framework built on FV-SRN with hyperparameter tuning and neural architecture search
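As a rough illustration of the simplest of the three pruning strategies, binary masking can be sketched as magnitude-based thresholding of grid parameters. The function name, shapes, and keep ratio below are hypothetical; the repository's actual pruning layers live in `model/Straight_Through_Dropout.py` and related files.

```python
import numpy as np

def binary_mask_prune(grid: np.ndarray, keep_ratio: float = 0.25):
    """Zero out all but the largest-magnitude fraction of parameters.

    Hypothetical sketch of magnitude-based binary masking; not the
    repository's actual implementation.
    """
    flat = np.abs(grid).ravel()
    k = max(1, int(keep_ratio * flat.size))
    # Threshold is the magnitude of the k-th largest entry
    threshold = np.partition(flat, -k)[-k]
    mask = np.abs(grid) >= threshold
    return grid * mask, mask

# Toy stand-in for a latent feature grid: 4 channels of an 8x8x8 grid
rng = np.random.default_rng(0)
grid = rng.standard_normal((4, 8, 8, 8)).astype(np.float32)
pruned, mask = binary_mask_prune(grid, keep_ratio=0.25)
```

In the actual framework the mask is learned during training rather than chosen by a fixed threshold, and the masked-out parameters are dropped entirely from the binary model file.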
- Up to 5 PSNR points improvement: Pruned networks achieve up to 5 PSNR points better reconstruction quality than baseline methods
- Competitive compression: Outperforms traditional compression algorithms like TTHRESH across multiple datasets
- Wavelet advantage: Wavelet-enhanced pruning shows significantly improved parameter efficiency, with most feature information concentrated in just a few coefficients
- Scalable performance: Validated on multiple datasets including 255³ and 150³ volume resolutions
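PSNR, the quality metric quoted above, measures reconstruction error on a logarithmic decibel scale. A minimal numpy implementation, assuming the original volume's value range is used as the peak signal:

```python
import numpy as np

def psnr(original: np.ndarray, reconstruction: np.ndarray) -> float:
    """Peak signal-to-noise ratio in dB, using the original's value range as peak."""
    mse = np.mean((original - reconstruction) ** 2)
    peak = original.max() - original.min()
    return float(10.0 * np.log10(peak ** 2 / mse))

# Toy volume with value range 1.0; a constant 0.01 offset gives MSE = 1e-4
vol = np.linspace(0.0, 1.0, 27).reshape(3, 3, 3)
noisy = vol + 0.01
print(round(psnr(vol, noisy), 1))  # -> 40.0
```

A gain of 5 PSNR points therefore corresponds to roughly a 3x reduction in mean squared error.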
The introduced algotihm for neural volume compression enables real-time processing of large-scale volumetric data with dramatically reduced memory footprints, making high-quality 3D neural representations accessible for resource-constrained environments and real-time applications.
Basic functionality, such as training and quantizing the network with 3D numpy arrays as input and writing the results as .vti files, is enabled by installing the requirements with pip (Env.txt) or conda (Env.yml).
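For reference, a volume with the same shape as the bundled test dataset can be created and saved from numpy as follows; the file name and random content are placeholders, not part of the repository.

```python
import os
import tempfile

import numpy as np

# Sketch: any 3D float volume stored as .npy can serve as training input.
# The shape mirrors datasets/test_vol.npy; the random content is made up.
vol = np.random.rand(150, 150, 150).astype(np.float32)
path = os.path.join(tempfile.gettempdir(), "my_volume.npy")
np.save(path, vol)

loaded = np.load(path)
print(loaded.shape)  # -> (150, 150, 150)
```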
The resulting .vti files can be visualized with ParaView.
Optionally, Pyrenderer can be used as a visualization tool and to feed CVol data into the network. Follow the installation instructions at https://github.com/shamanDevel/fV-SRN.
Datasets and corresponding config files for all experiments can be found in datasets/ and experiment-config-files/.
- Install the requirements from Env.txt (pip) or Env.yml (conda).
- Generate a config file for the experiment or use one from `experiment-config-files/`. Descriptions of the different parameters can be printed with `python Feature_Grid_Training.py --help`.
- Use `python Feature_Grid_Training.py --config <Path-To-Config-File>` to start training.
- During training, Tensorboard tracks the experiment under `runs/`. After training, a checkpoint of the trained model, the config file, and basic information about the run are logged to `experiments/<expname>/`. The checkpoint is stored in two ways: first as a torch .pth file (`model.pth`) for easy reading with other torch implementations, and secondly as a space-efficient binary representation in which the pruned parameters are removed (`binary_model_file` and `binary_model_file_mask.bnr`). Furthermore, .vti files for the ground-truth and the model-predicted volume are generated.
- A trained model can be inferred again explicitly with `python Feature_Grid_Inference.py --config_path <Path-To-Config-File> --reconstruct <binary>|<checkpoint>`. The paths to the binary masks and torch checkpoints are stored in the model config file; the reconstruction source can be set to `binary` to reconstruct from the efficient binary representation or to `checkpoint` to reconstruct from the torch checkpoint.
To find the best hyperparameters for each network type and dataset, the Ax multi-objective NAS algorithm is provided.
To run a hyperparameter search, start the 'Multiobjective-NAS' Jupyter notebook.
In the first cell, define the config file of the experiment, then execute the subsequent cells to start the scheduler and visualize the results.
The search space for each experiment can be configured in `Multi_Objective_NAS.py`.
- Argument parsing, as well as the entry points to training and inference, is implemented in `Feature_Grid_Training.py` and `Feature_Grid_Inference.py`.
- The initialization of the network, as well as the training loop, is implemented in `training/training.py`.
- Model utilities, such as network setup and storage, are implemented in `model/model_utils.py`.
- The basic model architecture can be found in `model/Feature_Grid_Model.py` and `model/Feature_Embedding.py`.
- The pruning algorithms are implemented in `model/Smallify_Dropout.py`, `model/Straight_Through_Dropout.py` and `model/Variational_Dropout_Layer.py`.
- Data input is handled by the classes in `data/`.
All experiments are performed on the `datasets/mhd1024.h5` (255, 255, 255) and `datasets/test_vol.npy` (150, 150, 150) datasets.
The fV-SRN, as well as Neurcomp, is able to outperform state-of-the-art compression algorithms such as TTHRESH.
Pruning works best when performed on large, overparameterized networks, where individual nodes have relatively little influence on the reconstruction quality. Pruning is performed on the latent feature grid of the fV-SRN and is able to significantly improve the reconstruction quality of the network, by up to 5 PSNR points.
The wavelet transformation further enhances the effectiveness of the pruning algorithms. This is because most of the latent feature information of the fV-SRN feature grid is encoded in just a few wavelet coefficients, enabling the pruning algorithms to easily distinguish important from unimportant parameters.
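This energy-compaction effect can be reproduced in a few lines with a single-level Haar transform, hand-rolled here on a toy 1D signal to avoid extra dependencies; the thesis applies the wavelet transform to the fV-SRN feature grid, not to this synthetic example.

```python
import numpy as np

def haar_1d(signal: np.ndarray):
    """One level of the orthonormal Haar wavelet transform."""
    even, odd = signal[0::2], signal[1::2]
    approx = (even + odd) / np.sqrt(2.0)   # low-frequency averages
    detail = (even - odd) / np.sqrt(2.0)   # high-frequency differences
    return approx, detail

# A smooth signal, loosely standing in for a spatially coherent grid slice
x = np.sin(np.linspace(0.0, np.pi, 64))
approx, detail = haar_1d(x)

# The orthonormal transform preserves total energy, but almost all of it
# lands in the approximation half, so the tiny detail coefficients can be
# pruned with little reconstruction loss.
energy_total = np.sum(x ** 2)
energy_approx = np.sum(approx ** 2)
print(energy_approx / energy_total > 0.99)  # -> True
```

Spatially coherent feature grids behave like the smooth signal above, which is why the transform lets a pruning criterion separate the few high-energy coefficients from the many near-zero ones.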
For a more extensive review, see the PDF of the Master's thesis.