FastBDT

Stochastic gradient-boosted decision trees for multivariate classification, usable standalone and via Python interface.

See the paper on arXiv: FastBDT: A speed-optimized and cache-friendly implementation of stochastic gradient-boosted decision trees for multivariate classification

Stochastic gradient-boosted decision trees are widely employed for multivariate classification and regression tasks. This paper presents a speed-optimized and cache-friendly implementation for multivariate classification called FastBDT. FastBDT is one order of magnitude faster during the fitting and application phases compared to popular implementations in frameworks like TMVA, scikit-learn, and XGBoost. The concepts used to optimize execution time and performance are discussed in detail in this paper. Key ideas include:

  • equal-frequency binning of the input data, which allows replacing expensive floating-point operations with integer operations while improving classification quality (a sketch of this idea follows the list);
  • a cache-friendly linear access pattern to the input data, in contrast to typical implementations that exhibit random access patterns.
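
To illustrate the first point, here is a minimal Python sketch of equal-frequency binning. This is an illustration of the concept only, not FastBDT's actual implementation: bin edges are placed at quantiles of a feature, so every integer bin receives roughly the same number of events.

import numpy as np

def equal_frequency_bins(feature, n_bins):
    # Interior bin edges at quantiles of the data, so each of the
    # n_bins bins holds roughly the same number of events.
    edges = np.quantile(feature, np.linspace(0.0, 1.0, n_bins + 1)[1:-1])
    # Map each floating-point value to an integer bin index; the tree
    # fitting can then operate on integers instead of floats.
    return np.searchsorted(edges, feature)

x = np.random.normal(size=1000)
bins = equal_frequency_bins(x, 16)  # integer bin indices in [0, 15]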

FastBDT provides interfaces to C/C++ and Python. It is extensively used in high energy physics by the Belle II Collaboration.


Warning

This repository is a fork maintained by the Belle II Collaboration. It is guaranteed to compile with modern compilers, and the unit tests and main examples are fully functional unless stated otherwise. However, no further development of this fork is currently planned.

The original repository can be found at: https://github.com/thomaskeck/FastBDT


Installation

To build and install FastBDT, use the following commands:

mkdir -p build install && cd build
cmake ..
make
make install

This will also install the Python bindings automatically if CMake detects a valid python3 interpreter during the configuration step.
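
A quick way to check that the bindings ended up importable (hedged: this assumes the module layout PyFastBDT/FastBDT.py referenced below):

python3 -c "from PyFastBDT import FastBDT"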


Usage

Typically, you will want to use FastBDT as a library integrated directly into your application. Available interfaces:

  • the C++ shared/static library (see examples/IRISExample.cxx)
  • the C shared library
  • the Python library PyFastBDT/FastBDT.py (see examples/iris_example.py and examples/generic_example.py, and the sketch below)
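
As a quick orientation, the following is a minimal sketch of the Python interface, modeled on examples/iris_example.py. It assumes the scikit-learn-style Classifier exposed by PyFastBDT/FastBDT.py; hyperparameter names and defaults may differ in your version.

import numpy as np
from PyFastBDT import FastBDT

# Toy data: two Gaussian classes with four features each.
signal     = np.random.normal(+1.0, 1.0, size=(1000, 4))
background = np.random.normal(-1.0, 1.0, size=(1000, 4))
X = np.vstack([signal, background])
y = np.array([1] * 1000 + [0] * 1000)

clf = FastBDT.Classifier()      # default hyperparameters
clf.fit(X, y)                   # binning and boosting happen internally
probabilities = clf.predict(X)  # one signal probability per event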

Weight type and numerical precision

By default, FastBDT uses single-precision floating point (float) as the type for internal weights in the C++ implementation. This choice is made for performance reasons and is sufficient for most use cases. If higher numerical precision is required, FastBDT can be compiled with double-precision floating point (double) weights by enabling the following CMake option at configuration time:

cmake .. -DUSE_DOUBLE_WEIGHT=ON

This changes the internal weight type used throughout the FastBDT codebase.

Weight type in C++

When working with FastBDT in C++, it is strongly recommended to use the type alias FastBDT::Weight (available via the header FastBDT.h) for all weight-related variables, rather than hard-coding float or double. This keeps user code compatible regardless of whether FastBDT is built with single or double precision.

Weight type in Python

The Python interface automatically handles the internal weight type and requires no user action. Switching between single and double precision is entirely transparent to Python users.


Further reading

This work is mostly based on the papers by Jerome H. Friedman: Greedy Function Approximation: A Gradient Boosting Machine (Annals of Statistics, 2001) and Stochastic Gradient Boosting (Computational Statistics & Data Analysis, 2002).

FastBDT also implements the uniform gradient-boosting technique used to boost to flatness, introduced in New approaches for boosting to uniformity by Rogozhnikov et al. (JINST 10 (2015) T03002, arXiv:1410.4140).
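
A hedged sketch of how the flatness loss might be enabled from Python: the parameter names flatnessLoss and numberOfFlatnessFeatures, and the convention that the flatness (spectator) features occupy the last columns of X, are assumptions about the Python interface and may differ in your version.

import numpy as np
from PyFastBDT import FastBDT

# 4 training features plus 1 spectator the output should be flat in
# (assumed convention: spectator columns come last).
X = np.random.normal(size=(2000, 5))
y = (X[:, 0] + np.random.normal(size=2000) > 0).astype(int)

clf = FastBDT.Classifier(flatnessLoss=1.0,            # assumed parameter name
                         numberOfFlatnessFeatures=1)  # assumed parameter name
clf.fit(X, y)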
