This repository contains codes for feature extraction and Se-Res2Net model used in Complementary regional energy features for spoofed speech detection.
Regional energy features are independent from the utterence lenghts, hence can be combined with deep embeddings. They provide a compact representation of the spectrogram. Time and frequency resolutions can be adjusted independenly. Examples for the utterances taken from LA and PA conditions of ASVspoof2019 dataset are given below. Combined with SE-Res2Net embeddings, 0.9% EER was obtained for the evaluation set of ASVspoof 2019 LA condition.

