Hello @Zengyi-Qin @jkchengh @kzhang66
I have read your paper "Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates". The idea of combining the loss functions for the control barrier functions and the controller is fascinating.
As far as I understand, your experiments use tasks with a continuous action space. Can this approach also be applied to a reinforcement learning task with a discrete action space?