Skip to content

Conversation

@dev-sda1
Copy link

@dev-sda1 dev-sda1 commented Dec 17, 2025

Nodes that are marked as unavailable/down in Slurm, along with their processors, probably shouldn't be counted in the total_nodes and total_processors figure, as this leads to misleading numbers in certain areas such as the System Status, which advertises more nodes and processors available than there really are. This is made possible by adjusting the sinfo command called by node_cpu_info to use %F instead of %A.

This is more of a proposal at the moment so I've also added an unavailable_nodes field that breaks out the down/unavailable node field, if directly modifying total_nodes and total_processors isn't considered ideal or breaks some things, however this way would either need this replicated across all other adapters, or a special check added in cluster_info.rb for Slurm.

Total nodes shouldn't account for nodes marked as down in Slurm, as this might look misleading in certain widgets like System Status. Have also broken out unavailable nodes as a new property if directly modifying total_nodes isn't considered ideal.
@dev-sda1 dev-sda1 marked this pull request as draft December 17, 2025 16:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant