node-exporter was launched natively inside a docker container with the “infiniband” collector enabled by default, but was unable to provide IB metrics like node_infiniband_port_data_received_bytes_total
.
Debug logs:
ts=2022-07-19T23:51:58.455Z caller=infiniband_linux.go:119 level=debug collector=infiniband msg="**infiniband statistics not found, skipping**"
ts=2022-07-19T23:51:58.455Z caller=collector.go:167 level=debug msg="**collector returned no data**" name=infiniband duration_seconds=0.048551425 err="collector returned no data"
Infiniband metrics were not collected
curl http://localhost:9101/metrics | grep node_scrape_collector_success{collector=\"infiniband\"}
node_scrape_collector_success{collector=“infiniband”} 0
Infiniband source folder exists with read permissions:
ls -l /sys/class/infiniband
total 0
lrwxrwxrwx 1 root root 0 Jul 16 01:03 mlx5_0 -> ../../devices/LNXSYSTM:00/LNXSYBUS:00/ACPI0004:00/VMBUS:00/0000000b-0101-0001-3135-423331303142/pci0101:00/0101:00:00.0/infiniband/mlx5_0
...
lrwxrwxrwx 1 root root 0 Jul 16 01:03 mlx5_8 -> ../../devices/LNXSYSTM:00/LNXSYBUS:00/ACPI0004:00/VMBUS:00/e8c6c124-f62a-49b1-a22b-b301d36239c9/pcif62a:00/f62a:00:02.0/infiniband/mlx5_8
ls -l /sys/class/infiniband/mlx5_0/ports/1/
total 0
-r--r--r-- 1 root root 4096 Jul 21 05:26 cap_mask
drwxr-xr-x 2 root root 0 Jul 22 16:03 cm_rx_duplicates
drwxr-xr-x 2 root root 0 Jul 22 16:03 cm_rx_msgs
drwxr-xr-x 2 root root 0 Jul 22 16:03 cm_tx_msgs
drwxr-xr-x 2 root root 0 Jul 22 16:03 cm_tx_retries
drwxr-xr-x 2 root root 0 Jul 21 05:28 counters
drwxr-xr-x 4 root root 0 Jul 22 16:03 gid_attrs
drwxr-xr-x 2 root root 0 Jul 21 05:26 gids
-r--r--r-- 1 root root 4096 Jul 22 16:03 has_smi
drwxr-xr-x 2 root root 0 Jul 22 16:03 hw_counters
-r--r--r-- 1 root root 4096 Jul 21 05:26 lid
-r--r--r-- 1 root root 4096 Jul 21 05:26 lid_mask_count
-r--r--r-- 1 root root 4096 Jul 21 05:26 link_layer
-r--r--r-- 1 root root 4096 Jul 21 05:26 phys_state
drwxr-xr-x 2 root root 0 Jul 21 05:26 pkeys
-r--r--r-- 1 root root 4096 Jul 21 05:26 rate
-r--r--r-- 1 root root 4096 Jul 21 05:26 sm_lid
-r--r--r-- 1 root root 4096 Jul 21 05:26 sm_sl
-r--r--r-- 1 root root 4096 Jul 21 05:26 state