A user has requested confirmation that the InfiniBand network is performing optimally and is not limiting the speed of a training run. To verify this, you would like to measure the RDMA throughput between two endpoints.
Which tool should be used?
You are troubleshooting InfiniBand connectivity issues in a cluster managed by the NVIDIA Network Operator. You need to verify the status of the InfiniBand interfaces. Which command should you use to check the state and link layer of InfiniBand interfaces on a node?
You are investigating a performance issue in a Spectrum-X network and suspect there might be congestion problems.
Which component executes the congestion control algorithm in a Spectrum-X environment?
You suspect there might be connectivity issues in your InfiniBand fabric and need to perform a comprehensive check. Which tool should you use to run a full fabric diagnostic and generate a report?
Which of the following tools in Cumulus Linux are specifically useful for detecting and differentiating microbursts from regular network congestion?
Pick the 2 correct responses below.
A major cloud provider is designing a new data center to support large-scale AI workloads, particularly for training large language models. They want to optimize their network architecture for maximum performance and efficiency.
Why is a rail-optimized topology considered a best practice for AI network architecture in this scenario?
Which of the following statements are true about AI workloads and adaptive routing?
Pick the 2 correct responses below.
You are concerned about potential security threats and unexpected downtime in your InfiniBand data center.
Which UFM platform uses analytics to detect security threats, operational issues, and predict network failures in InfiniBand data centers?
You are automating the deployment of a Spectrum-X network using Ansible. You need to ensure that the playbooks can handle different switch models and configurations efficiently.
Which feature of the NVIDIA NVUE Collection helps simplify the automation by providing pre-built roles for common network configurations?
A fabric administrator added new servers to a 40-port edge switch. The administrator now needs to gather and map the LIDs and link speeds of the newly added ports. Which of the following commands can be used for that purpose?
You are using NVIDIA Air to simulate a Spectrum-X network for AI workloads. You want to ensure that your network configurations are optimal before deployment.
Which NVIDIA tool can be integrated with Air to validate network configurations in the digital twin environment?
What command sequence is used to identify the exact name of the server that runs as the master SM in a multi-node fabric?
A financial services company is planning to implement an AI infrastructure to support real-time fraud detection and risk assessment. They need a solution that can handle both training and inference workloads while maintaining data privacy and security.
Which NVIDIA reference architecture component would be most appropriate to address the data privacy and security concerns in this AI networking setup?
You are tasked with troubleshooting a link flapping issue in an InfiniBand AI fabric. You would like to start troubleshooting from the physical layer.
What is the right NVIDIA tool to be used for this task?
When designing a multi-tenancy East/West (E/W) fabric using Unified Fabric Manager (UFM), which method should be used?
You are configuring an InfiniBand network for an AI cluster and need to install the appropriate software stack. Which NVIDIA software package provides the necessary drivers and tools for InfiniBand configuration in Linux environments?