Configuring the RDMA Network Fabric Switch
You must perform an initial configuration of the RDMA Network Fabric switch.
Configuring the InfiniBand Switches
The two Sun Datacenter InfiniBand Switch 36 leaf switches are identified in Recovery Appliance as iba and ibb. Complete these configuration procedures for both switches:
Configuring an InfiniBand Switch
The default identifier for leaf switch 1 in U20 is iba, and for leaf switch 2 in U22 is ibb.
To configure a Sun Datacenter InfiniBand Switch 36 switch:
Setting the Serial Number on a Spine Switch
In a multirack configuration, set the rack master serial number in the ILOM of the spine switch. Skip this procedure when configuring the leaf switches.
To set the serial number on the spine switch:
Checking the Health of an InfiniBand Switch
To check the health of an InfiniBand switch:
-
Open the fabric management shell:
-> show /SYS/Fabric_Mgmt
NOTE: show on Fabric_Mgmt will launch a restricted Linux shell. User can execute switch diagnosis, SM Configuration and IB monitoring commands in the shell. To view the list of commands, use "help" at rsh prompt. Use exit command at rsh prompt to revert back to ILOM shell.
FabMan@hostname->
The prompt changes from -> to FabMan@hostname->.
-
Check the general health of the switch:
FabMan@ra1sw-iba-> showunhealthy
OK - No unhealthy sensors
-
Check the general environment:
FabMan@ra1sw-iba-> env_test
NM2 Environment test started:
Starting Voltage test:
Voltage ECB OK
Measured 3.3V Main = 3.28 V
Measured 3.3V Standby = 3.42 V
Measured 12V = 12.06 V
.
.
.
The report should show that fans 1, 2, and 3 are present, and fans 0 and 4 are not present. All OK and Passed results indicate that the environment is normal.
-
Determine the current InfiniBand subnet manager priority of the switch. Leaf switches must have an smpriority of 5, and spine switches must have an smpriority of 8. The sample output shown here indicates the correct priority for a leaf switch.
FabMan@ra1sw-iba-> setsmpriority list
Current SM settings:
smpriority 5
controlled_handover TRUE
subnet_prefix 0xfe80000000000000
-
If the priority setting is incorrect, then reset it:
-
Disable the subnet manager:
FabMan@ra1sw-iba-> disablesm
Stopping partitiond daemon. [ OK ]
Stopping IB Subnet Manager.. [ OK ]
-
Reset the priority. This example sets the priority on a leaf switch:
FabMan@ra1sw-iba-> setsmpriority 5
Current SM settings:
smpriority 5
controlled_handover TRUE
subnet_prefix 0xfe80000000000000
-
Restart the subnet manager:
FabMan@ra1sw-iba-> enablesm
Starting IB Subnet Manager. [ OK ]
Starting partitiond daemon. [ OK ]
-
Log out of the Fabric Management shell and the Oracle ILOM shell:
FabMan@ra1sw-iba-> exit
-> exit
-
Log in to Linux as root and restart the switch:
localhost: root
password: welcome1
[root@localhost ~]# reboot
-
Disconnect your laptop from the InfiniBand switch.
-
Repeat these procedures for the second InfiniBand leaf switch.
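The priority check in the preceding procedure can be scripted once the output of setsmpriority list has been captured as text. The following is a minimal Python sketch, not an Oracle-supplied tool. The names parse_sm_settings, priority_is_correct, and EXPECTED_PRIORITY are hypothetical, and the parser assumes that each setting appears on its own line as a name followed by a value, as in the sample output.

```python
# Hypothetical helper: checks captured "setsmpriority list" output.
# Assumes one "name value" pair per line after the "Current SM settings:" header.

EXPECTED_PRIORITY = {"leaf": 5, "spine": 8}  # values stated in this procedure


def parse_sm_settings(output):
    """Return the settings in captured 'setsmpriority list' output as a dict."""
    settings = {}
    for line in output.splitlines():
        parts = line.split()
        # Keep only two-token "name value" lines; skip the header and blank lines.
        if len(parts) == 2 and not parts[0].endswith(":"):
            settings[parts[0]] = parts[1]
    return settings


def priority_is_correct(output, role):
    """Return True if the reported smpriority matches the value for the role."""
    settings = parse_sm_settings(output)
    return settings.get("smpriority") == str(EXPECTED_PRIORITY[role])


sample = """Current SM settings:
smpriority 5
controlled_handover TRUE
subnet_prefix 0xfe80000000000000
"""

print(priority_is_correct(sample, "leaf"))   # checks against the leaf value, 5
print(priority_is_correct(sample, "spine"))  # checks against the spine value, 8
```

The values are compared as strings so that unexpected or missing output yields False rather than an exception.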
Setting a Spine Switch as the Subnet Manager Master
The InfiniBand switch located in rack unit 1 (U1) is the spine switch. Recovery Appliance has a spine switch only when it is connected to another Recovery Appliance. It is not included as a basic component of the rack.
Perform these steps after the racks are cabled together.
The spine switch is the Subnet Manager Master for the InfiniBand subnet. The Subnet Manager Master has priority 8.
To verify the priority setting of the spine switch:
-
Log in to the spine switch as the root user.
-
Run the setsmpriority list command.
The command should show that smpriority has a value of 8. If smpriority has a different value, then do the following:
-
Use the disablesm command to stop the Subnet Manager.
-
Use the setsmpriority 8 command to set the priority to 8.
-
Use the enablesm command to restart the Subnet Manager.
-
The other two InfiniBand switches are the leaf switches. The leaf switches are located in rack units 20 and 22 (U20 and U22). They are the Standby Subnet Managers with a priority of 5. You can verify the status using the preceding procedure, substituting a value of 5 in the command shown in step 2.b.
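The correction sequence is the same for spine and leaf switches and differs only in the priority value. As an illustration, the following Python sketch builds the commands to enter, in order, for a given role. The function name priority_fix_commands and the role labels are hypothetical, and the sketch only builds and prints strings. It does not connect to a switch.

```python
# Hypothetical helper: lists the commands used to correct smpriority.
# It only builds strings; nothing is sent to a switch.

PRIORITY_BY_ROLE = {"spine": 8, "leaf": 5}  # values stated in this section


def priority_fix_commands(role):
    """Return the commands, in order, that reset smpriority for the given role."""
    if role not in PRIORITY_BY_ROLE:
        raise ValueError("role must be 'spine' or 'leaf'")
    return [
        "disablesm",                                 # stop the Subnet Manager
        f"setsmpriority {PRIORITY_BY_ROLE[role]}",   # set the priority
        "enablesm",                                  # restart the Subnet Manager
    ]


for command in priority_fix_commands("leaf"):
    print(command)
```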
To determine the Subnet Manager Master:
-
Log in as the root user on any InfiniBand switch.
-
Display the location of the Subnet Manager Master.
# getmaster
20100701 11:46:38 OpenSM Master on Switch : 0x0021283a8516a0a0 ports 36 Sun DCS 36 QDR switch ra01sw-ib1.example.com enhanced port 0 lid 1 lmc 0
The preceding output shows the proper configuration. The Subnet Manager Master is running on spine switch ra01sw-ib1.example.com.
If the spine switch is not the Subnet Manager Master, then reset the Subnet Manager Master:
-
Use the getmaster command to identify the current location of the Subnet Manager Master.
-
Log in as the root user on the leaf switch that is the Subnet Manager Master.
-
Disable Subnet Manager on the switch. The Subnet Manager Master relocates to another switch.
See Also:
"Disable the Subnet Manager" in Sun Datacenter InfiniBand Switch 36 User's Guide at
http://docs.oracle.com/cd/E19197-01/835-0784-05/z4001de61813698.html#z40003f12047367
-
Use the getmaster command to identify the current location of the Subnet Manager Master. If the spine switch is not the Subnet Manager Master, then repeat steps 2 and 3 until the spine switch is the Subnet Manager Master.
-
Enable Subnet Manager on the leaf switches that were disabled during this procedure.
See Also:
"Enable the Subnet Manager" in Sun Datacenter InfiniBand Switch 36 User's Guide at
http://docs.oracle.com/cd/E19197-01/835-0784-05/z4001de61707660.html#z40003f12047359
Note:
If the InfiniBand network consists of four or more racks cabled together, then only the spine switches run Subnet Manager. Disable the Subnet Manager on the leaf switches.
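The getmaster check used in this procedure can also be applied to captured output. The following Python sketch extracts the GUID and host name of the Subnet Manager Master and compares the host name with the expected spine switch. The names find_master and spine_is_master are hypothetical, and the pattern assumes that the output has the same layout as the sample shown earlier, with the host name following the word "switch". Output from other firmware versions may differ.

```python
import re

# Hypothetical helper: reads captured "getmaster" output.
# Assumes the layout of the sample output in this section, where the host
# name of the master follows the lowercase word "switch".
MASTER_PATTERN = re.compile(
    r"OpenSM Master on Switch\s*:\s*(0x[0-9a-fA-F]+).*?switch\s+(\S+)",
    re.DOTALL,  # tolerate output that wraps across lines
)


def find_master(output):
    """Return (guid, hostname) of the Subnet Manager Master, or None."""
    match = MASTER_PATTERN.search(output)
    if match is None:
        return None
    return match.group(1), match.group(2)


def spine_is_master(output, spine_hostname):
    """Return True if the reported master host name is the spine switch."""
    master = find_master(output)
    return master is not None and master[1] == spine_hostname


sample = (
    "20100701 11:46:38 OpenSM Master on Switch : 0x0021283a8516a0a0 "
    "ports 36 Sun DCS 36 QDR switch ra01sw-ib1.example.com "
    "enhanced port 0 lid 1 lmc 0"
)

print(find_master(sample))
print(spine_is_master(sample, "ra01sw-ib1.example.com"))
```

If spine_is_master returns False, the relocation steps in this procedure apply.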