RDMA/mlx5: Don't add slave port to unaffiliated list
authorLeon Romanovsky <leonro@nvidia.com>
Mon, 31 May 2021 16:04:44 +0000 (19:04 +0300)
committerJason Gunthorpe <jgg@nvidia.com>
Wed, 16 Jun 2021 18:26:01 +0000 (15:26 -0300)
The mlx5_ib_bind_slave_port() doesn't remove multiport device from the
unaffiliated list, but mlx5_ib_unbind_slave_port() did it. This unbalanced
flow caused to the situation where mlx5_ib_unaffiliated_port_list was
changed during iteration.

Fixes: 32f69e4be269 ("{net, IB}/mlx5: Manage port association for multiport RoCE")
Link: https://lore.kernel.org/r/2726e6603b1e6ecfe76aa5a12a063af72173bcf7.1622477058.git.leonro@nvidia.com
Reported-by: Dan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: Leon Romanovsky <leonro@nvidia.com>
Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
drivers/infiniband/hw/mlx5/main.c

index 312aa73..18f5fc8 100644 (file)
@@ -3182,8 +3182,6 @@ static void mlx5_ib_unbind_slave_port(struct mlx5_ib_dev *ibdev,
 
        port->mp.mpi = NULL;
 
-       list_add_tail(&mpi->list, &mlx5_ib_unaffiliated_port_list);
-
        spin_unlock(&port->mp.mpi_lock);
 
        err = mlx5_nic_vport_unaffiliate_multiport(mpi->mdev);
@@ -3332,6 +3330,8 @@ static void mlx5_ib_cleanup_multiport_master(struct mlx5_ib_dev *dev)
                                mlx5_ib_dbg(dev, "unbinding port_num: %u\n",
                                            i + 1);
                                mlx5_ib_unbind_slave_port(dev, dev->port[i].mp.mpi);
+                               list_add_tail(&dev->port[i].mp.mpi->list,
+                                             &mlx5_ib_unaffiliated_port_list);
                        }
                }
        }