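Both ucma_destroy_id() and ucma_close_id() (triggered from an event via a
workqueue) can drive the refcount to zero. ucma_get_ctx() wrongly assumed
that the refcount can only reach zero from ucma_destroy_id(), which also
removes the ctx from the xarray. When ucma_close_id() drops the last
reference, ucma_get_ctx() can still find the ctx in the xarray with a
refcount of zero, and refcount_inc() must not be used on it.

Use refcount_inc_not_zero() instead, and return -ENXIO when it fails.
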
Link: https://lore.kernel.org/r/20200818120526.702120-2-leon@kernel.org
Signed-off-by: Leon Romanovsky <leonro@mellanox.com>
Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>

        if (!IS_ERR(ctx)) {
                if (ctx->closing)
                        ctx = ERR_PTR(-EIO);
-               else
-                       refcount_inc(&ctx->ref);
+               else if (!refcount_inc_not_zero(&ctx->ref))
+                       ctx = ERR_PTR(-ENXIO);
        }
        xa_unlock(&ctx_table);
        return ctx;
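
The difference between an unconditional increment and an increment that
refuses to revive a dead object can be sketched in userspace. This is a
minimal illustration using C11 atomics, not the kernel's refcount_t
implementation; struct demo_ref and the demo_ref_*() helpers are
hypothetical names invented for this example.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for a reference counter. */
struct demo_ref {
	atomic_uint count;
};

/*
 * Take a reference only if the object is still live (count != 0).
 * Returns false when the count has already dropped to zero, so the
 * caller must treat the object as gone.
 */
static bool demo_ref_inc_not_zero(struct demo_ref *r)
{
	unsigned int old = atomic_load(&r->count);

	while (old != 0) {
		/* On failure, old is updated to the current value. */
		if (atomic_compare_exchange_weak(&r->count, &old, old + 1))
			return true;
	}
	return false;
}

/* Drop a reference; returns true when this was the last one. */
static bool demo_ref_dec_and_test(struct demo_ref *r)
{
	unsigned int old = atomic_fetch_sub(&r->count, 1);

	assert(old != 0);	/* catch underflow in this sketch */
	return old == 1;
}
```

With a counter initialised to 1, demo_ref_inc_not_zero() succeeds while a
reference is still held. Once demo_ref_dec_and_test() has returned true,
it fails, which is the case the hunk above maps to -ENXIO.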