
Wasserstein implementation does not seem to be fully "batched" #10

Open

netw0rkf10w opened this issue Sep 15, 2021 · 1 comment

@netw0rkf10w

Hi @t-vi,

Thanks for sharing your code!

I would like to ask a question about your implementation of the Sinkhorn algorithm. You stated that one of the main motivations was efficient batched computation. However, looking at the code, I see that it only supports the case where the cost matrix is shared across the whole batch:

def forward(ctx, mu, nu, dist, lam=1e-3, N=100):
    assert mu.dim() == 2 and nu.dim() == 2 and dist.dim() == 2
    bs = mu.size(0)
    d1, d2 = dist.size()
    assert nu.size(0) == bs and mu.size(1) == d1 and nu.size(1) == d2

That is, the shape of dist is d1 x d2 rather than bs x d1 x d2, so every sample in the batch must use the same cost matrix. Is this expected?
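For concreteness, a fully batched variant would presumably take dist of shape bs x d1 x d2 and use batched matrix products throughout. Here is a minimal sketch of the textbook (non-stabilized) iteration; batched_sinkhorn is a hypothetical name for illustration, not something from your repository:

import torch

def batched_sinkhorn(mu, nu, dist, lam=1e-3, N=100):
    # mu: (bs, d1), nu: (bs, d2), dist: (bs, d1, d2) -- one cost matrix per sample
    bs, d1, d2 = dist.size()
    assert mu.size() == (bs, d1) and nu.size() == (bs, d2)
    K = torch.exp(-dist / lam)                       # Gibbs kernel, (bs, d1, d2)
    v = torch.ones_like(nu)                          # (bs, d2)
    for _ in range(N):
        u = mu / torch.bmm(K, v.unsqueeze(2)).squeeze(2)                   # (bs, d1)
        v = nu / torch.bmm(K.transpose(1, 2), u.unsqueeze(2)).squeeze(2)   # (bs, d2)
    pi = u.unsqueeze(2) * K * v.unsqueeze(1)         # transport plans, (bs, d1, d2)
    return (pi * dist).sum(dim=(1, 2))               # per-sample Sinkhorn cost, (bs,)

Of course, exp(-dist / lam) underflows quickly for small lam, which is presumably why your implementation works with log_u and log_v; the sketch is only meant to illustrate the batched shapes.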

Thank you in advance for your reply.

@netw0rkf10w (Author)

I also observe that the backward pass does not compute the gradient for dist:

@staticmethod
def backward(ctx, grad_out):
    return grad_out[:, None] * ctx.log_u * ctx.lam, grad_out[:, None] * ctx.log_v * ctx.lam, None, None, None

which unfortunately prevents learning dist (e.g., treating the cost matrix as a trainable parameter)...
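If I understand correctly, the missing gradient is just the transport plan: by the envelope theorem, the derivative of the entropy-regularized cost with respect to the cost matrix is the optimal plan pi. Assuming the forward pass additionally saved pi in ctx (hypothetical; the current code does not), the backward could look something like:

@staticmethod
def backward(ctx, grad_out):
    grad_mu = grad_out[:, None] * ctx.log_u * ctx.lam
    grad_nu = grad_out[:, None] * ctx.log_v * ctx.lam
    # Envelope theorem: d(cost)/d(dist) is the transport plan pi.
    # ctx.pi (bs x d1 x d2) is hypothetical -- forward would need to save it.
    # Since dist is shared across the batch (d1 x d2), the per-sample
    # contributions are summed over the batch dimension.
    grad_dist = (grad_out[:, None, None] * ctx.pi).sum(0)
    return grad_mu, grad_nu, grad_dist, None, None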
