uNet neural network architecture which takes multiple (X) tensors as input and contains Spatial Transformer units (ST)