Currently, I am especially interested in the problem of identifying similarities between high-dimensional datasets. Very often, data may be collected by a number of sources, which may be unable to share their entire datasets for reasons like confidentiality agreements, dataset location and size, etc. If there exists some similar substructure between distinct datasets, this may be exploited. For...