Categorical attributes are those that can take a discrete set of values, e.g., colours. This work is about compressing vectors over categorical to low-dimension vectors. The current hash-based methods do not provide any guarantee on the Hamming distances between compressed representations. Here we present FSketch create sketches for sparse data and an estimator estimate pairwise among uncompres...