Sorting data is needed in many application domains. Traditionally, the data is read from memory and sent to a general-purpose processor or application-specific hardware for sorting. The sorted data is then written back to memory. Reading and writing data from and to memory, and transferring data between memory and the processing unit, incur significant latency and energy overhead. In this work, we develop the first architectures for in-memory sorting of data, to the best of our knowledge...