The proliferation of data collection technologies often results in large sets with many observations and variables. In practice, highly relevant engineered features are groups predictors that share a common regression coefficient (i.e., the group affect response only via their collective sum), where unknown advance must be discovered from data. We propose an algorithm called tree (CTR) to disco...