We drew on techniques from machine learning and information retrieval to develop a robust keyword identification system. In particular, we adapted the TF-IDF heuristic for feature selection; investigated linear regression, logistic regression, and SVM for predicting the number of keywords (tags); and implemented the Naive Bayes and Labeled Latent Dirichlet Allocation (L-LDA) algorithms ...
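
As an illustration of TF-IDF-based feature selection, the sketch below scores each term in a tokenized document by term frequency times inverse document frequency and keeps the top-scoring terms as candidate features. It is the textbook formulation, not necessarily the adapted variant used in this work, whose details are not given here. The function name `tfidf_top_terms`, the parameter `k`, and the toy documents are assumptions made for the example.

```python
import math
from collections import Counter


def tfidf_top_terms(docs, k=2):
    """Return the k highest-scoring TF-IDF terms for each tokenized document.

    Hypothetical helper: uses plain tf * log(N / df), which is an assumption,
    not the adapted heuristic described in the text.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    selected = []
    for doc in docs:
        tf = Counter(doc)
        scores = {
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        }
        # Rank by descending score; break ties alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        selected.append(ranked[:k])
    return selected


# Toy corpus (made up for illustration): three short question titles.
docs = [
    "how to sort a list in python".split(),
    "how to merge a list in java".split(),
    "how to parse json in python".split(),
]

if __name__ == "__main__":
    for doc, terms in zip(docs, tfidf_top_terms(docs, k=2)):
        print(" ".join(doc), "->", terms)
```

Terms that occur in every document (such as "how" and "to" in the toy corpus) receive a score of zero, so they are never preferred over terms that distinguish one document from the rest.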