Document similarity with the 20 Newsgroups dataset and tf-idf features