Cleaning the corpus