R Tutorial: Tokenizing and cleaning