Tokenizing news articles in sentences and words