Compare commits

No commits in common. '75f08ea170340ede65eb14736048e8157919be09' and 'e70689072f697a541c4d196f9933864db95c23b2' have entirely different histories.

@@ -326,7 +326,7 @@
 "def preprocess(words, type='doc'):\n",
 "    if (type == 'tweet'):\n",
 "        tknzr = TweetTokenizer(strip_handles=True, reduce_len=True)\n",
-"        tokens = tknzr.tokenize(words)\n",
+"        tokens = tknzr.tokenize(tweet)\n",
 "    else:\n",
 "        tokens = nltk.word_tokenize(words.lower())\n",
 "    porter = nltk.PorterStemmer()\n",
