1
0
鏡像自 https://github.com/gsi-upm/sitc 已同步 2025-09-16 11:52:20 +00:00

比較提交

..

2 次程式碼提交

作者 SHA1 備註 日期
Carlos A. Iglesias
75f08ea170 Merge pull request #5 from gsi-upm/dveni-patch-2
Update 4_1_Lexical_Processing.ipynb
2019-11-27 10:19:12 +01:00
Dani Vera
19ea5dff09 Update 4_1_Lexical_Processing.ipynb 2019-11-26 15:14:40 +01:00

查看文件

@@ -326,7 +326,7 @@
"def preprocess(words, type='doc'):\n",
" if (type == 'tweet'):\n",
" tknzr = TweetTokenizer(strip_handles=True, reduce_len=True)\n",
" tokens = tknzr.tokenize(tweet)\n",
" tokens = tknzr.tokenize(words)\n",
" else:\n",
" tokens = nltk.word_tokenize(words.lower())\n",
" porter = nltk.PorterStemmer()\n",