mirror of https://github.com/gsi-upm/sitc synced 2025-04-29 15:59:04 +00:00

Update 0_2_NLP_Assignment.ipynb

Carlos A. Iglesias 2025-04-24 18:31:56 +02:00 committed by GitHub
parent 8f2a5c17d8
commit 6e8448f22f
GPG Key ID: B5690EEEBB952194

@@ -82,7 +82,7 @@
 }
 },
 "source": [
-"### 1. List the first 10 tokens of the doc"
+"### 1. List the first 10 tokens of the doc."
 ]
 },
 {
@@ -149,7 +149,7 @@
 }
 },
 "source": [
-"### 7. Visualize the dependency grammar analysis of the second sentence"
+"### 7. Visualize the dependency grammar analysis of the second sentence."
 ]
 },
 {
@@ -178,7 +178,7 @@
 }
 },
 "source": [
-"### 9. List frequencies of POS in the document in a table "
+"### 9. List the frequencies of POS in the document in a table."
 ]
 },
 {
@@ -191,7 +191,7 @@
 "source": [
 "### 10. Preprocessing\n",
 "\n",
-"Remove from the doc stopwords, digits and punctuation.\n",
+"Remove from the doc stopwords, digits, and punctuation.\n",
 "\n",
 "Hint: check the token api https://spacy.io/api/token\n",
 "\n",
@@ -207,7 +207,7 @@
 },
 "source": [
 "### 11. Entities of the document\n",
-"Print the entities of the document, the type of the entity and what the explanation of the entity in a table with three columns.\n",
+"Print the entities of the document, the type of the entity, and the explanation of the entity in a table with three columns.\n",
 "\n",
 "Example:\n",
 "\n",
@@ -223,7 +223,7 @@
 },
 "source": [
 "### 12. Visualize the entities\n",
-"Show the entities in a graph."
+"Show the entities highlighted in the text."
 ]
 },
 {
@@ -236,7 +236,7 @@
 "source": [
 "# Movie review\n",
 "\n",
-"Classify the rmoview reviews from the following dataset https://data.world/rajeevsharma993/movie-reviews"
+"Classify the movie reviews from the following dataset https://data.world/rajeevsharma993/movie-reviews"
 ]
 },
 {