mirror of https://github.com/gsi-upm/sitc synced 2025-12-15 09:38:16 +00:00

Fix visualization section

Stefano
2017-12-11 18:12:06 +01:00
parent fdf696380d
commit 23073b3431
4 changed files with 8 additions and 7 deletions


@@ -65,9 +65,10 @@
"source": [
"This section covers different ways to inspect the distribution of samples per feature.\n",
"\n",
-"First of all, let's take a see how many samples of each class we have, using a [histogram](https://en.wikipedia.org/wiki/Histogram). \n",
+"First of all, let's see how many samples of each class we have, using a [histogram](https://en.wikipedia.org/wiki/Histogram). \n",
"\n",
-"A histogram is a graphical representation of the distribution of numerical data. It is an estimate of the probability distribution of a continuous variable (quantitative variable). \n",
+"A histogram is a graphical representation of the distribution of numerical data. It is an estimation of the probability distribution of a continuous variable (quantitative variable). \n",
"\n",
"For building a histogram, we need first to 'bin' the range of values—that is, divide the entire range of values into a series of intervals—and then count how many values fall into each interval. \n",
"\n",
@@ -151,7 +152,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
-"We see we have the same distribution of samples for every class.\n",
+"As can be seen, we have the same distribution of samples for every class.\n",
"The next step is to see the distribution of the features"
]
},