Compare commits
219 Commits
9937490213
...
master
Author | SHA1 | Date | |
---|---|---|---|
|
9844820e66 | ||
|
d10434362e | ||
|
fb2135cea6 | ||
|
ba6e533e0b | ||
|
4f5e976918 | ||
|
b58370a19a | ||
|
5c203b0884 | ||
|
5bf815f60f | ||
|
90a3ff098b | ||
|
945a8a7fb6 | ||
|
6532ef1b27 | ||
|
3a73b2b286 | ||
|
2e4ec3cfdc | ||
|
21e7ae2f57 | ||
|
7b4d16964d | ||
|
c5967746ea | ||
|
ed7f0f3e1c | ||
|
9324516c19 | ||
|
6fc5565ea0 | ||
|
1113485833 | ||
|
0c3f317a85 | ||
|
0b550c837b | ||
|
d7ce6df7fe | ||
|
e2edae6049 | ||
|
4ea0146def | ||
|
e7b2cee795 | ||
|
9e1d0e5534 | ||
|
f82203f371 | ||
|
b9ecccdeab | ||
|
44a555ac2d | ||
|
ec11ff2d5e | ||
|
ec02125396 | ||
|
b5f1a7dd22 | ||
|
1cc1e45673 | ||
|
a2ad2c0e92 | ||
|
1add6a4c8e | ||
|
af78e6480d | ||
|
cae7d8cbb2 | ||
|
f58aa6c0b8 | ||
|
6e8448f22f | ||
|
8f2a5c17d8 | ||
|
36d117e417 | ||
|
2fc057f6f9 | ||
|
5b0d4f2a5d | ||
|
7afa2b3b22 | ||
|
4e0f9159e8 | ||
|
82aa552976 | ||
|
3ebff69cf8 | ||
|
0f228bbec3 | ||
|
64c8854741 | ||
|
3e081e5d83 | ||
|
065797b886 | ||
|
8d2f625b7e | ||
|
26eda30a71 | ||
|
55365ae927 | ||
|
152125b3da | ||
|
97362545ea | ||
|
c49c866a2e | ||
|
3f7694e330 | ||
|
bf684d6e6e | ||
|
d935b85b26 | ||
|
1d8e777236 | ||
|
23ebe2f390 | ||
|
01eb89ada4 | ||
|
e4fdcd65a1 | ||
|
9f46c534f7 | ||
|
743c57691f | ||
|
2c53b81299 | ||
|
dd6c053109 | ||
|
e35e0a11e9 | ||
|
7315b681e4 | ||
|
3fac9c6f78 | ||
|
21819abeae | ||
|
0d4c0c706d | ||
|
8de629b495 | ||
|
86114b4a56 | ||
|
1a3f618995 | ||
|
a1121c03a5 | ||
|
715d0cb77f | ||
|
0150ce7cf7 | ||
|
08dfe5c147 | ||
|
78e62af098 | ||
|
3f5eba3e84 | ||
|
2de1cda8f1 | ||
|
cc442c35f3 | ||
|
1100c352fa | ||
|
9b573d292d | ||
|
dd8a4f50d8 | ||
|
47148f2ccc | ||
|
8ffda8123a | ||
|
6629837e7d | ||
|
ba08a9a264 | ||
|
4b8fd30f42 | ||
|
d879369930 | ||
|
4da01f3ae6 | ||
|
da9a01e26b | ||
|
dc23b178d7 | ||
|
5410d6115d | ||
|
6749aa5deb | ||
|
c31e6c1676 | ||
|
1c7496c8ac | ||
|
35b1ae4ec8 | ||
|
58fc6f5e9c | ||
|
91147becee | ||
|
1530995243 | ||
|
0c0960cec7 | ||
|
3363c953f4 | ||
|
542ce2708d | ||
|
380340d66d | ||
|
7f49f8990b | ||
|
419ea57824 | ||
|
7d6010114d | ||
|
f9d8234e14 | ||
|
d41fa61c65 | ||
|
05a4588acf | ||
|
50933f6c94 | ||
|
68ba528dd7 | ||
|
897bb487b1 | ||
|
41d3bdea75 | ||
|
0a9cd3bd5e | ||
|
2c7c9e58e0 | ||
|
f0278aea33 | ||
|
7bf0fb6479 | ||
|
4d87b07ed9 | ||
|
7d71ba5f7a | ||
|
1124c9129c | ||
|
df6449b55f | ||
|
d99eeb733a | ||
|
a43fb4c78c | ||
|
bf21e3ceab | ||
|
e41d233828 | ||
|
a7c6be5b96 | ||
|
11a1ea80d3 | ||
|
a209d18a5b | ||
|
ffefd8c2e3 | ||
|
f43cde73e4 | ||
|
8784fdc773 | ||
|
a6d5f9ddeb | ||
|
2e72a4d729 | ||
|
9426b4c061 | ||
|
5e5979d515 | ||
|
270dcec611 | ||
|
e6e52b43ee | ||
|
3b7675fa3f | ||
|
44c63412f9 | ||
|
5febbc21a4 | ||
|
66ed4ba258 | ||
|
95cd25aef4 | ||
|
955e74fc8e | ||
|
6743dad100 | ||
|
729f7684c2 | ||
|
ae8d3d3ba2 | ||
|
2ba0e2f3d9 | ||
|
c9114cc796 | ||
|
b80c097362 | ||
|
161cd8492b | ||
|
3d6d96dd8a | ||
|
44aa3d24fb | ||
|
8925a4a3c1 | ||
|
23913811df | ||
|
7b4391f187 | ||
|
0c100dbadc | ||
|
2f7cbe9e45 | ||
|
b43125ca59 | ||
|
5144b7f228 | ||
|
8b6d6de169 | ||
|
7271b5e632 | ||
|
bd99321d6b | ||
|
91b8f66056 | ||
|
242a0a9252 | ||
|
d8d25c4dc3 | ||
|
4979fe6877 | ||
|
c5e0f146c4 | ||
|
167475029e | ||
|
da79a18bfc | ||
|
47761c11aa | ||
|
fd5aa4a1fd | ||
|
396a7b17ca | ||
|
2248188219 | ||
|
21dc5ec3de | ||
|
db99033727 | ||
|
5459e801d5 | ||
|
75f08ea170 | ||
|
19ea5dff09 | ||
|
e70689072f | ||
|
344e054ba4 | ||
|
2c8238f1f2 | ||
|
e42299ac7a | ||
|
9d1b88dfea | ||
|
ae3c34f94c | ||
|
06e51db4e3 | ||
|
b42ba2fe58 | ||
|
0aa095b40d | ||
|
ecfa8924c8 | ||
|
b3239cbbab | ||
|
842b6307f1 | ||
|
762157bfe1 | ||
|
d4b59e702d | ||
|
267421e5b8 | ||
|
a1abd4b766 | ||
|
c1d3ca38ea | ||
|
a1be167cc0 | ||
|
4d339a1a83 | ||
|
47fe85d527 | ||
|
f9965fdbcd | ||
|
e824fd8fed | ||
|
76d08a9e40 | ||
|
ba5bb34eb2 | ||
|
4f12fac0de | ||
|
fafce65bd3 | ||
|
9332fd6f80 | ||
|
d551ee44c5 | ||
|
e573852e70 | ||
|
1086b9818a | ||
|
f039465f5e | ||
|
a4f8f69b19 | ||
|
fc07718ae8 | ||
|
1f5318a357 | ||
|
d857869c06 |
@@ -14,10 +14,42 @@ Also note that we have a code of conduct, please follow it in all your interacti
|
||||
2. If you are adding code, ensure the changed notebooks can be run in a fresh environment. Include instructions to download
|
||||
any additional dependencies.
|
||||
3. Ensure any spurious changes are removed, such as compilation files (`pyc`) or metadata changes in a notebook.
|
||||
You can automatically do so using nbstripout:
|
||||
```
|
||||
pip install nbstripout
|
||||
nbstripout --install
|
||||
```
|
||||
This will install a git hook that strips all metadata from the notebooks before you commit changes to git.
|
||||
|
||||
You can also remove the output from all cells using this command:
|
||||
|
||||
```
|
||||
make clean
|
||||
```
|
||||
|
||||
To limit the command to a specific folder (e.g. ml1):
|
||||
```
|
||||
make FOLDER=ml1 clean
|
||||
```
|
||||
4. Submit your pull request on GitHub.
|
||||
5. A member of the GSI-UPM group will review your request.
|
||||
6. The reviewer may ask for further changes before merging the contribution. Please, follow the reviewer's instructions before resubmitting.
|
||||
|
||||
|
||||
## Testing the changes
|
||||
|
||||
You can execute all notebooks at once to make sure they work with this command:
|
||||
|
||||
```
|
||||
make exec
|
||||
```
|
||||
|
||||
To only check notebooks in a specific folder, run:
|
||||
|
||||
```
|
||||
make FOLDER=ml2 exec # Run all notebooks in the ml2 folder
|
||||
```
|
||||
|
||||
## Code of Conduct
|
||||
|
||||
### Our Pledge
|
||||
|
11
Makefile
Normal file
@@ -0,0 +1,11 @@
|
||||
FOLDER:=.
|
||||
ERROR:=255
|
||||
|
||||
exec:
|
||||
find $(FOLDER) -iname '*.ipynb' -print0 | xargs -n 1 -0 sh -c 'jupyter nbconvert --execute --ClearOutputPreprocessor.enabled=True --inplace $$0 || exit $(ERROR)'
|
||||
|
||||
clean:
|
||||
find $(FOLDER) -iname '*.ipynb' -print0 | xargs -n 1 -0 sh -c 'nbstripout $$0 || exit $(ERROR)'
|
||||
|
||||
|
||||
.PHONY: exec clean
|
12
README.md
@@ -1,19 +1,21 @@
|
||||
# sitc
|
||||
Exercises for Intelligent Systems Course at Universidad Politécnica de Madrid, Telecommunication Engineering School. This material is used in the subjects
|
||||
- SITC (Sistemas Inteligentes y Tecnologías del Conocimiento) - Master Universitario de Ingeniería de Telecomunicación (MUIT)
|
||||
- TIAD (Tecnologías Inteligentes de Análisis de Datos) - Master Universitario en Ingeniera de Redes y Servicios Telemáticos)
|
||||
- CDAW (Ciencia de datos y aprendizaje en automático en la web de datos) - Master Universitario de Ingeniería de Telecomunicación (MUIT)
|
||||
- ABID (Analítica de Big Data) - Master Universitario en Ingeniera de Redes y Servicios Telemáticos)
|
||||
|
||||
For following this course:
|
||||
- Follow the instructions to install the environment: https://github.com/gsi-upm/sitc/blob/master/python/1_1_Notebooks.ipynb (Just install 'conda')
|
||||
- Download the course: use 'https://github.com/gsi-upm/sitc'
|
||||
- Run in a terminal in the foloder sitc: jupyter notebook (and enjoy)
|
||||
- Download the course: use 'https://github.com/gsi-upm/sitc' (or clone the repository to receive updates).
|
||||
- Run in a terminal in the folder sitc: jupyter notebook (and enjoy)
|
||||
|
||||
Topics
|
||||
* Python: quick introduction to Python
|
||||
* Python: a quick introduction to Python
|
||||
* ML-1: introduction to machine learning with scikit-learn
|
||||
* ML-2: introduction to machine learning with pandas and scikit-learn
|
||||
* ML-21: preprocessing and visualizatoin
|
||||
* ML-3: introduction to machine learning. Neural Computing
|
||||
* ML-4: introduction to Evolutionary Computing
|
||||
* ML-5: introduction to Reinforcement Learning
|
||||
* NLP: introduction to NLP
|
||||
* LOD: Linked Open Data, exercises and example code
|
||||
* SNA: Social Network Analysis
|
||||
|
BIN
images/EscUpmPolit_p.gif
Normal file
After Width: | Height: | Size: 3.1 KiB |
BIN
images/cart.png
Normal file
After Width: | Height: | Size: 95 KiB |
BIN
images/data-chart-type.png
Normal file
After Width: | Height: | Size: 34 KiB |
BIN
images/frozenlake-problem.png
Normal file
After Width: | Height: | Size: 54 KiB |
BIN
images/frozenlake-world.png
Normal file
After Width: | Height: | Size: 67 KiB |
BIN
images/gym-maze.gif
Normal file
After Width: | Height: | Size: 222 KiB |
BIN
images/iris-classes.png
Normal file
After Width: | Height: | Size: 1.4 MiB |
BIN
images/iris-dataset.jpg
Normal file
After Width: | Height: | Size: 44 KiB |
BIN
images/iris-features.png
Normal file
After Width: | Height: | Size: 944 KiB |
BIN
images/machine-learning-process.jpg
Normal file
After Width: | Height: | Size: 237 KiB |
BIN
images/multilayerperceptron_network.png
Normal file
After Width: | Height: | Size: 87 KiB |
BIN
images/plot_ML_flow_chart_1.png
Normal file
After Width: | Height: | Size: 56 KiB |
BIN
images/plot_ML_flow_chart_2.png
Normal file
After Width: | Height: | Size: 87 KiB |
BIN
images/plot_ML_flow_chart_3.png
Normal file
After Width: | Height: | Size: 58 KiB |
BIN
images/qlearning-algo.png
Normal file
After Width: | Height: | Size: 85 KiB |
BIN
images/recording.gif
Normal file
After Width: | Height: | Size: 1.8 MiB |
BIN
images/titanic.jpg
Normal file
After Width: | Height: | Size: 152 KiB |
484
lod/00_SPARQL_Tutorial.ipynb
Normal file
@@ -0,0 +1,484 @@
|
||||
{
|
||||
"cells": [
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"<header style=\"width:100%;position:relative\">\n",
|
||||
" <div style=\"width:80%;float:right;\">\n",
|
||||
" <h1>Course Notes for Learning Intelligent Systems</h1>\n",
|
||||
" <h3>Department of Telematic Engineering Systems</h3>\n",
|
||||
" <h5>Universidad Politécnica de Madrid. © Carlos A. Iglesias </h5>\n",
|
||||
" </div>\n",
|
||||
" <img style=\"width:15%;\" src=\"../logo.jpg\" alt=\"UPM\" />\n",
|
||||
"</header>"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Introduction\n",
|
||||
"\n",
|
||||
"This lecture provides an introduction to RDF and the SPARQL query language.\n",
|
||||
"\n",
|
||||
"This is the first in a series of notebooks about SPARQL, which consists of:\n",
|
||||
"\n",
|
||||
"* This notebook, which explains basic concepts of RDF and SPARQL\n",
|
||||
"* [A notebook](01_SPARQL_Introduction.ipynb) that provides an introduction to SPARQL through a collection of exercises of increasing difficulty.\n",
|
||||
"* [An optional notebook](02_SPARQL_Custom_Endpoint.ipynb) with queries to a custom dataset.\n",
|
||||
"The dataset is meant to be done after the [RDF exercises](../rdf/RDF.ipynb) and it is out of the scope of this course.\n",
|
||||
"You can consult it if you are interested."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"# RDF basics\n",
|
||||
"This section is taken from [[1](#1), [2](#2)].\n",
|
||||
"\n",
|
||||
"RDF allows us to make statements about resources. The format of these statements is simple. A statement always has the following structure:\n",
|
||||
"\n",
|
||||
" <subject> <predicate> <object>\n",
|
||||
" \n",
|
||||
"An RDF statement expresses a relationship between two resources. The **subject** and the **object** represent the two resources being related; the **predicate** represents the nature of their relationship.\n",
|
||||
"The relationship is phrased in a directional way (from subject to object).\n",
|
||||
"In RDF this relationship is known as a **property**.\n",
|
||||
"Because RDF statements consist of three elements they are called **triples**.\n",
|
||||
"\n",
|
||||
"Here are some examples of RDF triples (informally expressed in pseudocode):\n",
|
||||
"\n",
|
||||
" <Bob> <is a> <person>.\n",
|
||||
" <Bob> <is a friend of> <Alice>.\n",
|
||||
" \n",
|
||||
"Resources are identified by [IRIs](https://en.wikipedia.org/wiki/Internationalized_Resource_Identifier), which can appear in all three positions of a triple. For example, the IRI for Leonardo da Vinci in DBpedia is:\n",
|
||||
"\n",
|
||||
" <http://dbpedia.org/resource/Leonardo_da_Vinci>\n",
|
||||
"\n",
|
||||
"IRIs can be abbreviated as *prefixed names*. For example, \n",
|
||||
" PREFIX dbr: <http://dbpedia.org/resource/>\n",
|
||||
" <dbr:Leonardo_da_Vinci>\n",
|
||||
" \n",
|
||||
"Objects can be literals: \n",
|
||||
" * strings (e.g., \"plain string\" or \"string with language\"@en)\n",
|
||||
" * numbers (e.g., \"13.4\"^^xsd:float)\n",
|
||||
" * dates (e.g., )\n",
|
||||
" * booleans\n",
|
||||
" * etc.\n",
|
||||
" \n",
|
||||
"RDF data is stored in RDF repositories that expose SPARQL endpoints.\n",
|
||||
"Let's query one of the most famous RDF repositories: [dbpedia](https://wiki.dbpedia.org/).\n",
|
||||
"First, we should learn how to execute SPARQL in a notebook."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"# Executing SPARQL in a notebook\n",
|
||||
"There are several ways to execute SPARQL in a notebook.\n",
|
||||
"Some of the most popular are:\n",
|
||||
"\n",
|
||||
"* using libraries such as [sparql-client](https://pypi.org/project/sparql-client/) or [rdflib](https://rdflib.dev/sparqlwrapper/) that enable executing SPARQL within a Python3 kernel\n",
|
||||
"* using other libraries. In our case, a light library has been developed (the file helpers.py) for accessing SPARQL endpoints using an HTTP connection.\n",
|
||||
"* using the [graph notebook package](https://pypi.org/project/graph-notebook/)\n",
|
||||
"* using a SPARQL kernel [sparql kernel](https://github.com/paulovn/sparql-kernel) instead of the Python3 kernel\n",
|
||||
"\n",
|
||||
"\n",
|
||||
"We are going to use the second option to avoid installing new packages.\n",
|
||||
"\n",
|
||||
"To use the library, you need to:\n",
|
||||
"\n",
|
||||
"1. Import `sparql` from helpers (i.e., `helpers.py`, a file that is available in the github repository)\n",
|
||||
"2. Use the `%%sparql` magic command to indicate the SPARQL endpoint and then the SPARQL code.\n",
|
||||
"\n",
|
||||
"Let's try it!\n",
|
||||
"\n",
|
||||
"# Queries agains DBPedia\n",
|
||||
"\n",
|
||||
"We are going to execute a SPARQL query against DBPedia. This section is based on [[8](#8)].\n",
|
||||
"\n",
|
||||
"First, we just create a query to retrieve arbitrary triples (subject, predicate, object) without any restriction (besides limiting the result to 10 triples)."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"from helpers import sparql"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"SELECT ?s ?p ?o\n",
|
||||
"WHERE {\n",
|
||||
" ?s ?p ?o\n",
|
||||
"}\n",
|
||||
"LIMIT 10"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Well, it worked, but the results are not particulary interesting. \n",
|
||||
"Let's search for a famous football player, Fernando Torres.\n",
|
||||
"\n",
|
||||
"To do so, we will search for entities whose English \"human-readable representation\" (i.e., label) matches \"Fernando Torres\":"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?athlete rdfs:label \"Fernando Torres\"@en \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Great, we found the IRI of the node: `http://dbpedia.org/resource/Fernando_Torres`\n",
|
||||
"\n",
|
||||
"Now we can start asking for more properties.\n",
|
||||
"\n",
|
||||
"To do so, go to http://dbpedia.org/resource/Fernando_Torres and you will see all the information available about Fernando Torres. Pay attention to the names of predicates to be able to create new queries. For example, we are interesting in knowing where Fernando Torres was born (`dbo:birthPlace`).\n",
|
||||
"\n",
|
||||
"Let's go!"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?athlete rdfs:label \"Fernando Torres\"@en ;\n",
|
||||
" dbo:birthPlace ?birthPlace . \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"If we examine the SPARQL query, we find three blocks:\n",
|
||||
"\n",
|
||||
"* **PREFIX** section: IRIs of vocabularies and the prefix used below, to avoid long IRIs. e.g., by defining the `dbo` prefix in our example, the `dbo:birthPlace` below expands to `http://dbpedia.org/ontology/birthPlace`.\n",
|
||||
"* **SELECT** section: variables we want to return (`*` is an abbreviation that selects all of the variables in a query)\n",
|
||||
"* **WHERE** clause: triples where some elements are variables. These variables are bound during the query processing process and bounded variables are returned.\n",
|
||||
"\n",
|
||||
"Now take a closer look at the **WHERE** section.\n",
|
||||
"We said earlier that triples are made out of three elements and each triple pattern should finish with a period (`.`) (although the last pattern can omit this).\n",
|
||||
"However, when two or more triple patterns share the same subject, we omit it all but the first one, and use ` ;` as separator.\n",
|
||||
"If if both the subject and predicate are the same, we could use a coma `,` instead.\n",
|
||||
"This allows us to avoid repetition and make queries more readable.\n",
|
||||
"But don't forget the space before your separators (`;` and `.`).\n",
|
||||
"\n",
|
||||
"The result is interesting, we know he was born in Fuenlabrada, but we see an additional (wrong) value, the Spanish national football team. The conversion process from Wikipedia to DBPedia should still be tuned :).\n",
|
||||
"\n",
|
||||
"We can *fix* it, by adding some more constaints.\n",
|
||||
"In our case, only want a birth place that is also a municipality (i.e., its type is `http://dbpedia.org/resource/Municipalities_of_Spain`).\n",
|
||||
"Let's see!"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbr: <http://dbpedia.org/resource/>\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?athlete rdfs:label \"Fernando Torres\"@en ;\n",
|
||||
" dbo:birthPlace ?birthPlace .\n",
|
||||
" ?birthPlace dbo:type dbr:Municipalities_of_Spain \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Great. Now it looks better.\n",
|
||||
"Notice that we added a new prefix.\n",
|
||||
"\n",
|
||||
"Now, is Fuenlabrada is a big city?\n",
|
||||
"Let's find out.\n",
|
||||
"\n",
|
||||
"**Hint**: you can find more subject / object / predicate nodes related to [Fuenlabrada])http://dbpedia.org/resource/Fuenlabrada) in the RDF graph just as we did before.\n",
|
||||
"That is how we found the `dbo:areaTotal` property."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbr: <http://dbpedia.org/resource/>\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" dbr:Fuenlabrada dbo:areaTotal ?area \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Well, it shows 39.1 km$^2$.\n",
|
||||
"\n",
|
||||
"Let's go back to our Fernando Torres.\n",
|
||||
"What we are really insterested in is the name of the city he was born in, not its IRI.\n",
|
||||
"As we saw before, the human-readable name is provided by the `rdfs:label` property:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbp: <http://dbpedia.org/property/>\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?player rdfs:label \"Fernando Torres\"@en ;\n",
|
||||
" dbo:birthPlace ?birthPlace .\n",
|
||||
" ?birthPlace dbo:type dbr:Municipalities_of_Spain ;\n",
|
||||
" rdfs:label ?placeName \n",
|
||||
" \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Well, we are almost there. We see that we receive the city name in many languages. We want just the English name. Let's filter!"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbp: <http://dbpedia.org/property/>\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?player rdfs:label \"Fernando Torres\"@en ;\n",
|
||||
" dbo:birthPlace ?birthPlace .\n",
|
||||
" ?birthPlace dbo:type dbr:Municipalities_of_Spain ;\n",
|
||||
" rdfs:label ?placeName .\n",
|
||||
" FILTER ( LANG ( ?placeName ) = 'en' )\n",
|
||||
" \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Awesome!\n",
|
||||
"\n",
|
||||
"But we said we don't care about the IRI of the place. We only want two pieces of data: Fernando's birth date and the name of his birthplace.\n",
|
||||
"\n",
|
||||
"Let's tune our query a bit more."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbp: <http://dbpedia.org/property/>\n",
|
||||
"\n",
|
||||
"SELECT ?birthDate, ?placeName\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?player rdfs:label \"Fernando Torres\"@en ;\n",
|
||||
" dbo:birthDate ?birthDate ;\n",
|
||||
" dbo:birthPlace ?birthPlace .\n",
|
||||
" ?birthPlace dbo:type dbr:Municipalities_of_Spain ;\n",
|
||||
" rdfs:label ?placeName .\n",
|
||||
" FILTER ( LANG ( ?placeName ) = 'en' )\n",
|
||||
" \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Great 😃\n",
|
||||
"\n",
|
||||
"Are there many football players born in Fuenlabrada? Let's find out!"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbp: <http://dbpedia.org/property/>\n",
|
||||
"\n",
|
||||
"SELECT *\n",
|
||||
"WHERE\n",
|
||||
" {\n",
|
||||
" ?player a dbo:SoccerPlayer ; \n",
|
||||
" dbo:birthPlace dbr:Fuenlabrada . \n",
|
||||
" }"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Well, not that many. Observe we have used `a`.\n",
|
||||
"It is just an abbreviation for `rdf:type`, both can be used interchangeably.\n",
|
||||
"\n",
|
||||
"If you want additional examples, you can follow the notebook by [Shawn Graham](https://github.com/o-date/sparql-and-lod/blob/master/sparql-intro.ipynb), which is based on the SPARQL tutorial by Matthew Lincoln, available [here in English](https://programminghistorian.org/en/lessons/retired/graph-databases-and-SPARQL) and [here in Spanish](https://programminghistorian.org/es/lecciones/retirada/sparql-datos-abiertos-enlazados]). You have also a local copy of these tutorials together with this notebook [here in English](https://htmlpreview.github.io/?https://github.com/gsi-upm/sitc/blob/master/lod/tutorial/graph-databases-and-SPARQL.html) and [here in Spanish](https://htmlpreview.github.io/?https://github.com/gsi-upm/sitc/blob/master/lod/tutorial/sparql-datos-abiertos-enlazados.html). \n"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## References"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"* <a id=\"1\">[1]</a> [SPARQL by Example. A Tutorial. Lee Feigenbaum. W3C, 2009](https://www.w3.org/2009/Talks/0615-qbe/#q1)\n",
|
||||
"* <a id=\"2\">[2]</a> [RDF Primer W3C](https://www.w3.org/TR/rdf11-primer/)\n",
|
||||
"* <a id=\"3\">[3]</a> [SPARQL queries of Beatles recording sessions](http://www.snee.com/bobdc.blog/2017/11/sparql-queries-of-beatles-reco.html)\n",
|
||||
"* <a id=\"4\">[4]</a> [RDFLib documentation](https://rdflib.readthedocs.io/en/stable/).\n",
|
||||
"* <a id=\"5\">[5]</a> [Wikidata Query Service query examples](https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples)\n",
|
||||
"* <a id=\"6\">[6]</a> [RDF Graph Data Model. Learn about the RDF graph model used by Stardog.](https://www.stardog.com/tutorials/data-model)\n",
|
||||
"* <a id=\"7\">[7]</a> [Learn SPARQL Write Knowledge Graph queries using SPARQL with step-by-step examples.](https://www.stardog.com/tutorials/sparql/)\n",
|
||||
"* <a id=\"8\">[8]</a> [Running Basic SPARQL Queries Against DBpedia.](https://medium.com/virtuoso-blog/dbpedia-basic-queries-bc1ac172cc09)\n",
|
||||
"* <a id=\"8\">[9]</a> [Intro SPARQL based on painters.](https://github.com/o-date/sparql-and-lod/blob/master/sparql-intro.ipynb)."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"datacleaner": {
|
||||
"position": {
|
||||
"top": "50px"
|
||||
},
|
||||
"python": {
|
||||
"varRefreshCmd": "try:\n print(_datacleaner.dataframe_metadata())\nexcept:\n print([])"
|
||||
},
|
||||
"window_display": false
|
||||
},
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
"language_info": {
|
||||
"codemirror_mode": {
|
||||
"name": "ipython",
|
||||
"version": 3
|
||||
},
|
||||
"file_extension": ".py",
|
||||
"mimetype": "text/x-python",
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.8.10"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 1
|
||||
}
|
1919
lod/01_SPARQL_Introduction.ipynb
Executable file
476
lod/02_SPARQL_Custom_Endpoint.ipynb
Normal file
@@ -0,0 +1,476 @@
|
||||
{
|
||||
"cells": [
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "7276f055a8c504d3c80098c62ed41a4f",
|
||||
"grade": false,
|
||||
"grade_id": "cell-0bfe38f97f6ab2d2",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"<header style=\"width:100%;position:relative\">\n",
|
||||
" <div style=\"width:80%;float:right;\">\n",
|
||||
" <h1>Course Notes for Learning Intelligent Systems</h1>\n",
|
||||
" <h3>Department of Telematic Engineering Systems</h3>\n",
|
||||
" <h5>Universidad Politécnica de Madrid</h5>\n",
|
||||
" </div>\n",
|
||||
" <img style=\"width:15%;\" src=\"../logo.jpg\" alt=\"UPM\" />\n",
|
||||
"</header>"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "42642609861283bc33914d16750b7efa",
|
||||
"grade": false,
|
||||
"grade_id": "cell-0cd673883ee592d1",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Introduction\n",
|
||||
"\n",
|
||||
"In the previous notebook, we learnt how to use SPARQL by querying DBpedia.\n",
|
||||
"\n",
|
||||
"In this notebook, we will use SPARQL on manually annotated data. The data was collected as part of a [previous exercise](../lod/).\n",
|
||||
"\n",
|
||||
"The goal is to try SPARQL with data annotated by users with limited knowledge of vocabularies and semantics, and to compare the experience with similar queries to a more structured dataset.\n",
|
||||
"\n",
|
||||
"Hence, there are two parts.\n",
|
||||
"First, you will query a set of graphs annotated by students of this course.\n",
|
||||
"Then, you will query a synthetic dataset that contains similar information."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "a3ecb4b300a5ab82376a4a8cb01f7e6b",
|
||||
"grade": false,
|
||||
"grade_id": "cell-10264483046abcc4",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Objectives\n",
|
||||
"\n",
|
||||
"* Experiencing the usefulness of the Linked Open Data initiative by querying data from different RDF graphs and endpoints\n",
|
||||
"* Understanding the challenges in querying multiple sources, with different annotators.\n"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "2fedf0d73fc90104d1ab72c3413dfc83",
|
||||
"grade": false,
|
||||
"grade_id": "cell-4f8492996e74bf20",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Tools\n",
|
||||
"\n",
|
||||
"See [the SPARQL notebook](./01_SPARQL_Introduction.ipynb#Tools)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "c5f8646518bd832a47d71f9d3218237a",
|
||||
"grade": false,
|
||||
"grade_id": "cell-eb13908482825e42",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"Run this line to enable the `%%sparql` magic command."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"from helpers import *"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Exercises\n"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Querying the manually annotated dataset will be slightly different from querying DBpedia.\n",
|
||||
"The main difference is that this dataset uses different graphs to separate the annotations from different students.\n",
|
||||
"\n",
|
||||
"**Each graph is a separate set of triples**.\n",
|
||||
"For this exercise, you could think of graphs as individual endpoints.\n",
|
||||
"\n",
|
||||
"\n",
|
||||
"First, let us get a list of graphs available:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql http://fuseki.gsi.upm.es/hotels\n",
|
||||
" \n",
|
||||
"SELECT ?g (COUNT(?s) as ?count) WHERE {\n",
|
||||
" GRAPH ?g {\n",
|
||||
" ?s ?p ?o\n",
|
||||
" }\n",
|
||||
"}\n",
|
||||
"GROUP BY ?g\n",
|
||||
"ORDER BY desc(?count)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"You should see many graphs, with different triple counts.\n",
|
||||
"\n",
|
||||
"The biggest one should be http://fuseki.gsi.upm.es/synthetic"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Once you have this list, you can query specific graphs like so:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql http://fuseki.gsi.upm.es/hotels\n",
|
||||
" \n",
|
||||
"SELECT *\n",
|
||||
"WHERE {\n",
|
||||
" GRAPH <http://fuseki.gsi.upm.es/synthetic>{\n",
|
||||
" ?s ?p ?o .\n",
|
||||
" }\n",
|
||||
"}\n",
|
||||
"LIMIT 10"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"There are two exercises in this notebook.\n",
|
||||
"\n",
|
||||
"In each of them, you are asked to run five queries, to answer the following questions:\n",
|
||||
"\n",
|
||||
"* Number of hotels (or entities) with reviews\n",
|
||||
"* Number of reviews\n",
|
||||
"* The hotel with the lowest average score\n",
|
||||
"* The hotel with the highest average score\n",
|
||||
"* A list of hotels with their addresses and telephone numbers"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Manually annotated data"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Your task is to design five queries to answer the questions in the description, and run each of them in at least three graphs, other than the `synthetic` graph.\n",
|
||||
"\n",
|
||||
"To design the queries, what you know about the schema.org vocabularies, or explore subjects, predicates and objects in each of the graphs.\n",
|
||||
"\n",
|
||||
"Here's an example to get the entities and their types in a graph:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql http://fuseki.gsi.upm.es/hotels\n",
|
||||
"\n",
|
||||
"PREFIX schema: <http://schema.org/>\n",
|
||||
" \n",
|
||||
"SELECT ?s ?o\n",
|
||||
"WHERE {\n",
|
||||
" GRAPH <http://fuseki.gsi.upm.es/35c20a49f8c6581be1cf7bd56d12d131>{\n",
|
||||
" ?s a ?o .\n",
|
||||
" }\n",
|
||||
"\n",
|
||||
"}\n",
|
||||
"LIMIT 40"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Synthetic dataset\n",
|
||||
"\n",
|
||||
"Now, run the same queries in the synthetic dataset.\n",
|
||||
"\n",
|
||||
"The query below should get you started:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql http://fuseki.gsi.upm.es/hotels\n",
|
||||
" \n",
|
||||
"SELECT *\n",
|
||||
"WHERE {\n",
|
||||
" GRAPH <http://fuseki.gsi.upm.es/synthetic>{\n",
|
||||
" ?s ?p ?o .\n",
|
||||
" }\n",
|
||||
"}\n",
|
||||
"LIMIT 10"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Optional exercise\n",
|
||||
"\n",
|
||||
"\n",
|
||||
"Explore the graphs and find the most typical mistakes (e.g. using `http://schema.org/Hotel/Hotel`).\n",
|
||||
"\n",
|
||||
"Tip: You can use normal SPARQL queries with `BOUND` and `REGEX` to check if the annotations are correct.\n",
|
||||
"\n",
|
||||
"You can also query all the graphs at the same time. e.g. to get all types used:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql http://fuseki.gsi.upm.es/hotels\n",
|
||||
"\n",
|
||||
"PREFIX schema: <http://schema.org/>\n",
|
||||
" \n",
|
||||
"SELECT DISTINCT ?o\n",
|
||||
"WHERE {\n",
|
||||
" GRAPH ?g {\n",
|
||||
" ?s a ?o .\n",
|
||||
" }\n",
|
||||
" {\n",
|
||||
" SELECT ?g\n",
|
||||
" WHERE {\n",
|
||||
" GRAPH ?g {}\n",
|
||||
" FILTER (str(?g) != 'http://fuseki.gsi.upm.es/synthetic')\n",
|
||||
" }\n",
|
||||
" }\n",
|
||||
"\n",
|
||||
"\n",
|
||||
"}\n",
|
||||
"LIMIT 50"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Discussion\n",
|
||||
"\n",
|
||||
"Compare the results of the synthetic and the manual dataset, and answer these questions:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Both datasets should use the same schema. Are there any differences when it comes to using them?"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "860c3977cd06736f1342d535944dbb63",
|
||||
"grade": true,
|
||||
"grade_id": "cell-9bd08e4f5842cb89",
|
||||
"locked": false,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# YOUR ANSWER HERE"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Are the annotations used correctly in every graph?"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "1946a7ed4aba8d168bb3fad898c05651",
|
||||
"grade": true,
|
||||
"grade_id": "cell-9dc1c9033198bb18",
|
||||
"locked": false,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# YOUR ANSWER HERE"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Has any of the datasets been harder to query? If so, why?"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "6714abc5226618b76dc4c1aaed6d1a49",
|
||||
"grade": true,
|
||||
"grade_id": "cell-6c18003ced54be23",
|
||||
"locked": false,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# YOUR ANSWER HERE"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## References"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"* [RDFLib documentation](https://rdflib.readthedocs.io/en/stable/).\n",
|
||||
"* [Wikidata Query Service query examples](https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
"language_info": {
|
||||
"codemirror_mode": {
|
||||
"name": "ipython",
|
||||
"version": 3
|
||||
},
|
||||
"file_extension": ".py",
|
||||
"mimetype": "text/x-python",
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.8.10"
|
||||
},
|
||||
"toc": {
|
||||
"base_numbering": 1,
|
||||
"nav_menu": {},
|
||||
"number_sections": true,
|
||||
"sideBar": true,
|
||||
"skip_h1_title": false,
|
||||
"title_cell": "Table of Contents",
|
||||
"title_sidebar": "Contents",
|
||||
"toc_cell": false,
|
||||
"toc_position": {},
|
||||
"toc_section_display": true,
|
||||
"toc_window_display": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 2
|
||||
}
|
652
lod/04_SPARQL_Advanced.ipynb
Normal file
@@ -0,0 +1,652 @@
|
||||
{
|
||||
"cells": [
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "7276f055a8c504d3c80098c62ed41a4f",
|
||||
"grade": false,
|
||||
"grade_id": "cell-0bfe38f97f6ab2d2",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"<header style=\"width:100%;position:relative\">\n",
|
||||
" <div style=\"width:80%;float:right;\">\n",
|
||||
" <h1>Course Notes for Learning Intelligent Systems</h1>\n",
|
||||
" <h3>Department of Telematic Engineering Systems</h3>\n",
|
||||
" <h5>Universidad Politécnica de Madrid</h5>\n",
|
||||
" </div>\n",
|
||||
" <img style=\"width:15%;\" src=\"../logo.jpg\" alt=\"UPM\" />\n",
|
||||
"</header>"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "bd478e6253226d24ba7f33cb9f6ba706",
|
||||
"grade": false,
|
||||
"grade_id": "cell-0cd673883ee592d1",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Advanced SPARQL\n",
|
||||
"\n",
|
||||
"This notebook complements [the SPARQL notebook](./01_SPARQL.ipynb) with some advanced commands.\n",
|
||||
"\n",
|
||||
"If you have not completed the exercises in the previous notebook, please do so before continuing.\n"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "9ea4fd529653214745b937d5fc4559e5",
|
||||
"grade": false,
|
||||
"grade_id": "cell-10264483046abcc4",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Objectives\n",
|
||||
"\n",
|
||||
"* To cover some SPARQL concepts that are less frequently used "
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "2fedf0d73fc90104d1ab72c3413dfc83",
|
||||
"grade": false,
|
||||
"grade_id": "cell-4f8492996e74bf20",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Tools\n",
|
||||
"\n",
|
||||
"See [the SPARQL notebook](./01_SPARQL_Introduction.ipynb#Tools)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "c5f8646518bd832a47d71f9d3218237a",
|
||||
"grade": false,
|
||||
"grade_id": "cell-eb13908482825e42",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"Run this line to enable the `%%sparql` magic command."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"from helpers import *"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Exercises"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Working with dates"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"To explore dates, we will focus on our Writers example.\n",
|
||||
"\n",
|
||||
"First, search for writers born in the XX century.\n",
|
||||
"You can use a special filter, knowing that `\"2000\"^^xsd:date` is the first date of year 2000."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "1a23c8b9a53f7ae28f28b1c23b9706b5",
|
||||
"grade": false,
|
||||
"grade_id": "cell-ab7755944d46f9ca",
|
||||
"locked": false,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n",
|
||||
"PREFIX dct: <http://purl.org/dc/terms/>\n",
|
||||
"PREFIX dbc: <http://dbpedia.org/resource/Category:>\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>\n",
|
||||
"\n",
|
||||
"SELECT ?escritor ?nombre (year(?fechaNac) as ?nac)\n",
|
||||
"WHERE {\n",
|
||||
" ?escritor dct:subject dbc:Spanish_novelists ;\n",
|
||||
" rdfs:label ?nombre ;\n",
|
||||
" dbo:birthDate ?fechaNac .\n",
|
||||
" FILTER(lang(?nombre) = \"es\") .\n",
|
||||
" # YOUR ANSWER HERE\n",
|
||||
"}\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
"LIMIT 1000"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "e261d808f509c1e29227db94d1eab784",
|
||||
"grade": true,
|
||||
"grade_id": "cell-cf3821f2d33fb0f6",
|
||||
"locked": true,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"assert 'Ramiro Ledesma' in solution()['columns']['nombre']\n",
|
||||
"assert 'Ray Loriga' in solution()['columns']['nombre']\n",
|
||||
"assert all(int(x) > 1899 and int(x) < 2001 for x in solution()['columns']['nac'])"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Now, get the list of Spanish novelists that are still alive.\n",
|
||||
"\n",
|
||||
"A person is alive if their death date is not defined and the were born less than 100 years ago.\n",
|
||||
"\n",
|
||||
"Remember, we can check whether the optional value for a key was bound in a SPARQL query using `BOUND(?key)`."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "e4579d551790c33ba4662562c6a05d99",
|
||||
"grade": false,
|
||||
"grade_id": "cell-474b1a72dec6827c",
|
||||
"locked": false,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql\n",
|
||||
"\n",
|
||||
"PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n",
|
||||
"PREFIX dct:<http://purl.org/dc/terms/>\n",
|
||||
"PREFIX dbc:<http://dbpedia.org/resource/Category:>\n",
|
||||
"PREFIX dbo:<http://dbpedia.org/ontology/>\n",
|
||||
"\n",
|
||||
"SELECT ?escritor, ?nombre, year(?fechaNac) as ?nac\n",
|
||||
"\n",
|
||||
"WHERE {\n",
|
||||
" ?escritor dct:subject dbc:Spanish_novelists .\n",
|
||||
" ?escritor rdfs:label ?nombre .\n",
|
||||
" ?escritor dbo:birthDate ?fechaNac .\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
" FILTER(lang(?nombre) = \"es\") .\n",
|
||||
"}\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
"LIMIT 1000"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "770bbddef5210c28486a1929e4513ada",
|
||||
"grade": true,
|
||||
"grade_id": "cell-46b62dd2856bc919",
|
||||
"locked": true,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"assert 'Fernando Arrabal' in solution()['columns']['nombre']\n",
|
||||
"assert 'Albert Espinosa' in solution()['columns']['nombre']\n",
|
||||
"for year in solution()['columns']['nac']:\n",
|
||||
" assert int(year) >= 1918"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Working with badly formatted dates (OPTIONAL!)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Now, get the list of Spanish novelists that died before their fifties (i.e. younger than 50 years old), or that aren't 50 years old yet.\n",
|
||||
"\n",
|
||||
"For the sake of simplicity, you can use the `year(<date>)` function.\n",
|
||||
"\n",
|
||||
"Hint: you can use boolean logic in your filters (e.g. `&&` and `||`).\n",
|
||||
"\n",
|
||||
"Hint 2: Some dates are not formatted properly, which makes some queries fail when they shouldn't. As a workaround, you could convert the date to string, and back to date again: `xsd:dateTime(str(?date))`."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "e55173801ab36337ad356a1bc286dbd1",
|
||||
"grade": false,
|
||||
"grade_id": "cell-ceefd3c8fbd39d79",
|
||||
"locked": false,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n",
|
||||
"PREFIX dct:<http://purl.org/dc/terms/>\n",
|
||||
"PREFIX dbc:<http://dbpedia.org/resource/Category:>\n",
|
||||
"PREFIX dbo:<http://dbpedia.org/ontology/>\n",
|
||||
"\n",
|
||||
"SELECT ?escritor, ?nombre, year(?fechaNac) as ?nac, ?fechaDef\n",
|
||||
"\n",
|
||||
"WHERE {\n",
|
||||
" ?escritor dct:subject dbc:Spanish_novelists .\n",
|
||||
" ?escritor rdfs:label ?nombre .\n",
|
||||
" ?escritor dbo:birthDate ?fechaNac .\n",
|
||||
" # YOUR ANSWER HERE\n",
|
||||
"}\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
"LIMIT 100"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "1b77cfaefb8b2ec286ce7b0c70804fe0",
|
||||
"grade": true,
|
||||
"grade_id": "cell-461cd6ccc6c2dc79",
|
||||
"locked": true,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"assert 'Javier Sierra' in solution()['columns']['nombre']\n",
|
||||
"assert 'http://dbpedia.org/resource/José_Ángel_Mañas' in solution()['columns']['escritor']"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Regular expressions"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"[Regular expressions](https://www.w3.org/TR/rdf-sparql-query/#funcex-regex) are a very powerful tool, but we will only cover the basics in this exercise.\n",
|
||||
"\n",
|
||||
"In essence, regular expressions match strings against patterns.\n",
|
||||
"In their simplest form, they can be used to find substrings within a variable.\n",
|
||||
"For instance, using `regex(?label, \"substring\")` would only match if and only if the `?label` variable contains `substring`.\n",
|
||||
"But regular expressions can be more complex than that.\n",
|
||||
"For instance, we can find patterns such as: a 10 digit number, a 5 character long string, or variables without whitespaces.\n",
|
||||
"\n",
|
||||
"The syntax of the regex function is the following:\n",
|
||||
"\n",
|
||||
"```\n",
|
||||
"regex(?variable, \"pattern\", \"flags\")\n",
|
||||
"```\n",
|
||||
"\n",
|
||||
"Flags are optional configuration options for the regular expression, such as *do not care about case* (`i` flag).\n",
|
||||
"\n",
|
||||
"As an example, let us find the cities in Madrid that contain \"de\" in their name."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"SELECT ?localidad\n",
|
||||
"WHERE {\n",
|
||||
" ?localidad <http://dbpedia.org/ontology/isPartOf> <http://dbpedia.org/resource/Community_of_Madrid> .\n",
|
||||
" ?localidad rdfs:label ?nombre .\n",
|
||||
" FILTER (lang(?nombre) = \"es\" ).\n",
|
||||
" FILTER regex(?nombre, \"de\", \"i\")\n",
|
||||
"}\n",
|
||||
"LIMIT 10"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Now, use regular expressions to find Spanish novelists whose **first name** is Juan.\n",
|
||||
"In other words, their name **starts with** \"Juan\"."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "b70a9a4f102c253e864d2e8aec79ce81",
|
||||
"grade": false,
|
||||
"grade_id": "cell-a57d3546a812f689",
|
||||
"locked": false,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n",
|
||||
"PREFIX dct:<http://purl.org/dc/terms/>\n",
|
||||
"PREFIX dbc:<http://dbpedia.org/resource/Category:>\n",
|
||||
"PREFIX dbr:<http://dbpedia.org/resource/>\n",
|
||||
"PREFIX dbo:<http://dbpedia.org/ontology/>\n",
|
||||
"\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
"\n",
|
||||
"WHERE {\n",
|
||||
" {\n",
|
||||
" ?escritor dct:subject dbc:Spanish_poets .\n",
|
||||
" }\n",
|
||||
" UNION {\n",
|
||||
" ?escritor dct:subject dbc:Spanish_novelists .\n",
|
||||
" }\n",
|
||||
" ?escritor rdfs:label ?nombre\n",
|
||||
" FILTER(lang(?nombre) = \"es\") .\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
"}\n",
|
||||
"ORDER BY ?nombre\n",
|
||||
"LIMIT 1000"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "66db9abddfafa91c2dc25577457f71fb",
|
||||
"grade": true,
|
||||
"grade_id": "cell-c149fe65008f39a9",
|
||||
"locked": true,
|
||||
"points": 0,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"assert len(solution()['columns']['nombre']) > 15\n",
|
||||
"for i in solution()['columns']['nombre']:\n",
|
||||
" assert 'Juan' in i\n",
|
||||
"assert \"Robert Juan-Cantavella\" not in solution()['columns']['nombre']"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "1be6d6e4d8e74240ef07deffcbe5e71a",
|
||||
"grade": false,
|
||||
"grade_id": "cell-0c2f0113d97dc9de",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"## Group concat"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "c8dbb73a781bd24080804f289a1cea0b",
|
||||
"grade": false,
|
||||
"grade_id": "asdasdasdddddddddddasdasdsad",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"Sometimes, it is useful to aggregate results from form different rows.\n",
|
||||
"For instance, we might want to get a comma-separated list of the names in each each autonomous community in Spain.\n",
|
||||
"\n",
|
||||
"In those cases, we can use the `GROUP_CONCAT` function."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n",
|
||||
"PREFIX dbo: <http://dbpedia.org/ontology/>\n",
|
||||
"PREFIX dbr: <http://dbpedia.org/resource/>\n",
|
||||
" \n",
|
||||
"SELECT ?com, GROUP_CONCAT(?name, \",\") as ?places # notice how we rename the variable\n",
|
||||
"\n",
|
||||
"WHERE {\n",
|
||||
" ?com dct:subject dbc:Autonomous_communities_of_Spain .\n",
|
||||
" ?localidad dbo:subdivision ?com ;\n",
|
||||
" rdfs:label ?name .\n",
|
||||
" FILTER (lang(?name)=\"es\")\n",
|
||||
"}\n",
|
||||
"\n",
|
||||
"ORDER BY ?com\n",
|
||||
"LIMIT 100"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"editable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "markdown",
|
||||
"checksum": "4779fb61645634308d0ed01e0c88e8a4",
|
||||
"grade": false,
|
||||
"grade_id": "asdiopjasdoijasdoijasd",
|
||||
"locked": true,
|
||||
"schema_version": 3,
|
||||
"solution": false
|
||||
}
|
||||
},
|
||||
"source": [
|
||||
"Try it yourself, to get a list of works by each of the authors in this query:"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {
|
||||
"deletable": false,
|
||||
"nbgrader": {
|
||||
"cell_type": "code",
|
||||
"checksum": "e5d87d1d8eba51c510241ba75981a597",
|
||||
"grade": false,
|
||||
"grade_id": "cell-2e3de17c75047652",
|
||||
"locked": false,
|
||||
"schema_version": 3,
|
||||
"solution": true
|
||||
}
|
||||
},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"%%sparql https://dbpedia.org/sparql\n",
|
||||
"\n",
|
||||
"PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n",
|
||||
"PREFIX dct:<http://purl.org/dc/terms/>\n",
|
||||
"PREFIX dbc:<http://dbpedia.org/resource/Category:>\n",
|
||||
"PREFIX dbr:<http://dbpedia.org/resource/>\n",
|
||||
"PREFIX dbo:<http://dbpedia.org/ontology/>\n",
|
||||
"\n",
|
||||
"# YOUR ANSWER HERE\n",
|
||||
"\n",
|
||||
"WHERE {\n",
|
||||
" ?escritor a dbo:Writer .\n",
|
||||
" ?escritor rdfs:label ?nombre .\n",
|
||||
" ?escritor dbo:birthDate ?fechaNac .\n",
|
||||
" ?escritor dbo:birthPlace dbr:Madrid .\n",
|
||||
" # YOUR ANSWER HERE\n",
|
||||
" FILTER(lang(?nombre) = \"es\") .\n",
|
||||
" FILTER(!bound(?titulo) || lang(?titulo) = \"en\") .\n",
|
||||
"\n",
|
||||
"}\n",
|
||||
"ORDER BY ?nombre\n",
|
||||
"LIMIT 100"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## References"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": []
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2018 Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
"language_info": {
|
||||
"codemirror_mode": {
|
||||
"name": "ipython",
|
||||
"version": 3
|
||||
},
|
||||
"file_extension": ".py",
|
||||
"mimetype": "text/x-python",
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.8.10"
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 2
|
||||
}
|
4352
lod/BeatlesMusicians.ttl
Normal file
@@ -1,11 +0,0 @@
|
||||
# Files included #
|
||||
|
||||
* `validate.py` validates and serializes a turtle dataset
|
||||
* `sparql.py` runs a custom sparql query on a given dataset (by default, `reviews.ttl`)
|
||||
* `extract_data.py` extracts RDFa, micro-data and JSON-LD data from a given URL
|
||||
|
||||
# Installation #
|
||||
|
||||
```
|
||||
pip install --user -r requirements.txt
|
||||
```
|
@@ -1,49 +0,0 @@
|
||||
|
||||
import sys
|
||||
from future.standard_library import install_aliases
|
||||
install_aliases()
|
||||
|
||||
from urllib import request, parse
|
||||
from rdflib import Graph, term
|
||||
from lxml import etree
|
||||
|
||||
if len(sys.argv) < 2:
|
||||
print('Usage: python {} <URL>'.format(sys.argv[0]))
|
||||
print('')
|
||||
print('Extract rdfa, microdata and json-ld annotations from a website')
|
||||
exit(1)
|
||||
|
||||
url = sys.argv[1]
|
||||
|
||||
g = Graph()
|
||||
g.parse(url, format='rdfa')
|
||||
g.parse(url, format='microdata')
|
||||
|
||||
|
||||
def sanitize_triple(t):
|
||||
"""Function to remove bad URIs from the graph that would otherwise
|
||||
make the serialization fail."""
|
||||
def sanitize_triple_item(item):
|
||||
if isinstance(item, term.URIRef) and '/' not in item:
|
||||
return term.URIRef(parse.quote(str(item)))
|
||||
return item
|
||||
|
||||
return (sanitize_triple_item(t[0]),
|
||||
sanitize_triple_item(t[1]),
|
||||
sanitize_triple_item(t[2]))
|
||||
|
||||
|
||||
with request.urlopen(url) as response:
|
||||
# Get all json-ld objects embedded in the html file
|
||||
html = response.read().decode('utf-8', errors='ignore')
|
||||
parser = etree.XMLParser(recover=True)
|
||||
root = etree.fromstring(html, parser=parser)
|
||||
if root:
|
||||
for jsonld in root.findall(".//script[@type='application/ld+json']"):
|
||||
g.parse(data=jsonld.text, publicID=url, format='json-ld')
|
||||
|
||||
|
||||
fixedgraph = Graph()
|
||||
fixedgraph += [sanitize_triple(s) for s in g]
|
||||
|
||||
print(g.serialize(format='turtle').decode('utf-8', errors='ignore'))
|
@@ -1,16 +1,29 @@
|
||||
'''
|
||||
Helper functions and ipython magic for the SPARQL exercises.
|
||||
|
||||
The tests in the notebooks rely on the `LAST_QUERY` variable, which is updated by the `%%sparql` magic after every query.
|
||||
This variable contains the full query used (`LAST_QUERY["query"]`), the endpoint it was sent to (`LAST_QUERY["endpoint"]`), and a dictionary with the response of the endpoint (`LAST_QUERY["results"]`).
|
||||
For convenience, the results are also given as tuples (`LAST_QUERY["tuples"]`), and as a dictionary of of `{column:[values]}` (`LAST_QUERY["columns"]`).
|
||||
'''
|
||||
from IPython.core.magic import (register_line_magic, register_cell_magic,
|
||||
register_line_cell_magic)
|
||||
|
||||
from IPython.display import HTML, display, Image
|
||||
from IPython.display import HTML, display, Image, display_javascript
|
||||
from urllib.request import Request, urlopen
|
||||
from urllib.parse import quote_plus, urlencode
|
||||
from urllib.error import HTTPError
|
||||
|
||||
import ssl
|
||||
import json
|
||||
import sys
|
||||
|
||||
js = "IPython.CodeCell.options_default.highlight_modes['magic_sparql'] = {'reg':[/^%%sparql/]};"
|
||||
display_javascript(js, raw=True)
|
||||
|
||||
|
||||
def send_query(query, endpoint):
|
||||
FORMATS = ",".join(["application/sparql-results+json", "text/javascript", "application/json"])
|
||||
FORMATS = ",".join(["application/sparql-results+json",
|
||||
"text/javascript",
|
||||
"application/json"])
|
||||
|
||||
data = {'query': query}
|
||||
# b = quote_plus(query)
|
||||
@@ -20,7 +33,19 @@ def send_query(query, endpoint):
|
||||
headers={'content-type': 'application/x-www-form-urlencoded',
|
||||
'accept': FORMATS},
|
||||
method='POST')
|
||||
return json.loads(urlopen(r).read().decode('utf-8'));
|
||||
context = ssl.create_default_context()
|
||||
context.check_hostname = False
|
||||
context.verify_mode = ssl.CERT_NONE
|
||||
|
||||
res = urlopen(r, context=context, timeout=2)
|
||||
data = res.read().decode('utf-8')
|
||||
if res.getcode() == 200:
|
||||
try:
|
||||
return json.loads(data)
|
||||
except Exception:
|
||||
print('Got: ', data, file=sys.stderr)
|
||||
raise
|
||||
raise Exception('Error getting results: {}'.format(data))
|
||||
|
||||
|
||||
def tabulate(tuples, header=None):
|
||||
@@ -39,11 +64,14 @@ def tabulate(tuples, header=None):
|
||||
|
||||
LAST_QUERY = {}
|
||||
|
||||
def solution():
|
||||
return LAST_QUERY
|
||||
|
||||
|
||||
def query(query, endpoint=None, print_table=False):
|
||||
global LAST_QUERY
|
||||
|
||||
endpoint = endpoint or "http://dbpedia.org/sparql"
|
||||
endpoint = endpoint or "http://fuseki.gsi.upm.es/sitc/"
|
||||
results = send_query(query, endpoint)
|
||||
tuples = to_table(results)
|
||||
|
||||
@@ -80,12 +108,30 @@ def to_table(results):
|
||||
|
||||
@register_cell_magic
|
||||
def sparql(line, cell):
|
||||
'''
|
||||
Sparql magic command for ipython. It can be used in a cell like this:
|
||||
|
||||
```
|
||||
%%sparql
|
||||
|
||||
... Your SPARQL query ...
|
||||
|
||||
```
|
||||
|
||||
by default, it will use the DBpedia endpoint, but you can use a different endpoint like this:
|
||||
|
||||
```
|
||||
%%sparql http://my-sparql-endpoint...
|
||||
|
||||
... Your SPARQL query ...
|
||||
```
|
||||
'''
|
||||
try:
|
||||
return query(cell, endpoint=line, print_table=True)
|
||||
except HTTPError as ex:
|
||||
error_message = ex.read().decode('utf-8')
|
||||
print('Error {}. Reason: {}'.format(ex.status, ex.reason))
|
||||
print(error_message)
|
||||
print(error_message, file=sys.stderr)
|
||||
|
||||
|
||||
def show_photos(values):
|
||||
|
@@ -1,29 +0,0 @@
|
||||
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
|
||||
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
|
||||
@prefix schema: <http://schema.org/> .
|
||||
|
||||
|
||||
_:Hotel1 a schema:Hotel ;
|
||||
schema:description "A fictitious hotel" .
|
||||
|
||||
|
||||
_:Review1 a schema:Review ;
|
||||
schema:reviewBody "This is a great review" ;
|
||||
schema:reviewRating [
|
||||
a schema:Rating ;
|
||||
schema:author <http://jfernando.es/me> ;
|
||||
schema:ratingValue "0.7"
|
||||
|
||||
] ;
|
||||
schema:itemReviewed _:Hotel1 .
|
||||
|
||||
|
||||
_:Review2 a schema:Review ;
|
||||
schema:reviewBody "This is a not so great review" ;
|
||||
schema:reviewRating [
|
||||
a schema:Rating ;
|
||||
schema:author [ a schema:Person ;
|
||||
schema:givenName "anonymous" ] ;
|
||||
schema:ratingValue "0.3"
|
||||
] ;
|
||||
schema:itemReviewed _:Hotel1 .
|
@@ -1,23 +0,0 @@
|
||||
# !/bin/env python #
|
||||
# Ejemplo de consultas SPARQL sobre turtle #
|
||||
# python consultas.py #
|
||||
import rdflib
|
||||
import sys
|
||||
|
||||
dataset = sys.argv[1] if len(sys.argv) > 1 else 'reviews.ttl'
|
||||
g = rdflib.Graph()
|
||||
|
||||
schema = rdflib.Namespace("http://schema.org/")
|
||||
|
||||
# Read Turtle file #
|
||||
g.parse(dataset, format='turtle')
|
||||
|
||||
results = g.query(
|
||||
"""SELECT DISTINCT ?review ?p ?o
|
||||
WHERE {
|
||||
?review a schema:Review.
|
||||
?review ?p ?o.
|
||||
}""", initNs={'schema': schema})
|
||||
|
||||
for row in results:
|
||||
print("%s %s %s" % row)
|
0
lod/tests.py
Normal file
61
lod/tutorial/css/github.css
Normal file
@@ -0,0 +1,61 @@
|
||||
.highlight { padding-top: 0; margin: 0;}
|
||||
.highlight .c { color: #999988; font-style: italic } /* Comment */
|
||||
.highlight .err { color: #a61717; background-color: #e3d2d2 } /* Error */
|
||||
.highlight .k { color: #000000; font-weight: bold } /* Keyword */
|
||||
.highlight .o { color: #000000; font-weight: bold } /* Operator */
|
||||
.highlight .cm { color: #999988; font-style: italic } /* Comment.Multiline */
|
||||
.highlight .cp { color: #999999; font-weight: bold; font-style: italic } /* Comment.Preproc */
|
||||
.highlight .c1 { color: #999988; font-style: italic } /* Comment.Single */
|
||||
.highlight .cs { color: #999999; font-weight: bold; font-style: italic } /* Comment.Special */
|
||||
.highlight .gd { color: #000000; background-color: #ffdddd } /* Generic.Deleted */
|
||||
.highlight .ge { color: #000000; font-style: italic } /* Generic.Emph */
|
||||
.highlight .gr { color: #aa0000 } /* Generic.Error */
|
||||
.highlight .gh { color: #999999 } /* Generic.Heading */
|
||||
.highlight .gi { color: #000000; background-color: #ddffdd } /* Generic.Inserted */
|
||||
.highlight .go { color: #888888 } /* Generic.Output */
|
||||
.highlight .gp { color: #555555 } /* Generic.Prompt */
|
||||
.highlight .gs { font-weight: bold } /* Generic.Strong */
|
||||
.highlight .gu { color: #aaaaaa } /* Generic.Subheading */
|
||||
.highlight .gt { color: #aa0000 } /* Generic.Traceback */
|
||||
.highlight .kc { color: #000000; font-weight: bold } /* Keyword.Constant */
|
||||
.highlight .kd { color: #000000; font-weight: bold } /* Keyword.Declaration */
|
||||
.highlight .kn { color: #000000; font-weight: bold } /* Keyword.Namespace */
|
||||
.highlight .kp { color: #000000; font-weight: bold } /* Keyword.Pseudo */
|
||||
.highlight .kr { color: #000000; font-weight: bold } /* Keyword.Reserved */
|
||||
.highlight .kt { color: #445588; font-weight: bold } /* Keyword.Type */
|
||||
.highlight .m { color: #009999 } /* Literal.Number */
|
||||
.highlight .s { color: #d01040 } /* Literal.String */
|
||||
.highlight .na { color: #008080 } /* Name.Attribute */
|
||||
.highlight .nb { color: #0086B3 } /* Name.Builtin */
|
||||
.highlight .nc { color: #445588; font-weight: bold } /* Name.Class */
|
||||
.highlight .no { color: #008080 } /* Name.Constant */
|
||||
.highlight .nd { color: #3c5d5d; font-weight: bold } /* Name.Decorator */
|
||||
.highlight .ni { color: #800080 } /* Name.Entity */
|
||||
.highlight .ne { color: #990000; font-weight: bold } /* Name.Exception */
|
||||
.highlight .nf { color: #990000; font-weight: bold } /* Name.Function */
|
||||
.highlight .nl { color: #990000; font-weight: bold } /* Name.Label */
|
||||
.highlight .nn { color: #555555 } /* Name.Namespace */
|
||||
.highlight .nt { color: #000080 } /* Name.Tag */
|
||||
.highlight .nv { color: #008080 } /* Name.Variable */
|
||||
.highlight .ow { color: #000000; font-weight: bold } /* Operator.Word */
|
||||
.highlight .w { color: #bbbbbb } /* Text.Whitespace */
|
||||
.highlight .mf { color: #009999 } /* Literal.Number.Float */
|
||||
.highlight .mh { color: #009999 } /* Literal.Number.Hex */
|
||||
.highlight .mi { color: #009999 } /* Literal.Number.Integer */
|
||||
.highlight .mo { color: #009999 } /* Literal.Number.Oct */
|
||||
.highlight .sb { color: #d01040 } /* Literal.String.Backtick */
|
||||
.highlight .sc { color: #d01040 } /* Literal.String.Char */
|
||||
.highlight .sd { color: #d01040 } /* Literal.String.Doc */
|
||||
.highlight .s2 { color: #d01040 } /* Literal.String.Double */
|
||||
.highlight .se { color: #d01040 } /* Literal.String.Escape */
|
||||
.highlight .sh { color: #d01040 } /* Literal.String.Heredoc */
|
||||
.highlight .si { color: #d01040 } /* Literal.String.Interpol */
|
||||
.highlight .sx { color: #d01040 } /* Literal.String.Other */
|
||||
.highlight .sr { color: #009926 } /* Literal.String.Regex */
|
||||
.highlight .s1 { color: #d01040 } /* Literal.String.Single */
|
||||
.highlight .ss { color: #990073 } /* Literal.String.Symbol */
|
||||
.highlight .bp { color: #999999 } /* Name.Builtin.Pseudo */
|
||||
.highlight .vc { color: #008080 } /* Name.Variable.Class */
|
||||
.highlight .vg { color: #008080 } /* Name.Variable.Global */
|
||||
.highlight .vi { color: #008080 } /* Name.Variable.Instance */
|
||||
.highlight .il { color: #009999 } /* Literal.Number.Integer.Long */
|
968
lod/tutorial/css/style.css
Normal file
@@ -0,0 +1,968 @@
|
||||
@media screen {
|
||||
|
||||
body {
|
||||
font-family: 'Quattrocento', Verdana, sans-serif;
|
||||
font-size:16px;
|
||||
background-color:#ffffff;
|
||||
}
|
||||
|
||||
.container {
|
||||
max-width: 48rem;
|
||||
overflow: hidden;
|
||||
text-overflow: ellipsis;
|
||||
}
|
||||
|
||||
|
||||
/* =============================================================================
|
||||
Helper classes
|
||||
========================================================================== */
|
||||
|
||||
.noclear {
|
||||
clear:none;
|
||||
}
|
||||
|
||||
.expanded {
|
||||
max-width: 58rem;
|
||||
}
|
||||
|
||||
.garnish {
|
||||
width: 23%;
|
||||
padding:0;
|
||||
}
|
||||
|
||||
.full-width {
|
||||
width:80%;
|
||||
margin: 0 auto;
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
.float-right {
|
||||
float:right;
|
||||
margin-left: 1rem;
|
||||
margin-bottom: 1rem;
|
||||
}
|
||||
|
||||
.float-left {
|
||||
margin-right: 1rem;
|
||||
margin-bottom: 1rem;
|
||||
}
|
||||
|
||||
|
||||
/* =============================================================================
|
||||
Home Page
|
||||
========================================================================== */
|
||||
|
||||
.home-block {
|
||||
padding:3rem 0;
|
||||
color:#666;
|
||||
}
|
||||
|
||||
.home-block h2 {
|
||||
margin:0;
|
||||
font-size:2.8rem;
|
||||
color:#333;
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
.home-block p {
|
||||
margin:0rem;
|
||||
font-family:'Open Sans';
|
||||
font-size:1.2rem;
|
||||
padding-top:2rem;
|
||||
text-align:justify;
|
||||
}
|
||||
|
||||
.home-block a:visited {
|
||||
color: #38c;
|
||||
}
|
||||
|
||||
.home-stripe-1 {
|
||||
color:#eee;
|
||||
background:#27b;
|
||||
}
|
||||
|
||||
.home-stripe-1 h2, .home-stripe-2 h2 {
|
||||
color:#fff;
|
||||
}
|
||||
|
||||
.home-stripe-1 a:visited, .home-stripe-1 a:link {
|
||||
color:#6bf;
|
||||
}
|
||||
|
||||
.home-stripe-2 {
|
||||
color:#fff;
|
||||
background:#289;
|
||||
}
|
||||
|
||||
.home-stripe-2 a:visited, .home-stripe-2 a:link {
|
||||
color:#6cd;
|
||||
}
|
||||
|
||||
.home-image {
|
||||
width: 75%;
|
||||
}
|
||||
|
||||
.home-logo img {
|
||||
width: 200px;
|
||||
}
|
||||
|
||||
.home-logo a h1 {
|
||||
color: #fff;
|
||||
}
|
||||
|
||||
.home-logo {
|
||||
color: #fff;
|
||||
}
|
||||
|
||||
.home-logo li {
|
||||
font-size: 1.2rem;
|
||||
}
|
||||
|
||||
.en-back {
|
||||
background-color: #444444;
|
||||
}
|
||||
|
||||
.es-back {
|
||||
background-color: #535D7F;
|
||||
}
|
||||
|
||||
.fr-back {
|
||||
background-color: #3D7C81;
|
||||
}
|
||||
|
||||
.pt-back {
|
||||
background-color: #d6b664;
|
||||
}
|
||||
|
||||
|
||||
.sitewide-alert {
|
||||
position: relative;
|
||||
margin-bottom: 0;
|
||||
}
|
||||
|
||||
/* =============================================================================
|
||||
Lesson Headers
|
||||
========================================================================== */
|
||||
|
||||
header {
|
||||
margin:-3rem 0 3rem 0;
|
||||
padding:0;
|
||||
font-family:'Roboto', sans-serif;
|
||||
color:#ccc;
|
||||
background: #efefef;
|
||||
border-top:1px solid #333;
|
||||
border-bottom:1px solid #333;
|
||||
text-align:left;
|
||||
}
|
||||
|
||||
header .container-fluid {
|
||||
margin:0;
|
||||
padding:1rem;
|
||||
background: #f5f5f5;
|
||||
}
|
||||
|
||||
header h1 {
|
||||
margin:0;
|
||||
padding:0;
|
||||
font-size:1.8rem;
|
||||
text-align:left;
|
||||
}
|
||||
|
||||
header h2 {
|
||||
font-family:'Roboto', sans-serif;
|
||||
font-size:1.2rem;
|
||||
color:#333;
|
||||
margin: 1.5rem 0 1.5rem 0rem;
|
||||
text-align:left;
|
||||
}
|
||||
|
||||
header h3, header h4 {
|
||||
font: .9rem/1.1rem 'Roboto Condensed', sans-serif;
|
||||
text-transform:uppercase;
|
||||
font-variant:small-caps;
|
||||
letter-spacing:80%;
|
||||
color:#666;
|
||||
margin:.3rem 0 0 0;
|
||||
padding:0;
|
||||
}
|
||||
|
||||
header h4 {
|
||||
display:inline;
|
||||
margin:0;
|
||||
line-height:1.3rem;
|
||||
}
|
||||
|
||||
header .header-image {
|
||||
float:left;
|
||||
border:.2rem solid gray;
|
||||
margin:0;
|
||||
padding:0;
|
||||
max-width: 200px;
|
||||
}
|
||||
|
||||
header .header-abstract {
|
||||
font: 1rem/1.4rem 'Roboto', sans-serif;
|
||||
color:#666;
|
||||
margin:1rem 0;
|
||||
}
|
||||
|
||||
header .header-helpers {
|
||||
clear:both;
|
||||
background:#ccc;
|
||||
color:#fff;
|
||||
border-top:1px solid #999;
|
||||
border-bottom:1px solid #999;
|
||||
}
|
||||
|
||||
header ul {
|
||||
margin:0;
|
||||
padding:0;
|
||||
list-style-type: none;
|
||||
}
|
||||
|
||||
header li, header .metarow {
|
||||
font: .9rem/1.1rem 'Roboto Condensed';
|
||||
}
|
||||
|
||||
header .metarow {
|
||||
color:#999;
|
||||
}
|
||||
|
||||
header .peer-review, header .open-license {
|
||||
font-size: 0.9rem;
|
||||
color: #666;
|
||||
margin: 0;
|
||||
}
|
||||
|
||||
/* =============================================================================
|
||||
Lessons Index
|
||||
========================================================================== */
|
||||
|
||||
/*****************
|
||||
FILTER BUTTONS
|
||||
******************/
|
||||
ul.filter, ul.sort-by {
|
||||
margin: 0 0 1rem 0;
|
||||
padding: 0px;
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
li.filter,
|
||||
li.sort,
|
||||
#filter-none {
|
||||
font: .9rem/1.1rem 'Open Sans', sans-serif;
|
||||
padding: .4rem .6rem;
|
||||
border:none;
|
||||
border-radius: 3px;
|
||||
display:inline-block;
|
||||
text-transform:uppercase;
|
||||
text-decoration: none;
|
||||
}
|
||||
|
||||
.filter li:hover,
|
||||
.sort-by li:hover,
|
||||
#filter-none:hover {
|
||||
cursor: pointer;
|
||||
}
|
||||
|
||||
.activities li.current:hover,
|
||||
.filter li.current:hover,
|
||||
.sort-by li.current:hover {
|
||||
cursor:default;
|
||||
}
|
||||
|
||||
.topic li a {
|
||||
text-decoration: none;
|
||||
}
|
||||
|
||||
.activities li {
|
||||
background-color:#38c;
|
||||
color:#fff;
|
||||
}
|
||||
|
||||
.activities li:hover {
|
||||
background-color:#16a;
|
||||
}
|
||||
|
||||
.activities li.current {
|
||||
background-color:#059;
|
||||
}
|
||||
|
||||
.topics li {
|
||||
background-color:#eee;
|
||||
color: #38a;
|
||||
}
|
||||
|
||||
.topics li:hover {
|
||||
background-color:#ccc;
|
||||
}
|
||||
|
||||
.topics li.current {
|
||||
background-color:#aaa;
|
||||
color: #333;
|
||||
}
|
||||
|
||||
|
||||
#filter-none {
|
||||
width:99.5%;
|
||||
clear:both;
|
||||
text-align:center;
|
||||
margin-bottom:1rem;
|
||||
background-color:#fefefe;
|
||||
color:#666;
|
||||
border:1px solid #999;
|
||||
}
|
||||
|
||||
#filter-none:hover {
|
||||
background-color:#ededed;
|
||||
}
|
||||
|
||||
/*****************
|
||||
SEARCH
|
||||
*****************/
|
||||
|
||||
.search-input {
|
||||
width:55%;
|
||||
clear:both;
|
||||
margin-bottom:1rem;
|
||||
background-color:#fefefe;
|
||||
color:#666;
|
||||
border:1px solid #999;
|
||||
font: .9rem/1.1rem 'Open Sans',
|
||||
sans-serif;
|
||||
padding: .4rem .6rem;
|
||||
border-radius: 3px;
|
||||
display:inline-block;
|
||||
text-transform:uppercase;
|
||||
text-decoration: none;
|
||||
}
|
||||
|
||||
#search-button,
|
||||
#enable-search-button {
|
||||
background-color: #efefef;
|
||||
color: rgb(153, 143, 143);
|
||||
width: 35%;
|
||||
font: .9rem/1.1rem 'Open Sans',
|
||||
sans-serif;
|
||||
padding: .4rem .6rem;
|
||||
border: none;
|
||||
border-radius: 3px;
|
||||
display: inline-block;
|
||||
text-transform: uppercase;
|
||||
text-decoration: none;
|
||||
}
|
||||
|
||||
@media only screen and (max-width: 767px) {
|
||||
/* phones */
|
||||
#search-button,
|
||||
#enable-search-button {
|
||||
width: 80%;
|
||||
}
|
||||
}
|
||||
|
||||
|
||||
#search-info-button {
|
||||
padding: 0.5rem;
|
||||
color: rgb(153, 143, 143);
|
||||
}
|
||||
|
||||
#search-info {
|
||||
display: none;
|
||||
height:0px;
|
||||
background:#efefef;
|
||||
overflow:hidden;
|
||||
transition:0.5s;
|
||||
-webkit-transition:0.5s;
|
||||
width: 100%;
|
||||
text-align: left;
|
||||
box-sizing: border-box;
|
||||
}
|
||||
|
||||
#search-info.visible {
|
||||
display: block;
|
||||
height: fit-content;
|
||||
height: -moz-max-content;
|
||||
padding: 10px;
|
||||
margin-top: 10px;
|
||||
}
|
||||
|
||||
/*****************
|
||||
SORT BUTTONS
|
||||
*****************/
|
||||
|
||||
li.sort {
|
||||
background-color: #efefef;
|
||||
color:#666;
|
||||
width:49.5%;
|
||||
}
|
||||
|
||||
li.sort:hover {
|
||||
text-decoration: none;
|
||||
background-color:#cecece;
|
||||
}
|
||||
|
||||
#current-sort {
|
||||
font-size:75%;
|
||||
}
|
||||
|
||||
.sort.my-desc:after, .sort-desc:after {
|
||||
width: 0;
|
||||
height: 0;
|
||||
border-left: .4rem solid transparent;
|
||||
border-right: .4rem solid transparent;
|
||||
border-top: .4rem solid;
|
||||
content:"";
|
||||
position: relative;
|
||||
top:.75rem;
|
||||
right:-.3rem;
|
||||
}
|
||||
|
||||
.sort.my-asc:after, .sort-asc:after {
|
||||
width: 0;
|
||||
height: 0;
|
||||
border-left: .4rem solid transparent;
|
||||
border-right: .4rem solid transparent;
|
||||
border-bottom: .4rem solid;
|
||||
content:"";
|
||||
position: relative;
|
||||
bottom:.75rem;
|
||||
right:-.3rem;
|
||||
}
|
||||
|
||||
.sort-desc:after {
|
||||
top:1rem;
|
||||
}
|
||||
|
||||
.sort-asc:after {
|
||||
bottom:1rem;
|
||||
}
|
||||
|
||||
/*****************************
|
||||
LESSON INDEX RESULTS LIST
|
||||
*****************************/
|
||||
|
||||
h2.results-title {
|
||||
margin:1rem 0;
|
||||
font: 1.6rem/2rem 'Roboto Condensed';
|
||||
color:#666;
|
||||
text-transform:uppercase;
|
||||
}
|
||||
|
||||
#results-value {
|
||||
color:#000;
|
||||
}
|
||||
|
||||
|
||||
#lesson-list .list ul {
|
||||
margin:0;
|
||||
padding:0;
|
||||
}
|
||||
|
||||
#lesson-list .list li {
|
||||
list-style-type:none;
|
||||
margin:0;
|
||||
}
|
||||
|
||||
|
||||
.lesson-description {
|
||||
margin-bottom:2rem;
|
||||
padding:0rem;
|
||||
min-height:120px;
|
||||
text-align:left;
|
||||
}
|
||||
|
||||
.lesson-description img {
|
||||
width:100%;
|
||||
}
|
||||
|
||||
.lesson-image {
|
||||
width:120px;
|
||||
float:left;
|
||||
margin-right:1rem;
|
||||
}
|
||||
|
||||
.above-title {
|
||||
margin:0 0 .2rem 0;
|
||||
font: .8rem/1rem 'Roboto Condensed';
|
||||
color:#999;
|
||||
text-transform:uppercase;
|
||||
clear:none;
|
||||
}
|
||||
|
||||
.lesson-description h2.title {
|
||||
font: 1.2rem/1.3rem 'Crete Round', serif;
|
||||
margin:0 0 .8rem 0;
|
||||
clear:none;
|
||||
}
|
||||
|
||||
.list .date,
|
||||
.lesson-description .activity,
|
||||
.lesson-description .topics,
|
||||
.lesson-description .difficulty {
|
||||
display: none;
|
||||
}
|
||||
|
||||
#pre-loader {
|
||||
visibility: hidden;
|
||||
display: flex;
|
||||
justify-content: center;
|
||||
align-items: center;
|
||||
height: 100vh;
|
||||
width: 100%;
|
||||
position: fixed;
|
||||
top: 0;
|
||||
left: 0;
|
||||
z-index: 9999;
|
||||
transition: opacity 0.3s linear;
|
||||
background: rgba(211, 211, 211, 0.8);
|
||||
}
|
||||
/* =============================================================================
|
||||
Top Navigation Bar
|
||||
========================================================================== */
|
||||
|
||||
.navbar {
|
||||
padding: .6rem 1rem;
|
||||
margin: 0 0 3rem 0;
|
||||
}
|
||||
|
||||
.navbar-dark .navbar-nav .nav-link {
|
||||
font-family:'Open Sans';
|
||||
text-transform:uppercase;
|
||||
color:#fff;
|
||||
font-size:.9rem;
|
||||
}
|
||||
|
||||
.btn-group > .btn-secondary {
|
||||
border-color: #333333;
|
||||
background-color: #888888;
|
||||
}
|
||||
|
||||
.lang {
|
||||
text-transform:lowercase !important;
|
||||
}
|
||||
|
||||
.navbar-dark .navbar-nav .nav-link:hover, .navbar-dark .navbar-brand:hover {
|
||||
color:#39a;
|
||||
}
|
||||
|
||||
.navbar-toggler-icon {
|
||||
background-image: url("data:image/svg+xml;charset=utf8,%3Csvg viewBox='0 0 32 32' xmlns='http://www.w3.org/2000/svg'%3E%3Cpath stroke='rgba(255,255,255, 1)' stroke-width='2' stroke-linecap='round' stroke-miterlimit='10' d='M4 8h24M4 16h24M4 24h24'/%3E%3C/svg%3E");
|
||||
}
|
||||
|
||||
.navbar-collapse {
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
.navbar-dark .navbar-brand {
|
||||
font-family:'Crete Round', serif;
|
||||
color:#fff;
|
||||
letter-spacing: .02em;
|
||||
}
|
||||
|
||||
.btn-group > a.btn {
|
||||
padding-left: 1rem;
|
||||
padding-right: 1rem;
|
||||
}
|
||||
|
||||
a.dropdown-item {
|
||||
border-bottom:1px solid #ccc;
|
||||
font-family:'Roboto';
|
||||
}
|
||||
|
||||
.dropdown-menu {
|
||||
position: absolute;
|
||||
background: #fff;
|
||||
border: 1px solid #ccc;
|
||||
margin:0;
|
||||
padding:0;
|
||||
}
|
||||
|
||||
.dropdown-menu a {
|
||||
font-size:.8rem;
|
||||
line-height:2rem;
|
||||
text-transform:uppercase;
|
||||
}
|
||||
|
||||
.dropdown-menu a:last-child {
|
||||
border-bottom:none;
|
||||
}
|
||||
|
||||
.dropdown-menu:after, .dropdown-menu:before {
|
||||
bottom: 100%;
|
||||
left: 20%;
|
||||
border: solid transparent;
|
||||
content: " ";
|
||||
height: 0;
|
||||
width: 0;
|
||||
position: absolute;
|
||||
pointer-events: none;
|
||||
}
|
||||
|
||||
.dropdown-menu:after {
|
||||
border-color: rgba(255, 255, 255, 0);
|
||||
border-bottom-color: #fff;
|
||||
border-width: 12px;
|
||||
margin-left: -12px;
|
||||
}
|
||||
.dropdown-menu:before {
|
||||
border-color: rgba(51, 153, 170, 0);
|
||||
border-bottom-color: #ccc;
|
||||
border-width: 13px;
|
||||
margin-left: -13px;
|
||||
}
|
||||
|
||||
.navbar-dark .navbar-nav .nav-link:focus {
|
||||
color: #ccc;
|
||||
}
|
||||
|
||||
.header-link {
|
||||
position: absolute;
|
||||
right: 0.6em;
|
||||
opacity: 0;
|
||||
-webkit-transition: opacity 0.2s ease-in-out 0.1s;
|
||||
-moz-transition: opacity 0.2s ease-in-out 0.1s;
|
||||
-ms-transition: opacity 0.2s ease-in-out 0.1s;
|
||||
}
|
||||
|
||||
h2:hover .header-link,
|
||||
h3:hover .header-link,
|
||||
h4:hover .header-link,
|
||||
h5:hover .header-link,
|
||||
h6:hover .header-link {
|
||||
opacity: 1;
|
||||
}
|
||||
|
||||
/* =============================================================================
|
||||
Lesson Typography
|
||||
========================================================================== */
|
||||
|
||||
a {text-decoration:none;}
|
||||
|
||||
a:link {color: #38c;}
|
||||
a:visited {color: #39a;}
|
||||
a:hover {color: #555;}
|
||||
a:active {color: #555;}
|
||||
|
||||
b, strong { font-weight: bold; }
|
||||
|
||||
blockquote { margin: 1em 2em; padding: 0 1em 0 1em; font-style: italic; border:1px solid #666; background: #eeeeee;}
|
||||
|
||||
hr {
|
||||
display: block; height: 1px; border: 0; border-top: 1px solid #ccc; margin: 2em 0; padding: 0; }
|
||||
|
||||
img {
|
||||
max-width:100%;
|
||||
}
|
||||
|
||||
ins { background: #ff9; color: #000; text-decoration: none; }
|
||||
|
||||
|
||||
h1,h2,h3,h4,h5 {
|
||||
font-family:'Crete Round', serif;
|
||||
font-weight:normal;
|
||||
clear:both;
|
||||
}
|
||||
|
||||
|
||||
h1 {
|
||||
font-size:2rem;
|
||||
margin-bottom:1.5rem;
|
||||
letter-spacing:-.03rem;
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
h2 {
|
||||
font-size:1.6rem;
|
||||
margin-top:3rem;
|
||||
letter-spacing:-.02rem;
|
||||
}
|
||||
|
||||
h3 {
|
||||
font-size:1.4rem;
|
||||
margin-top:2.5rem;
|
||||
}
|
||||
|
||||
h4 {
|
||||
font-size:1.2rem;
|
||||
margin-top:1.8rem;
|
||||
}
|
||||
|
||||
h5 {
|
||||
font-size:1.0rem;
|
||||
margin-top:1.4rem;
|
||||
}
|
||||
|
||||
h1 a, h2 a, h3 a, h4 a, h5 a {
|
||||
text-decoration:none;
|
||||
}
|
||||
|
||||
h1 a:link { color: #38c; }
|
||||
h1 a:visited {color: #39a; }
|
||||
|
||||
|
||||
/* select button generated by codeblocks.js */
|
||||
.fa-align-left {opacity: 0.2;}
|
||||
.highlight:hover .fa-align-left {opacity: 1;}
|
||||
|
||||
q { quotes: none; }
|
||||
q:before, q:after { content: ""; content: none; }
|
||||
|
||||
small { font-size: 85%; }
|
||||
|
||||
/* Position subscript and superscript content without affecting line-height: h5bp.com/k */
|
||||
sub, sup { font-size: 75%; line-height: 0; position: relative; vertical-align: baseline; }
|
||||
sup { top: -0.5em; }
|
||||
sub { bottom: -0.25em; }
|
||||
|
||||
li {
|
||||
margin-bottom:.5rem;
|
||||
line-height:1.4rem;
|
||||
}
|
||||
|
||||
li.nav-item {
|
||||
margin-bottom:0;
|
||||
}
|
||||
|
||||
.alert {
|
||||
font-family: 'Roboto';
|
||||
}
|
||||
|
||||
.alert h2, .alert h3, .alert h4 {
|
||||
margin-top:0;
|
||||
}
|
||||
|
||||
|
||||
/* =============================================================================
|
||||
Code Highlighting
|
||||
========================================================================== */
|
||||
|
||||
code {
|
||||
font-family: monospace, serif;
|
||||
font-size:.9rem;
|
||||
}
|
||||
|
||||
.highlight {
|
||||
margin: 1rem 0 1rem 0;
|
||||
padding:.5rem .2rem;
|
||||
font-size:.9rem;
|
||||
white-space: pre;
|
||||
word-wrap: normal;
|
||||
overflow: auto;
|
||||
border: 1px solid #eee;
|
||||
background: #fafafa;
|
||||
}
|
||||
|
||||
|
||||
/* =============================================================================
|
||||
Figures
|
||||
========================================================================== */
|
||||
|
||||
figure {
|
||||
margin: 0 auto .5rem;
|
||||
text-align: center;
|
||||
display:table;
|
||||
}
|
||||
|
||||
figcaption {
|
||||
margin-top:.5rem;
|
||||
font-family:'Open Sans';
|
||||
font-size:0.8em;
|
||||
color: #666;
|
||||
display:block;
|
||||
caption-side: bottom;
|
||||
}
|
||||
|
||||
.author-info, .citation-info {
|
||||
border-top:1px solid #333;
|
||||
padding-top:1rem;
|
||||
margin-top:2rem;
|
||||
}
|
||||
|
||||
.author-name, .suggested-citation-header {
|
||||
font-family:'Roboto Condensed';
|
||||
font-weight: 600;
|
||||
font-size:1.2rem;
|
||||
color: #666;
|
||||
text-transform:uppercase;
|
||||
}
|
||||
|
||||
.author-description p, .suggested-citation-text p {
|
||||
font-size:0.9rem;
|
||||
font-family:'Open Sans';
|
||||
color: #666;
|
||||
}
|
||||
|
||||
/* =============================================================================
|
||||
Tables
|
||||
========================================================================== */
|
||||
|
||||
table {
|
||||
width: 100%;
|
||||
margin-bottom: 1em;
|
||||
}
|
||||
|
||||
th, td {
|
||||
padding: 10px;
|
||||
text-align: left;
|
||||
border-bottom: 1px solid #ddd;
|
||||
}
|
||||
|
||||
thead {
|
||||
background-color: #535353;
|
||||
color: #fff;
|
||||
font-weight: bold;
|
||||
}
|
||||
|
||||
tr:nth-child(even) {background-color: #f2f2f2}
|
||||
|
||||
/* =============================================================================
|
||||
Blog Index and Layout
|
||||
========================================================================== */
|
||||
|
||||
.blog-header {
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
.blog-header h2 {
|
||||
margin:0;
|
||||
line-height: 2rem;
|
||||
}
|
||||
|
||||
.blog-header h3 { /*author*/
|
||||
margin-top:.4rem;
|
||||
color: #666;
|
||||
font-size:1rem;
|
||||
}
|
||||
|
||||
.blog-header h4{
|
||||
color: #999;
|
||||
font-size:1rem;
|
||||
margin-bottom:.2rem;
|
||||
font-family:'Roboto Condensed';
|
||||
text-transform:uppercase;
|
||||
}
|
||||
|
||||
.blog-header figure {
|
||||
max-width:80%;
|
||||
}
|
||||
|
||||
.blog-header figcaption {
|
||||
text-align: center;
|
||||
}
|
||||
|
||||
.blog-page-header {
|
||||
margin-bottom:3rem;
|
||||
}
|
||||
|
||||
/* =============================================================================
|
||||
Project Team
|
||||
========================================================================== */
|
||||
|
||||
.contact-box {
|
||||
margin-bottom:3rem;
|
||||
}
|
||||
|
||||
/* =============================================================================
|
||||
Footer
|
||||
========================================================================== */
|
||||
|
||||
footer[role="contentinfo"] {
|
||||
margin-top: 2rem;
|
||||
padding: 2rem 0;
|
||||
font-family:'Open Sans';
|
||||
font-size:.9rem;
|
||||
color: #fff;
|
||||
background-color:#666;
|
||||
text-align:center;
|
||||
}
|
||||
|
||||
footer a, footer a:link, footer a:visited {
|
||||
color: #fff;
|
||||
border-bottom:1px #eee dotted;
|
||||
}
|
||||
|
||||
footer a:hover {
|
||||
text-decoration: none;
|
||||
border-bottom:1px #fff solid;
|
||||
}
|
||||
|
||||
footer .fa {
|
||||
margin: 0 .2rem 0rem 0rem;
|
||||
}
|
||||
|
||||
.footer-head {
|
||||
font-size:1.1rem;
|
||||
line-height:1.4rem;
|
||||
margin-bottom:1rem;
|
||||
}
|
||||
|
||||
|
||||
} /* end screen */
|
||||
|
||||
@media only screen and (max-width: 768px) {
|
||||
.garnish {
|
||||
display:none;
|
||||
}
|
||||
.dropdown-menu:after, .dropdown-menu:before {
|
||||
display:none;
|
||||
}
|
||||
}
|
||||
|
||||
/* Print Styling */
|
||||
|
||||
@media screen {
|
||||
/* Class to hide elements only shown when printing */
|
||||
.hide-print {
|
||||
display: none !important;
|
||||
}
|
||||
}
|
||||
|
||||
@media print {
|
||||
* { background: transparent !important; color: black !important; box-shadow:none !important; text-shadow: none !important; filter:none !important; -ms-filter: none !important; } /* Black prints faster: h5bp.com/s */
|
||||
a, a:visited { text-decoration: underline; }
|
||||
a[href]:after { content: " (" attr(href) ")"; }
|
||||
abbr[title]:after { content: " (" attr(title) ")"; }
|
||||
a[href^="javascript:"]:after, a[href^="#"]:after { content: ""; } /* Don't show links for images, or javascript/internal links */
|
||||
pre, blockquote {
|
||||
border: 1px solid #999;
|
||||
page-break-inside: avoid;
|
||||
margin: 0.5cm;
|
||||
padding: 0.5cm
|
||||
}
|
||||
thead { display: table-header-group; } /* h5bp.com/t */
|
||||
tr, img { page-break-inside: avoid; }
|
||||
img { max-width: 100% !important; }
|
||||
@page {
|
||||
margin: 1.5cm;
|
||||
}
|
||||
|
||||
body { font-size: 0.85rem;}
|
||||
p, h2, h3 { orphans: 3; widows: 3; }
|
||||
h1, h2, h3 { page-break-after: avoid; }
|
||||
h1 { font-size: 1.4rem; }
|
||||
h2 { font-size: 1.1rem; }
|
||||
h3 { font-size: 1rem; }
|
||||
h4 { font-size: 0.9rem; }
|
||||
.header-bottom {
|
||||
margin-bottom: 2rem;
|
||||
page-break-after: always;
|
||||
}
|
||||
.hide-screen {
|
||||
/* Hide elements that only appear on screen */
|
||||
display: none !important;
|
||||
}
|
||||
|
||||
.print-header {
|
||||
/* format navbar for print */
|
||||
display: block;
|
||||
z-index:1030;
|
||||
width: 100%;
|
||||
height: 3rem;
|
||||
padding: .6rem 1rem;
|
||||
margin-bottom: 1rem;
|
||||
color:#fff;
|
||||
white-space: nowrap;
|
||||
font-family: 'Crete Round', serif;
|
||||
border-bottom: 1px solid lightgrey;
|
||||
}
|
||||
}
|
1527
lod/tutorial/en/lessons/retired/graph-databases-and-SPARQL.html
Normal file
BIN
lod/tutorial/gallery/graph-databases-and-SPARQL.png
Normal file
After Width: | Height: | Size: 41 KiB |
1527
lod/tutorial/graph-databases-and-SPARQL.html
Normal file
17
lod/tutorial/images/ORCIDiD_iconvector.svg
Normal file
@@ -0,0 +1,17 @@
|
||||
<?xml version="1.0" encoding="utf-8"?>
|
||||
<!-- Generator: Adobe Illustrator 19.1.0, SVG Export Plug-In . SVG Version: 6.00 Build 0) -->
|
||||
<svg version="1.1" id="Layer_1" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" x="0px" y="0px"
|
||||
viewBox="0 0 256 256" style="enable-background:new 0 0 256 256;" xml:space="preserve">
|
||||
<style type="text/css">
|
||||
.st0{fill:#A6CE39;}
|
||||
.st1{fill:#FFFFFF;}
|
||||
</style>
|
||||
<path class="st0" d="M256,128c0,70.7-57.3,128-128,128C57.3,256,0,198.7,0,128C0,57.3,57.3,0,128,0C198.7,0,256,57.3,256,128z"/>
|
||||
<g>
|
||||
<path class="st1" d="M86.3,186.2H70.9V79.1h15.4v48.4V186.2z"/>
|
||||
<path class="st1" d="M108.9,79.1h41.6c39.6,0,57,28.3,57,53.6c0,27.5-21.5,53.6-56.8,53.6h-41.8V79.1z M124.3,172.4h24.5
|
||||
c34.9,0,42.9-26.5,42.9-39.7c0-21.5-13.7-39.7-43.7-39.7h-23.7V172.4z"/>
|
||||
<path class="st1" d="M88.7,56.8c0,5.5-4.5,10.1-10.1,10.1c-5.6,0-10.1-4.6-10.1-10.1c0-5.6,4.5-10.1,10.1-10.1
|
||||
C84.2,46.7,88.7,51.3,88.7,56.8z"/>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 983 B |
BIN
lod/tutorial/images/doi_icon.jpg
Normal file
After Width: | Height: | Size: 65 KiB |
BIN
lod/tutorial/images/favicons/en_favicon.ico
Normal file
After Width: | Height: | Size: 318 B |
BIN
lod/tutorial/images/favicons/es_favicon.ico
Normal file
After Width: | Height: | Size: 318 B |
BIN
lod/tutorial/images/graph-databases-and-SPARQL.png
Normal file
After Width: | Height: | Size: 41 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-01.png
Normal file
After Width: | Height: | Size: 19 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-02.png
Normal file
After Width: | Height: | Size: 9.6 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-03.png
Normal file
After Width: | Height: | Size: 21 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-04.png
Normal file
After Width: | Height: | Size: 22 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-05.png
Normal file
After Width: | Height: | Size: 34 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-06.png
Normal file
After Width: | Height: | Size: 190 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-07.png
Normal file
After Width: | Height: | Size: 46 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-08.png
Normal file
After Width: | Height: | Size: 46 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-09.png
Normal file
After Width: | Height: | Size: 112 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-10.png
Normal file
After Width: | Height: | Size: 31 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-11.png
Normal file
After Width: | Height: | Size: 132 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-12.png
Normal file
After Width: | Height: | Size: 58 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-13.png
Normal file
After Width: | Height: | Size: 27 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-14.png
Normal file
After Width: | Height: | Size: 97 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql-lod-15.png
Normal file
After Width: | Height: | Size: 351 KiB |
@@ -0,0 +1,40 @@
|
||||
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
|
||||
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
|
||||
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
|
||||
<!-- Generated by graphviz version 2.38.0 (20140413.2041)
|
||||
-->
|
||||
<!-- Title: %3 Pages: 1 -->
|
||||
<svg width="262pt" height="124pt"
|
||||
viewBox="0.00 0.00 261.53 124.00" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
|
||||
<g id="graph0" class="graph" transform="scale(1 1) rotate(0) translate(4 120)">
|
||||
<title>%3</title>
|
||||
<polygon fill="white" stroke="none" points="-4,4 -4,-120 257.533,-120 257.533,4 -4,4"/>
|
||||
<!-- nw -->
|
||||
<g id="node1" class="node"><title>nw</title>
|
||||
<ellipse fill="none" stroke="gray" cx="49.3505" cy="-98" rx="49.2014" ry="18"/>
|
||||
<text text-anchor="middle" x="49.3505" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">The Nightwatch</text>
|
||||
</g>
|
||||
<!-- oil -->
|
||||
<g id="node3" class="node"><title>oil</title>
|
||||
<ellipse fill="none" stroke="gray" cx="117.35" cy="-18" rx="42.8742" ry="18"/>
|
||||
<text text-anchor="middle" x="117.35" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">oil on canvas</text>
|
||||
</g>
|
||||
<!-- nw->oil -->
|
||||
<g id="edge1" class="edge"><title>nw->oil</title>
|
||||
<path fill="none" stroke="gray" d="M63.7715,-80.4582C73.3018,-69.5265 85.9453,-55.0236 96.5567,-42.8517"/>
|
||||
<polygon fill="gray" stroke="gray" points="99.2502,-45.0882 103.183,-35.2505 93.9738,-40.4882 99.2502,-45.0882"/>
|
||||
<text text-anchor="middle" x="108.138" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
<!-- wb -->
|
||||
<g id="node2" class="node"><title>wb</title>
|
||||
<ellipse fill="none" stroke="gray" cx="185.35" cy="-98" rx="68.3645" ry="18"/>
|
||||
<text text-anchor="middle" x="185.35" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Woman with a Balance</text>
|
||||
</g>
|
||||
<!-- wb->oil -->
|
||||
<g id="edge2" class="edge"><title>wb->oil</title>
|
||||
<path fill="none" stroke="gray" d="M170.595,-80.0752C161.138,-69.2266 148.718,-54.9801 138.252,-42.9755"/>
|
||||
<polygon fill="gray" stroke="gray" points="140.589,-40.3299 131.38,-35.0922 135.313,-44.9299 140.589,-40.3299"/>
|
||||
<text text-anchor="middle" x="176.138" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 2.2 KiB |
104
lod/tutorial/images/graph-databases-and-SPARQL/sparql01.svg
Normal file
@@ -0,0 +1,104 @@
|
||||
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
|
||||
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
|
||||
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
|
||||
<!-- Generated by graphviz version 2.38.0 (20140413.2041)
|
||||
-->
|
||||
<!-- Title: %3 Pages: 1 -->
|
||||
<svg width="438pt" height="212pt"
|
||||
viewBox="0.00 0.00 437.70 212.00" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
|
||||
<g id="graph0" class="graph" transform="scale(1 1) rotate(0) translate(4 208)">
|
||||
<title>%3</title>
|
||||
<polygon fill="white" stroke="none" points="-4,4 -4,-208 433.703,-208 433.703,4 -4,4"/>
|
||||
<!-- nw -->
|
||||
<g id="node1" class="node"><title>nw</title>
|
||||
<ellipse fill="none" stroke="gray" cx="132" cy="-186" rx="49.2014" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-183" font-family="Helvetica,sans-Serif" font-size="10.00">The Nightwatch</text>
|
||||
</g>
|
||||
<!-- rr -->
|
||||
<g id="node2" class="node"><title>rr</title>
|
||||
<ellipse fill="none" stroke="gray" cx="132" cy="-98" rx="60.0217" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Rembrandt van Rijn</text>
|
||||
</g>
|
||||
<!-- nw->rr -->
|
||||
<g id="edge1" class="edge"><title>nw->rr</title>
|
||||
<path fill="none" stroke="gray" d="M132,-167.597C132,-155.746 132,-139.817 132,-126.292"/>
|
||||
<polygon fill="gray" stroke="gray" points="135.5,-126.084 132,-116.084 128.5,-126.084 135.5,-126.084"/>
|
||||
<text text-anchor="middle" x="150.678" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="150.678" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created by</text>
|
||||
</g>
|
||||
<!-- oil -->
|
||||
<g id="node5" class="node"><title>oil</title>
|
||||
<ellipse fill="none" stroke="gray" cx="253" cy="-98" rx="42.8742" ry="18"/>
|
||||
<text text-anchor="middle" x="253" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">oil on canvas</text>
|
||||
</g>
|
||||
<!-- nw->oil -->
|
||||
<g id="edge3" class="edge"><title>nw->oil</title>
|
||||
<path fill="none" stroke="gray" d="M153.632,-169.625C173.196,-155.72 202.162,-135.133 223.779,-119.768"/>
|
||||
<polygon fill="gray" stroke="gray" points="225.837,-122.6 231.96,-113.954 221.781,-116.894 225.837,-122.6"/>
|
||||
<text text-anchor="middle" x="225.787" y="-139.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
<!-- nwd -->
|
||||
<g id="node7" class="node"><title>nwd</title>
|
||||
<ellipse fill="none" stroke="gray" cx="27" cy="-98" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="27" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">1642</text>
|
||||
</g>
|
||||
<!-- nw->nwd -->
|
||||
<g id="edge2" class="edge"><title>nw->nwd</title>
|
||||
<path fill="none" stroke="gray" d="M112.741,-169.226C95.413,-155.034 69.877,-134.118 51.1799,-118.804"/>
|
||||
<polygon fill="gray" stroke="gray" points="53.3315,-116.043 43.3774,-112.414 48.896,-121.458 53.3315,-116.043"/>
|
||||
<text text-anchor="middle" x="106.566" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="106.566" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created in</text>
|
||||
</g>
|
||||
<!-- d -->
|
||||
<g id="node6" class="node"><title>d</title>
|
||||
<ellipse fill="none" stroke="gray" cx="263" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="263" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">Dutch</text>
|
||||
</g>
|
||||
<!-- rr->d -->
|
||||
<g id="edge5" class="edge"><title>rr->d</title>
|
||||
<path fill="none" stroke="gray" d="M157.881,-81.5897C180.033,-68.4005 211.867,-49.4456 234.691,-35.8559"/>
|
||||
<polygon fill="gray" stroke="gray" points="236.761,-38.6968 243.563,-30.5735 233.18,-32.6822 236.761,-38.6968"/>
|
||||
<text text-anchor="middle" x="227.572" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has nationality</text>
|
||||
</g>
|
||||
<!-- rrb -->
|
||||
<g id="node8" class="node"><title>rrb</title>
|
||||
<ellipse fill="none" stroke="gray" cx="132" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">1606</text>
|
||||
</g>
|
||||
<!-- rr->rrb -->
|
||||
<g id="edge4" class="edge"><title>rr->rrb</title>
|
||||
<path fill="none" stroke="gray" d="M132,-79.6893C132,-69.8938 132,-57.4218 132,-46.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="135.5,-46.2623 132,-36.2623 128.5,-46.2624 135.5,-46.2623"/>
|
||||
<text text-anchor="middle" x="152.455" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">was born in</text>
|
||||
</g>
|
||||
<!-- jv -->
|
||||
<g id="node3" class="node"><title>jv</title>
|
||||
<ellipse fill="none" stroke="gray" cx="372" cy="-98" rx="57.9076" ry="18"/>
|
||||
<text text-anchor="middle" x="372" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Johannes Vermeer</text>
|
||||
</g>
|
||||
<!-- jv->d -->
|
||||
<g id="edge6" class="edge"><title>jv->d</title>
|
||||
<path fill="none" stroke="gray" d="M349.942,-81.2155C332.328,-68.6108 307.605,-50.9191 289.023,-37.6222"/>
|
||||
<polygon fill="gray" stroke="gray" points="290.879,-34.6462 280.71,-31.673 286.805,-40.3388 290.879,-34.6462"/>
|
||||
<text text-anchor="middle" x="345.572" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has nationality</text>
|
||||
</g>
|
||||
<!-- wb -->
|
||||
<g id="node4" class="node"><title>wb</title>
|
||||
<ellipse fill="none" stroke="gray" cx="277" cy="-186" rx="68.3645" ry="18"/>
|
||||
<text text-anchor="middle" x="277" y="-183" font-family="Helvetica,sans-Serif" font-size="10.00">Woman with a Balance</text>
|
||||
</g>
|
||||
<!-- wb->jv -->
|
||||
<g id="edge7" class="edge"><title>wb->jv</title>
|
||||
<path fill="none" stroke="gray" d="M295.278,-168.646C301.824,-162.776 309.251,-156.101 316,-150 325.983,-140.975 336.934,-131.015 346.484,-122.31"/>
|
||||
<polygon fill="gray" stroke="gray" points="349.095,-124.666 354.124,-115.341 344.377,-119.494 349.095,-124.666"/>
|
||||
<text text-anchor="middle" x="350.678" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="350.678" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created by</text>
|
||||
</g>
|
||||
<!-- wb->oil -->
|
||||
<g id="edge8" class="edge"><title>wb->oil</title>
|
||||
<path fill="none" stroke="gray" d="M272.258,-168.009C268.905,-155.995 264.342,-139.641 260.498,-125.869"/>
|
||||
<polygon fill="gray" stroke="gray" points="263.791,-124.647 257.732,-115.956 257.049,-126.528 263.791,-124.647"/>
|
||||
<text text-anchor="middle" x="289.787" y="-139.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 6.2 KiB |
104
lod/tutorial/images/graph-databases-and-SPARQL/sparql02-1.svg
Normal file
@@ -0,0 +1,104 @@
|
||||
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
|
||||
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
|
||||
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
|
||||
<!-- Generated by graphviz version 2.38.0 (20140413.2041)
|
||||
-->
|
||||
<!-- Title: %3 Pages: 1 -->
|
||||
<svg width="438pt" height="212pt"
|
||||
viewBox="0.00 0.00 437.70 212.00" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
|
||||
<g id="graph0" class="graph" transform="scale(1 1) rotate(0) translate(4 208)">
|
||||
<title>%3</title>
|
||||
<polygon fill="white" stroke="none" points="-4,4 -4,-208 433.703,-208 433.703,4 -4,4"/>
|
||||
<!-- nw -->
|
||||
<g id="node1" class="node"><title>nw</title>
|
||||
<ellipse fill="none" stroke="red" cx="132" cy="-186" rx="49.2014" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-183" font-family="Helvetica,sans-Serif" font-size="10.00">The Nightwatch</text>
|
||||
</g>
|
||||
<!-- rr -->
|
||||
<g id="node2" class="node"><title>rr</title>
|
||||
<ellipse fill="none" stroke="red" cx="132" cy="-98" rx="60.0217" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Rembrandt van Rijn</text>
|
||||
</g>
|
||||
<!-- nw->rr -->
|
||||
<g id="edge1" class="edge"><title>nw->rr</title>
|
||||
<path fill="none" stroke="orange" d="M132,-167.597C132,-155.746 132,-139.817 132,-126.292"/>
|
||||
<polygon fill="orange" stroke="orange" points="135.5,-126.084 132,-116.084 128.5,-126.084 135.5,-126.084"/>
|
||||
<text text-anchor="middle" x="150.678" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="150.678" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created by</text>
|
||||
</g>
|
||||
<!-- oil -->
|
||||
<g id="node5" class="node"><title>oil</title>
|
||||
<ellipse fill="none" stroke="gray" cx="253" cy="-98" rx="42.8742" ry="18"/>
|
||||
<text text-anchor="middle" x="253" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">oil on canvas</text>
|
||||
</g>
|
||||
<!-- nw->oil -->
|
||||
<g id="edge3" class="edge"><title>nw->oil</title>
|
||||
<path fill="none" stroke="gray" d="M153.632,-169.625C173.196,-155.72 202.162,-135.133 223.779,-119.768"/>
|
||||
<polygon fill="gray" stroke="gray" points="225.837,-122.6 231.96,-113.954 221.781,-116.894 225.837,-122.6"/>
|
||||
<text text-anchor="middle" x="225.787" y="-139.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
<!-- nwd -->
|
||||
<g id="node7" class="node"><title>nwd</title>
|
||||
<ellipse fill="none" stroke="gray" cx="27" cy="-98" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="27" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">1642</text>
|
||||
</g>
|
||||
<!-- nw->nwd -->
|
||||
<g id="edge2" class="edge"><title>nw->nwd</title>
|
||||
<path fill="none" stroke="gray" d="M112.741,-169.226C95.413,-155.034 69.877,-134.118 51.1799,-118.804"/>
|
||||
<polygon fill="gray" stroke="gray" points="53.3315,-116.043 43.3774,-112.414 48.896,-121.458 53.3315,-116.043"/>
|
||||
<text text-anchor="middle" x="106.566" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="106.566" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created in</text>
|
||||
</g>
|
||||
<!-- d -->
|
||||
<g id="node6" class="node"><title>d</title>
|
||||
<ellipse fill="none" stroke="orange" cx="263" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="263" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">Dutch</text>
|
||||
</g>
|
||||
<!-- rr->d -->
|
||||
<g id="edge5" class="edge"><title>rr->d</title>
|
||||
<path fill="none" stroke="orange" d="M157.881,-81.5897C180.033,-68.4005 211.867,-49.4456 234.691,-35.8559"/>
|
||||
<polygon fill="orange" stroke="orange" points="236.761,-38.6968 243.563,-30.5735 233.18,-32.6822 236.761,-38.6968"/>
|
||||
<text text-anchor="middle" x="227.572" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has nationality</text>
|
||||
</g>
|
||||
<!-- rrb -->
|
||||
<g id="node8" class="node"><title>rrb</title>
|
||||
<ellipse fill="none" stroke="gray" cx="132" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">1606</text>
|
||||
</g>
|
||||
<!-- rr->rrb -->
|
||||
<g id="edge4" class="edge"><title>rr->rrb</title>
|
||||
<path fill="none" stroke="gray" d="M132,-79.6893C132,-69.8938 132,-57.4218 132,-46.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="135.5,-46.2623 132,-36.2623 128.5,-46.2624 135.5,-46.2623"/>
|
||||
<text text-anchor="middle" x="152.455" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">was born in</text>
|
||||
</g>
|
||||
<!-- jv -->
|
||||
<g id="node3" class="node"><title>jv</title>
|
||||
<ellipse fill="none" stroke="red" cx="372" cy="-98" rx="57.9076" ry="18"/>
|
||||
<text text-anchor="middle" x="372" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Johannes Vermeer</text>
|
||||
</g>
|
||||
<!-- jv->d -->
|
||||
<g id="edge6" class="edge"><title>jv->d</title>
|
||||
<path fill="none" stroke="orange" d="M349.942,-81.2155C332.328,-68.6108 307.605,-50.9191 289.023,-37.6222"/>
|
||||
<polygon fill="orange" stroke="orange" points="290.879,-34.6462 280.71,-31.673 286.805,-40.3388 290.879,-34.6462"/>
|
||||
<text text-anchor="middle" x="345.572" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has nationality</text>
|
||||
</g>
|
||||
<!-- wb -->
|
||||
<g id="node4" class="node"><title>wb</title>
|
||||
<ellipse fill="none" stroke="red" cx="277" cy="-186" rx="68.3645" ry="18"/>
|
||||
<text text-anchor="middle" x="277" y="-183" font-family="Helvetica,sans-Serif" font-size="10.00">Woman with a Balance</text>
|
||||
</g>
|
||||
<!-- wb->jv -->
|
||||
<g id="edge7" class="edge"><title>wb->jv</title>
|
||||
<path fill="none" stroke="orange" d="M295.278,-168.646C301.824,-162.776 309.251,-156.101 316,-150 325.983,-140.975 336.934,-131.015 346.484,-122.31"/>
|
||||
<polygon fill="orange" stroke="orange" points="349.095,-124.666 354.124,-115.341 344.377,-119.494 349.095,-124.666"/>
|
||||
<text text-anchor="middle" x="350.678" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="350.678" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created by</text>
|
||||
</g>
|
||||
<!-- wb->oil -->
|
||||
<g id="edge8" class="edge"><title>wb->oil</title>
|
||||
<path fill="none" stroke="gray" d="M272.258,-168.009C268.905,-155.995 264.342,-139.641 260.498,-125.869"/>
|
||||
<polygon fill="gray" stroke="gray" points="263.791,-124.647 257.732,-115.956 257.049,-126.528 263.791,-124.647"/>
|
||||
<text text-anchor="middle" x="289.787" y="-139.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 6.2 KiB |
104
lod/tutorial/images/graph-databases-and-SPARQL/sparql02.svg
Normal file
@@ -0,0 +1,104 @@
|
||||
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
|
||||
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
|
||||
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
|
||||
<!-- Generated by graphviz version 2.38.0 (20140413.2041)
|
||||
-->
|
||||
<!-- Title: %3 Pages: 1 -->
|
||||
<svg width="438pt" height="212pt"
|
||||
viewBox="0.00 0.00 437.70 212.00" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
|
||||
<g id="graph0" class="graph" transform="scale(1 1) rotate(0) translate(4 208)">
|
||||
<title>%3</title>
|
||||
<polygon fill="white" stroke="none" points="-4,4 -4,-208 433.703,-208 433.703,4 -4,4"/>
|
||||
<!-- nw -->
|
||||
<g id="node1" class="node"><title>nw</title>
|
||||
<ellipse fill="none" stroke="red" cx="132" cy="-186" rx="49.2014" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-183" font-family="Helvetica,sans-Serif" font-size="10.00">The Nightwatch</text>
|
||||
</g>
|
||||
<!-- rr -->
|
||||
<g id="node2" class="node"><title>rr</title>
|
||||
<ellipse fill="none" stroke="gray" cx="132" cy="-98" rx="60.0217" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Rembrandt van Rijn</text>
|
||||
</g>
|
||||
<!-- nw->rr -->
|
||||
<g id="edge1" class="edge"><title>nw->rr</title>
|
||||
<path fill="none" stroke="gray" d="M132,-167.597C132,-155.746 132,-139.817 132,-126.292"/>
|
||||
<polygon fill="gray" stroke="gray" points="135.5,-126.084 132,-116.084 128.5,-126.084 135.5,-126.084"/>
|
||||
<text text-anchor="middle" x="150.678" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="150.678" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created by</text>
|
||||
</g>
|
||||
<!-- oil -->
|
||||
<g id="node5" class="node"><title>oil</title>
|
||||
<ellipse fill="none" stroke="orange" cx="253" cy="-98" rx="42.8742" ry="18"/>
|
||||
<text text-anchor="middle" x="253" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">oil on canvas</text>
|
||||
</g>
|
||||
<!-- nw->oil -->
|
||||
<g id="edge3" class="edge"><title>nw->oil</title>
|
||||
<path fill="none" stroke="orange" d="M153.632,-169.625C173.196,-155.72 202.162,-135.133 223.779,-119.768"/>
|
||||
<polygon fill="orange" stroke="orange" points="225.837,-122.6 231.96,-113.954 221.781,-116.894 225.837,-122.6"/>
|
||||
<text text-anchor="middle" x="225.787" y="-139.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
<!-- nwd -->
|
||||
<g id="node7" class="node"><title>nwd</title>
|
||||
<ellipse fill="none" stroke="gray" cx="27" cy="-98" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="27" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">1642</text>
|
||||
</g>
|
||||
<!-- nw->nwd -->
|
||||
<g id="edge2" class="edge"><title>nw->nwd</title>
|
||||
<path fill="none" stroke="gray" d="M112.741,-169.226C95.413,-155.034 69.877,-134.118 51.1799,-118.804"/>
|
||||
<polygon fill="gray" stroke="gray" points="53.3315,-116.043 43.3774,-112.414 48.896,-121.458 53.3315,-116.043"/>
|
||||
<text text-anchor="middle" x="105.455" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="105.455" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">createdin</text>
|
||||
</g>
|
||||
<!-- d -->
|
||||
<g id="node6" class="node"><title>d</title>
|
||||
<ellipse fill="none" stroke="gray" cx="263" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="263" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">Dutch</text>
|
||||
</g>
|
||||
<!-- rr->d -->
|
||||
<g id="edge5" class="edge"><title>rr->d</title>
|
||||
<path fill="none" stroke="gray" d="M157.881,-81.5897C180.033,-68.4005 211.867,-49.4456 234.691,-35.8559"/>
|
||||
<polygon fill="gray" stroke="gray" points="236.761,-38.6968 243.563,-30.5735 233.18,-32.6822 236.761,-38.6968"/>
|
||||
<text text-anchor="middle" x="227.572" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has nationality</text>
|
||||
</g>
|
||||
<!-- rrb -->
|
||||
<g id="node8" class="node"><title>rrb</title>
|
||||
<ellipse fill="none" stroke="gray" cx="132" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="132" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">1606</text>
|
||||
</g>
|
||||
<!-- rr->rrb -->
|
||||
<g id="edge4" class="edge"><title>rr->rrb</title>
|
||||
<path fill="none" stroke="gray" d="M132,-79.6893C132,-69.8938 132,-57.4218 132,-46.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="135.5,-46.2623 132,-36.2623 128.5,-46.2624 135.5,-46.2623"/>
|
||||
<text text-anchor="middle" x="152.455" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">was born in</text>
|
||||
</g>
|
||||
<!-- jv -->
|
||||
<g id="node3" class="node"><title>jv</title>
|
||||
<ellipse fill="none" stroke="gray" cx="372" cy="-98" rx="57.9076" ry="18"/>
|
||||
<text text-anchor="middle" x="372" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">Johannes Vermeer</text>
|
||||
</g>
|
||||
<!-- jv->d -->
|
||||
<g id="edge6" class="edge"><title>jv->d</title>
|
||||
<path fill="none" stroke="gray" d="M349.942,-81.2155C332.328,-68.6108 307.605,-50.9191 289.023,-37.6222"/>
|
||||
<polygon fill="gray" stroke="gray" points="290.879,-34.6462 280.71,-31.673 286.805,-40.3388 290.879,-34.6462"/>
|
||||
<text text-anchor="middle" x="345.572" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">has nationality</text>
|
||||
</g>
|
||||
<!-- wb -->
|
||||
<g id="node4" class="node"><title>wb</title>
|
||||
<ellipse fill="none" stroke="red" cx="277" cy="-186" rx="68.3645" ry="18"/>
|
||||
<text text-anchor="middle" x="277" y="-183" font-family="Helvetica,sans-Serif" font-size="10.00">Woman with a Balance</text>
|
||||
</g>
|
||||
<!-- wb->jv -->
|
||||
<g id="edge7" class="edge"><title>wb->jv</title>
|
||||
<path fill="none" stroke="gray" d="M295.278,-168.646C301.824,-162.776 309.251,-156.101 316,-150 325.983,-140.975 336.934,-131.015 346.484,-122.31"/>
|
||||
<polygon fill="gray" stroke="gray" points="349.095,-124.666 354.124,-115.341 344.377,-119.494 349.095,-124.666"/>
|
||||
<text text-anchor="middle" x="350.678" y="-143.6" font-family="Helvetica,sans-Serif" font-size="8.00">was</text>
|
||||
<text text-anchor="middle" x="350.678" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">created by</text>
|
||||
</g>
|
||||
<!-- wb->oil -->
|
||||
<g id="edge8" class="edge"><title>wb->oil</title>
|
||||
<path fill="none" stroke="orange" d="M272.258,-168.009C268.905,-155.995 264.342,-139.641 260.498,-125.869"/>
|
||||
<polygon fill="orange" stroke="orange" points="263.791,-124.647 257.732,-115.956 257.049,-126.528 263.791,-124.647"/>
|
||||
<text text-anchor="middle" x="289.787" y="-139.6" font-family="Helvetica,sans-Serif" font-size="8.00">has medium</text>
|
||||
</g>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 6.2 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql03.png
Normal file
After Width: | Height: | Size: 34 KiB |
127
lod/tutorial/images/graph-databases-and-SPARQL/sparql04-1.svg
Normal file
@@ -0,0 +1,127 @@
|
||||
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
|
||||
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
|
||||
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
|
||||
<!-- Generated by graphviz version 2.38.0 (20140413.2041)
|
||||
-->
|
||||
<!-- Title: %3 Pages: 1 -->
|
||||
<svg width="636pt" height="364pt"
|
||||
viewBox="0.00 0.00 636.30 364.00" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
|
||||
<g id="graph0" class="graph" transform="scale(1 1) rotate(0) translate(4 360)">
|
||||
<title>%3</title>
|
||||
<polygon fill="white" stroke="none" points="-4,4 -4,-360 632.301,-360 632.301,4 -4,4"/>
|
||||
<!-- o -->
|
||||
<g id="node1" class="node"><title>o</title>
|
||||
<ellipse fill="none" stroke="red" cx="274.27" cy="-338" rx="53.3595" ry="18"/>
|
||||
<text text-anchor="middle" x="274.27" y="-335" font-family="Helvetica,sans-Serif" font-size="10.00">object/PPA82633</text>
|
||||
</g>
|
||||
<!-- th1 -->
|
||||
<g id="node2" class="node"><title>th1</title>
|
||||
<ellipse fill="none" stroke="red" cx="40.2702" cy="-258" rx="40.0417" ry="18"/>
|
||||
<text text-anchor="middle" x="40.2702" y="-255" font-family="Helvetica,sans-Serif" font-size="10.00">thes:x11409</text>
|
||||
</g>
|
||||
<!-- o->th1 -->
|
||||
<g id="edge1" class="edge"><title>o->th1</title>
|
||||
<path fill="none" stroke="red" d="M224.341,-331.342C190.446,-326.351 145.117,-317.397 107.454,-302 93.6639,-296.363 79.5997,-287.87 67.927,-279.917"/>
|
||||
<polygon fill="red" stroke="red" points="69.704,-276.888 59.5111,-273.997 65.6765,-282.613 69.704,-276.888"/>
|
||||
<text text-anchor="middle" x="147.178" y="-295.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:p45_consists_of</text>
|
||||
</g>
|
||||
<!-- dep -->
|
||||
<g id="node4" class="node"><title>dep</title>
|
||||
<ellipse fill="none" stroke="red" cx="172.27" cy="-258" rx="74.1479" ry="18"/>
|
||||
<text text-anchor="middle" x="172.27" y="-255" font-family="Helvetica,sans-Serif" font-size="10.00">person-institution/147800</text>
|
||||
</g>
|
||||
<!-- o->dep -->
|
||||
<g id="edge3" class="edge"><title>o->dep</title>
|
||||
<path fill="none" stroke="red" d="M235.65,-325.351C222.058,-319.869 207.403,-312.234 196.239,-302 191.195,-297.376 186.961,-291.439 183.525,-285.462"/>
|
||||
<polygon fill="red" stroke="red" points="186.46,-283.516 178.779,-276.219 180.234,-286.714 186.46,-283.516"/>
|
||||
<text text-anchor="middle" x="228.286" y="-295.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P62_depicts</text>
|
||||
</g>
|
||||
<!-- etc -->
|
||||
<g id="node6" class="node"><title>etc</title>
|
||||
<ellipse fill="none" stroke="gray" cx="274.27" cy="-18" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="274.27" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">etc...</text>
|
||||
</g>
|
||||
<!-- o->etc -->
|
||||
<g id="edge10" class="edge"><title>o->etc</title>
|
||||
<path fill="none" stroke="gray" d="M274.27,-319.958C274.27,-304.156 274.27,-279.99 274.27,-259 274.27,-259 274.27,-259 274.27,-97 274.27,-80.1099 274.27,-61.1626 274.27,-46.172"/>
|
||||
<polygon fill="gray" stroke="gray" points="277.77,-46.0417 274.27,-36.0418 270.77,-46.0418 277.77,-46.0417"/>
|
||||
</g>
|
||||
<!-- own -->
|
||||
<g id="node7" class="node"><title>own</title>
|
||||
<ellipse fill="none" stroke="red" cx="395.27" cy="-258" rx="93.1176" ry="18"/>
|
||||
<text text-anchor="middle" x="395.27" y="-255" font-family="Helvetica,sans-Serif" font-size="10.00">thesIdentifier:the-british-museum</text>
|
||||
</g>
|
||||
<!-- o->own -->
|
||||
<g id="edge5" class="edge"><title>o->own</title>
|
||||
<path fill="none" stroke="red" d="M297.887,-321.776C315.86,-310.19 340.856,-294.077 361.023,-281.077"/>
|
||||
<polygon fill="red" stroke="red" points="363.112,-283.894 369.621,-275.534 359.319,-278.011 363.112,-283.894"/>
|
||||
<text text-anchor="middle" x="392.854" y="-295.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P54_has_current_owner</text>
|
||||
</g>
|
||||
<!-- con -->
|
||||
<g id="node8" class="node"><title>con</title>
|
||||
<ellipse fill="none" stroke="red" cx="524.27" cy="-178" rx="80.1403" ry="18"/>
|
||||
<text text-anchor="middle" x="524.27" y="-175" font-family="Helvetica,sans-Serif" font-size="10.00">object/PPA82633/concept/1</text>
|
||||
</g>
|
||||
<!-- o->con -->
|
||||
<g id="edge6" class="edge"><title>o->con</title>
|
||||
<path fill="none" stroke="red" d="M322.863,-330.474C381.749,-321.472 475.782,-303.207 497.27,-276 513.047,-256.024 519.612,-227.389 522.339,-206.409"/>
|
||||
<polygon fill="red" stroke="red" points="525.845,-206.541 523.445,-196.222 518.886,-205.786 525.845,-206.541"/>
|
||||
<text text-anchor="middle" x="548.839" y="-255.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P128_carries</text>
|
||||
</g>
|
||||
<!-- th1lab -->
|
||||
<g id="node3" class="node"><title>th1lab</title>
|
||||
<ellipse fill="none" stroke="gray" cx="40.2702" cy="-178" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="40.2702" y="-175" font-family="Helvetica,sans-Serif" font-size="10.00">paper</text>
|
||||
</g>
|
||||
<!-- th1->th1lab -->
|
||||
<g id="edge2" class="edge"><title>th1->th1lab</title>
|
||||
<path fill="none" stroke="gray" d="M40.2702,-239.689C40.2702,-229.894 40.2702,-217.422 40.2702,-206.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="43.7703,-206.262 40.2702,-196.262 36.7703,-206.262 43.7703,-206.262"/>
|
||||
<text text-anchor="middle" x="66.2858" y="-215.6" font-family="Helvetica,sans-Serif" font-size="8.00">skos:prefLabel</text>
|
||||
</g>
|
||||
<!-- deplab -->
|
||||
<g id="node5" class="node"><title>deplab</title>
|
||||
<ellipse fill="none" stroke="gray" cx="172.27" cy="-178" rx="66.8537" ry="18"/>
|
||||
<text text-anchor="middle" x="172.27" y="-175" font-family="Helvetica,sans-Serif" font-size="10.00">Julius Caesar Scaliger</text>
|
||||
</g>
|
||||
<!-- dep->deplab -->
|
||||
<g id="edge4" class="edge"><title>dep->deplab</title>
|
||||
<path fill="none" stroke="gray" d="M172.27,-239.689C172.27,-229.894 172.27,-217.422 172.27,-206.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="175.77,-206.262 172.27,-196.262 168.77,-206.262 175.77,-206.262"/>
|
||||
<text text-anchor="middle" x="198.286" y="-215.6" font-family="Helvetica,sans-Serif" font-size="8.00">skos:prefLabel</text>
|
||||
</g>
|
||||
<!-- contype -->
|
||||
<g id="node9" class="node"><title>contype</title>
|
||||
<ellipse fill="none" stroke="gray" cx="431.27" cy="-98" rx="85.8678" ry="18"/>
|
||||
<text text-anchor="middle" x="431.27" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">ecrm:E73_Information_Object</text>
|
||||
</g>
|
||||
<!-- con->contype -->
|
||||
<g id="edge7" class="edge"><title>con->contype</title>
|
||||
<path fill="none" stroke="gray" d="M504.547,-160.458C491.321,-149.365 473.71,-134.595 459.068,-122.314"/>
|
||||
<polygon fill="gray" stroke="gray" points="461.196,-119.531 451.285,-115.787 456.698,-124.895 461.196,-119.531"/>
|
||||
<text text-anchor="middle" x="494.61" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">rdf:type</text>
|
||||
</g>
|
||||
<!-- concon -->
|
||||
<g id="node10" class="node"><title>concon</title>
|
||||
<ellipse fill="none" stroke="gray" cx="576.27" cy="-98" rx="40.8927" ry="18"/>
|
||||
<text text-anchor="middle" x="576.27" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">thes:x12440</text>
|
||||
</g>
|
||||
<!-- con->concon -->
|
||||
<g id="edge8" class="edge"><title>con->concon</title>
|
||||
<path fill="none" stroke="gray" d="M535.553,-160.075C542.599,-149.507 551.795,-135.713 559.664,-123.91"/>
|
||||
<polygon fill="gray" stroke="gray" points="562.73,-125.619 565.365,-115.357 556.906,-121.736 562.73,-125.619"/>
|
||||
<text text-anchor="middle" x="588.96" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P129_is_about</text>
|
||||
</g>
|
||||
<!-- conlab -->
|
||||
<g id="node11" class="node"><title>conlab</title>
|
||||
<ellipse fill="none" stroke="gray" cx="576.27" cy="-18" rx="33.894" ry="18"/>
|
||||
<text text-anchor="middle" x="576.27" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">academic</text>
|
||||
</g>
|
||||
<!-- concon->conlab -->
|
||||
<g id="edge9" class="edge"><title>concon->conlab</title>
|
||||
<path fill="none" stroke="gray" d="M576.27,-79.6893C576.27,-69.8938 576.27,-57.4218 576.27,-46.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="579.77,-46.2623 576.27,-36.2623 572.77,-46.2624 579.77,-46.2623"/>
|
||||
<text text-anchor="middle" x="602.286" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">skos:prefLabel</text>
|
||||
</g>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 7.8 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql04.png
Normal file
After Width: | Height: | Size: 190 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql05.png
Normal file
After Width: | Height: | Size: 46 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql06.png
Normal file
After Width: | Height: | Size: 112 KiB |
114
lod/tutorial/images/graph-databases-and-SPARQL/sparql07.svg
Normal file
@@ -0,0 +1,114 @@
|
||||
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
|
||||
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
|
||||
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
|
||||
<!-- Generated by graphviz version 2.38.0 (20140413.2041)
|
||||
-->
|
||||
<!-- Title: %3 Pages: 1 -->
|
||||
<svg width="367pt" height="364pt"
|
||||
viewBox="0.00 0.00 367.21 364.00" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
|
||||
<g id="graph0" class="graph" transform="scale(1 1) rotate(0) translate(4 360)">
|
||||
<title>%3</title>
|
||||
<polygon fill="white" stroke="none" points="-4,4 -4,-360 363.215,-360 363.215,4 -4,4"/>
|
||||
<!-- obj -->
|
||||
<g id="node1" class="node"><title>obj</title>
|
||||
<ellipse fill="none" stroke="gray" cx="148.735" cy="-338" rx="148.97" ry="18"/>
|
||||
<text text-anchor="middle" x="148.735" y="-335" font-family="Helvetica,sans-Serif" font-size="10.00">http://collection.britishmuseum.org/id/object/PPA82633</text>
|
||||
</g>
|
||||
<!-- object_type -->
|
||||
<g id="node2" class="node"><title>object_type</title>
|
||||
<ellipse fill="none" stroke="gray" cx="51.7348" cy="-258" rx="38.5366" ry="18"/>
|
||||
<text text-anchor="middle" x="51.7348" y="-255" font-family="Helvetica,sans-Serif" font-size="10.00">object_type</text>
|
||||
</g>
|
||||
<!-- obj->object_type -->
|
||||
<g id="edge1" class="edge"><title>obj->object_type</title>
|
||||
<path fill="none" stroke="gray" d="M98.3951,-320.855C88.4182,-315.964 78.6502,-309.764 70.9106,-302 66.4004,-297.476 62.8568,-291.706 60.1099,-285.873"/>
|
||||
<polygon fill="gray" stroke="gray" points="63.1969,-284.17 56.2095,-276.205 56.7054,-286.789 63.1969,-284.17"/>
|
||||
<text text-anchor="middle" x="108.647" y="-295.6" font-family="Helvetica,sans-Serif" font-size="8.00">bmo:PX_object_type</text>
|
||||
</g>
|
||||
<!-- production -->
|
||||
<g id="node4" class="node"><title>production</title>
|
||||
<ellipse fill="none" stroke="gray" cx="148.735" cy="-258" rx="36.3999" ry="18"/>
|
||||
<text text-anchor="middle" x="148.735" y="-255" font-family="Helvetica,sans-Serif" font-size="10.00">production</text>
|
||||
</g>
|
||||
<!-- obj->production -->
|
||||
<g id="edge3" class="edge"><title>obj->production</title>
|
||||
<path fill="none" stroke="gray" d="M148.735,-319.689C148.735,-309.894 148.735,-297.422 148.735,-286.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="152.235,-286.262 148.735,-276.262 145.235,-286.262 152.235,-286.262"/>
|
||||
<text text-anchor="middle" x="203.657" y="-295.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P108i_was_produced_by</text>
|
||||
</g>
|
||||
<!-- other -->
|
||||
<g id="node8" class="node"><title>other</title>
|
||||
<text text-anchor="middle" x="281.735" y="-255" font-family="Helvetica,sans-Serif" font-size="10.00">Other top-level object attributes</text>
|
||||
</g>
|
||||
<!-- obj->other -->
|
||||
<g id="edge7" class="edge"><title>obj->other</title>
|
||||
<path fill="none" stroke="gray" d="M223.709,-322.337C236.677,-317.399 249.318,-310.804 259.735,-302 264.947,-297.595 269.068,-291.646 272.265,-285.579"/>
|
||||
<polygon fill="gray" stroke="gray" points="275.598,-286.706 276.554,-276.154 269.227,-283.807 275.598,-286.706"/>
|
||||
<text text-anchor="middle" x="271.069" y="-295.6" font-family="Helvetica,sans-Serif" font-size="8.00">...</text>
|
||||
</g>
|
||||
<!-- print -->
|
||||
<g id="node3" class="node"><title>print</title>
|
||||
<ellipse fill="none" stroke="gray" cx="51.7348" cy="-178" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="51.7348" y="-175" font-family="Helvetica,sans-Serif" font-size="10.00">print</text>
|
||||
</g>
|
||||
<!-- object_type->print -->
|
||||
<g id="edge2" class="edge"><title>object_type->print</title>
|
||||
<path fill="none" stroke="gray" d="M51.7348,-239.689C51.7348,-229.894 51.7348,-217.422 51.7348,-206.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="55.2349,-206.262 51.7348,-196.262 48.2349,-206.262 55.2349,-206.262"/>
|
||||
<text text-anchor="middle" x="77.7504" y="-215.6" font-family="Helvetica,sans-Serif" font-size="8.00">skos:prefLabel</text>
|
||||
</g>
|
||||
<!-- date -->
|
||||
<g id="node5" class="node"><title>date</title>
|
||||
<ellipse fill="none" stroke="gray" cx="134.735" cy="-178" rx="27" ry="18"/>
|
||||
<text text-anchor="middle" x="134.735" y="-175" font-family="Helvetica,sans-Serif" font-size="10.00">date</text>
|
||||
</g>
|
||||
<!-- production->date -->
|
||||
<g id="edge4" class="edge"><title>production->date</title>
|
||||
<path fill="none" stroke="gray" d="M143.042,-240.075C141.328,-234.389 139.605,-227.974 138.481,-222 137.543,-217.015 136.839,-211.66 136.311,-206.48"/>
|
||||
<polygon fill="gray" stroke="gray" points="139.778,-205.937 135.456,-196.264 132.803,-206.521 139.778,-205.937"/>
|
||||
<text text-anchor="middle" x="175.862" y="-215.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P9_consists_of</text>
|
||||
</g>
|
||||
<!-- other_prod -->
|
||||
<g id="node9" class="node"><title>other_prod</title>
|
||||
<text text-anchor="middle" x="234.735" y="-175" font-family="Helvetica,sans-Serif" font-size="10.00">Other production info</text>
|
||||
</g>
|
||||
<!-- production->other_prod -->
|
||||
<g id="edge8" class="edge"><title>production->other_prod</title>
|
||||
<path fill="none" stroke="gray" d="M176.351,-246.343C188.633,-240.562 202.579,-232.439 212.735,-222 217.34,-217.267 221.193,-211.376 224.322,-205.489"/>
|
||||
<polygon fill="gray" stroke="gray" points="227.508,-206.94 228.646,-196.406 221.187,-203.931 227.508,-206.94"/>
|
||||
<text text-anchor="middle" x="222.069" y="-215.6" font-family="Helvetica,sans-Serif" font-size="8.00">...</text>
|
||||
</g>
|
||||
<!-- timespan -->
|
||||
<g id="node6" class="node"><title>timespan</title>
|
||||
<ellipse fill="none" stroke="gray" cx="134.735" cy="-98" rx="32.8294" ry="18"/>
|
||||
<text text-anchor="middle" x="134.735" y="-95" font-family="Helvetica,sans-Serif" font-size="10.00">timespan</text>
|
||||
</g>
|
||||
<!-- date->timespan -->
|
||||
<g id="edge5" class="edge"><title>date->timespan</title>
|
||||
<path fill="none" stroke="gray" d="M134.735,-159.689C134.735,-149.894 134.735,-137.422 134.735,-126.335"/>
|
||||
<polygon fill="gray" stroke="gray" points="138.235,-126.262 134.735,-116.262 131.235,-126.262 138.235,-126.262"/>
|
||||
<text text-anchor="middle" x="178.088" y="-135.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P4_has_time-span</text>
|
||||
</g>
|
||||
<!-- start_date -->
|
||||
<g id="node7" class="node"><title>start_date</title>
|
||||
<ellipse fill="none" stroke="gray" cx="66.7348" cy="-18" rx="34.828" ry="18"/>
|
||||
<text text-anchor="middle" x="66.7348" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">start_date</text>
|
||||
</g>
|
||||
<!-- timespan->start_date -->
|
||||
<g id="edge6" class="edge"><title>timespan->start_date</title>
|
||||
<path fill="none" stroke="gray" d="M105.598,-89.5265C91.4682,-84.285 75.76,-75.6942 67.3129,-62 64.4467,-57.3534 63.1708,-51.8529 62.8105,-46.3654"/>
|
||||
<polygon fill="gray" stroke="gray" points="66.3171,-46.1414 63.0686,-36.0569 59.3193,-45.9661 66.3171,-46.1414"/>
|
||||
<text text-anchor="middle" x="124.446" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">ecrm:P82a_begin_of_the_begin</text>
|
||||
</g>
|
||||
<!-- other_date -->
|
||||
<g id="node10" class="node"><title>other_date</title>
|
||||
<text text-anchor="middle" x="202.735" y="-15" font-family="Helvetica,sans-Serif" font-size="10.00">End of begin, Start of end...</text>
|
||||
</g>
|
||||
<!-- timespan->other_date -->
|
||||
<g id="edge9" class="edge"><title>timespan->other_date</title>
|
||||
<path fill="none" stroke="gray" d="M155.643,-84.0943C164.163,-78.1192 173.652,-70.4665 180.735,-62 184.802,-57.138 188.386,-51.398 191.417,-45.7224"/>
|
||||
<polygon fill="gray" stroke="gray" points="194.728,-46.9184 195.983,-36.3981 188.442,-43.8394 194.728,-46.9184"/>
|
||||
<text text-anchor="middle" x="190.069" y="-55.6" font-family="Helvetica,sans-Serif" font-size="8.00">...</text>
|
||||
</g>
|
||||
</g>
|
||||
</svg>
|
After Width: | Height: | Size: 7.3 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql08.png
Normal file
After Width: | Height: | Size: 132 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql09-1.png
Normal file
After Width: | Height: | Size: 27 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql09.png
Normal file
After Width: | Height: | Size: 58 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql10.png
Normal file
After Width: | Height: | Size: 97 KiB |
BIN
lod/tutorial/images/graph-databases-and-SPARQL/sparql11.png
Normal file
After Width: | Height: | Size: 351 KiB |
34
lod/tutorial/js/bootstrap-4-navbar.js
vendored
Normal file
@@ -0,0 +1,34 @@
|
||||
|
||||
/*!
|
||||
* Bootstrap 4 multi dropdown navbar ( https://bootstrapthemes.co/demo/resource/bootstrap-4-multi-dropdown-navbar/ )
|
||||
* Copyright 2017.
|
||||
* Licensed under the GPL license
|
||||
*/
|
||||
|
||||
|
||||
$( document ).ready( function () {
|
||||
$( '.mobile-drop a.dropdown-toggle' ).on( 'click', function ( e ) {
|
||||
var $el = $( this );
|
||||
var $parent = $( this ).offsetParent( ".mobile-drop" );
|
||||
if ($('.show.mobile-drop').length > 0){
|
||||
$('.show.mobile-drop').each(function(item){
|
||||
$(this).toggleClass('show');
|
||||
});
|
||||
}
|
||||
|
||||
var $subMenu = $( this ).next( ".mobile-drop" );
|
||||
$subMenu.toggleClass( 'show' );
|
||||
|
||||
$( this ).parent( "li" ).toggleClass( 'show' );
|
||||
|
||||
$( this ).parents( 'li.nav-item.dropdown.mobile-drop.show' ).on( 'click', function ( e ) {
|
||||
$( '.mobile-drop .show' ).removeClass( "show" );
|
||||
} );
|
||||
|
||||
if ( !$parent.parent().hasClass( 'navbar-nav' ) ) {
|
||||
$el.next().css( { "top": $el[0].offsetTop, "left": $parent.outerWidth() - 4 } );
|
||||
}
|
||||
|
||||
return false;
|
||||
} );
|
||||
} );
|
8
lod/tutorial/js/ext_links.js
Normal file
@@ -0,0 +1,8 @@
|
||||
$(document).ready(function() {
|
||||
$('a').each(function() {
|
||||
var a = new RegExp('/' + window.location.host + '/');
|
||||
if (!a.test(this.href)) {
|
||||
$(this).attr("target","_blank");
|
||||
}
|
||||
});
|
||||
});
|
13
lod/tutorial/js/header_links.js
Normal file
@@ -0,0 +1,13 @@
|
||||
// http://ben.balter.com/2014/03/13/pages-anchor-links/
|
||||
|
||||
$(function() {
|
||||
return $("h2, h3, h4, h5, h6").each(function(i, el) {
|
||||
var $el, icon, id;
|
||||
$el = $(el);
|
||||
id = $el.attr('id');
|
||||
icon = '<i class="fa fa-link" style="font-size: 0.8em"></i>';
|
||||
if (id) {
|
||||
return $el.append($("<a />").addClass("header-link").attr("href", "#" + id).html(icon));
|
||||
}
|
||||
});
|
||||
});
|
1377
lod/tutorial/sparql-datos-abiertos-enlazados.html
Normal file
4112
lod/tutorial/sparql-intro.ipynb
Normal file
2513
lod/tutorial/sparql-vanGogh.ipynb
Normal file
53
lod/upload.sh
Normal file
@@ -0,0 +1,53 @@
|
||||
#!/bin/sh
|
||||
|
||||
# This is a bit messy
|
||||
if [ "$#" -lt 1 ]; then
|
||||
graph="http://example.com/sitc/submission/"
|
||||
endpoint="http://fuseki.gsi.upm.es/hotels/data"
|
||||
else if [ "$#" -lt 2 ]; then
|
||||
endpoint=$1
|
||||
graph_base="http://example.com/sitc"
|
||||
else
|
||||
if [ "$#" -lt 3 ]; then
|
||||
endpoint=$1
|
||||
graph=$2
|
||||
else
|
||||
echo "Usage: $0 [<endpoint>] [<graph_base_uri>]"
|
||||
echo
|
||||
exit 1
|
||||
fi
|
||||
fi
|
||||
fi
|
||||
|
||||
|
||||
upload(){
|
||||
name=$1
|
||||
file=$2
|
||||
echo '###'
|
||||
echo "Uploading: $graph"
|
||||
echo "Graph: $graph"
|
||||
echo "Endpoint: $endpoint"
|
||||
curl -X POST \
|
||||
--digest -u admin:$PASSWORD \
|
||||
-H Content-Type:text/turtle \
|
||||
-T "$file" \
|
||||
--data-urlencode graph=$graph_base/$name \
|
||||
-G $endpoint
|
||||
|
||||
}
|
||||
|
||||
|
||||
total=0
|
||||
echo -n "Password: "
|
||||
read -s PASSWORD
|
||||
|
||||
echo "Uploading synthethic"
|
||||
upload "synthetic" synthetic/reviews.ttl || exit 1
|
||||
|
||||
for i in *.ttl; do
|
||||
identifier=$(echo ${i%.ttl} | md5sum | awk '{print $1}')
|
||||
echo "Uploading $i"
|
||||
upload $identifier $i
|
||||
total=$((total + 1))
|
||||
done
|
||||
echo Uploaded $total
|
@@ -1,6 +0,0 @@
|
||||
import rdflib
|
||||
import sys
|
||||
g = rdflib.Graph()
|
||||
dataset = sys.argv[1] if len(sys.argv) > 1 else 'reviews.ttl'
|
||||
g.parse(dataset, format="n3")
|
||||
print(g.serialize(format="n3").decode('utf-8'))
|
@@ -4,7 +4,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -18,7 +18,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias"
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -71,8 +71,7 @@
|
||||
"source": [
|
||||
"* [Scikit-learn web page](http://scikit-learn.org/stable/)\n",
|
||||
"* [Scikit-learn videos](http://blog.kaggle.com/author/kevin-markham/) and [notebooks](https://github.com/justmarkham/scikit-learn-videos) by Kevin Marham\n",
|
||||
"* [Learning scikit-learn: Machine Learning in Python](http://proquest.safaribooksonline.com/book/programming/python/9781783281930/1dot-machine-learning-a-gentle-introduction/ch01s02_html), Raúl Garreta; Guillermo Moncecchi, Packt Publishing, 2013.\n",
|
||||
"* [Python Machine Learning](http://proquest.safaribooksonline.com/book/programming/python/9781783555130), Sebastian Raschka, Packt Publishing, 2015."
|
||||
"* [Python Machine Learning](https://learning.oreilly.com/library/view/python-machine-learning/9781789955750/), Sebastian Raschka, Packt Publishing, 2019."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -80,15 +79,15 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3",
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
@@ -102,9 +101,26 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.5.2"
|
||||
"version": "3.8.12"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 0
|
||||
"nbformat_minor": 1
|
||||
}
|
||||
|
@@ -4,7 +4,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -18,7 +18,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias"
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -40,10 +40,10 @@
|
||||
"\n",
|
||||
"* Learn to use scikit-learn\n",
|
||||
"* Learn the basic steps to apply machine learning techniques: dataset analysis, load, preprocessing, training, validation, optimization and persistence.\n",
|
||||
"* Learn how to do a exploratory data analysis\n",
|
||||
"* Learn how to do an exploratory data analysis\n",
|
||||
"* Learn how to visualise a dataset\n",
|
||||
"* Learn how to load a bundled dataset\n",
|
||||
"* Learn how to separate the dataset into traning and testing datasets\n",
|
||||
"* Learn how to separate the dataset into training and testing datasets\n",
|
||||
"* Learn how to train a classifier\n",
|
||||
"* Learn how to predict with a trained classifier\n",
|
||||
"* Learn how to evaluate the predictions\n",
|
||||
@@ -63,9 +63,7 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"* [Scikit-learn web page](http://scikit-learn.org/stable/)\n",
|
||||
"* [Scikit-learn videos](http://blog.kaggle.com/author/kevin-markham/) and [notebooks](https://github.com/justmarkham/scikit-learn-videos) by Kevin Marham\n",
|
||||
"* [Learning scikit-learn: Machine Learning in Python](http://proquest.safaribooksonline.com/book/programming/python/9781783281930/1dot-machine-learning-a-gentle-introduction/ch01s02_html), Raúl Garreta; Guillermo Moncecchi, Packt Publishing, 2013.\n",
|
||||
"* [Python Machine Learning](http://proquest.safaribooksonline.com/book/programming/python/9781783555130), Sebastian Raschka, Packt Publishing, 2015."
|
||||
"* [Scikit-learn videos](http://blog.kaggle.com/author/kevin-markham/) and [notebooks](https://github.com/justmarkham/scikit-learn-videos) by Kevin Marham\n"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -73,15 +71,15 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## LIcence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3",
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
@@ -95,9 +93,26 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.5.1"
|
||||
"version": "3.8.12"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 0
|
||||
"nbformat_minor": 1
|
||||
}
|
||||
|
@@ -4,7 +4,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -18,7 +18,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias"
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -87,7 +87,7 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Scikit-learn provides algorithms for solving the following problems:\n",
|
||||
"* **Classification**: Identifying to which category an object belongs to. Some of the available [classification algorithms](http://scikit-learn.org/stable/supervised_learning.html#supervised-learning) are decision trees (ID3, kNN, ...), SVM, Random forest, Perceptron, etc. \n",
|
||||
"* **Classification**: Identifying to which category an object belongs. Some of the available [classification algorithms](http://scikit-learn.org/stable/supervised_learning.html#supervised-learning) are decision trees (ID3, C4.5, ...), kNN, SVM, Random forest, Perceptron, etc. \n",
|
||||
"* **Clustering**: Automatic grouping of similar objects into sets. Some of the available [clustering algorithms](http://scikit-learn.org/stable/modules/clustering.html#clustering) are k-Means, Affinity propagation, etc.\n",
|
||||
"* **Regression**: Predicting a continuous-valued attribute associated with an object. Some of the available [regression algorithms](http://scikit-learn.org/stable/supervised_learning.html#supervised-learning) are linear regression, logistic regression, etc.\n",
|
||||
"* **Dimensionality reduction**: Reducing the number of random variables to consider. Some of the available [dimensionality reduction algorithms](http://scikit-learn.org/stable/modules/decomposition.html#decompositions) are SVD, PCA, etc."
|
||||
@@ -105,7 +105,7 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"In addition, scikit-learn helps in several tasks:\n",
|
||||
"* ** Model selection**: Comparing, validating, choosing parameters and models, and persisting models. Some of the [available functionalities](http://scikit-learn.org/stable/model_selection.html#model-selection) are cross-validation or grid search for optimizing the parameters. \n",
|
||||
"* **Model selection**: Comparing, validating, choosing parameters and models, and persisting models. Some [available functionalities](http://scikit-learn.org/stable/model_selection.html#model-selection) are cross-validation or grid search for optimizing the parameters. \n",
|
||||
"* **Preprocessing**: Several common utility functions and transformer classes to change raw feature vectors into a representation that is more suitable for the downstream estimators. Some of the available [preprocessing functions](http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing) are scaling and normalizing data, or imputing missing values."
|
||||
]
|
||||
},
|
||||
@@ -122,13 +122,15 @@
|
||||
"source": [
|
||||
"If you installed the conda distribution, scikit-learn is already installed! This is the best option.\n",
|
||||
"\n",
|
||||
"Anyway, before starting, update all the packages: `conda update --all`. \n",
|
||||
"\n",
|
||||
"In case it is an old installation, you can update it using conda: `conda update scikit-learn`.\n",
|
||||
"\n",
|
||||
"If it is not installed, install it with conda: `conda install scikit-learn`.\n",
|
||||
"\n",
|
||||
"If you have installed scipy and numpy, you can also installed using pip: `pip install -U scikit-learn`.\n",
|
||||
"If you have installed scipy and numpy, you can also install using pip: `pip install -U scikit-learn`.\n",
|
||||
"\n",
|
||||
"It is not recommended to use pip for installing scipy and numpy. Instead, use conda or install the linux package *python-sklearn*."
|
||||
"It is not recommended to use pip to install scipy and numpy. Instead, use conda or install the Linux package *python-sklearn*."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -154,9 +156,9 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
@@ -176,9 +178,26 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.5.1+"
|
||||
"version": "3.6.7"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 0
|
||||
"nbformat_minor": 1
|
||||
}
|
||||
|
@@ -4,11 +4,11 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"\n",
|
||||
"\n",
|
||||
"\n",
|
||||
"# Course Notes for Learning Intelligent Systems\n",
|
||||
"\n",
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias\n",
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias\n",
|
||||
"\n",
|
||||
"## [Introduction to Machine Learning](2_0_0_Intro_ML.ipynb)"
|
||||
]
|
||||
@@ -34,11 +34,11 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"The goal of this notebook is to learn how to read and load a sample dataset.\n",
|
||||
"This notebook aims to learn how to read and load a sample dataset.\n",
|
||||
"\n",
|
||||
"Scikit-learn comes with some bundled [datasets](http://scikit-learn.org/stable/datasets/): iris, digits, boston, etc.\n",
|
||||
"Scikit-learn comes with some bundled [datasets](https://scikit-learn.org/stable/datasets.html): iris, digits, boston, etc.\n",
|
||||
"\n",
|
||||
"In this notebook we are going to use the Iris dataset."
|
||||
"In this notebook, we will use the Iris dataset."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -54,24 +54,31 @@
|
||||
"source": [
|
||||
"The [Iris flower dataset](https://en.wikipedia.org/wiki/Iris_flower_data_set), available at [UCI dataset repository](https://archive.ics.uci.edu/ml/datasets/Iris), is a classic dataset for classification.\n",
|
||||
"\n",
|
||||
"The dataset consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimetres. Based on the combination of these four features.\n",
|
||||
"The dataset consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimetres. Based on the combination of these four features, a machine learning model will learn to differentiate the species of Iris.\n",
|
||||
"\n",
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"In ordert to read the dataset, we import the datasets bundle and then load the Iris dataset. "
|
||||
"Here you can see the species and the features.\n",
|
||||
"\n",
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"To read the dataset, we import the datasets bundle and then load the Iris dataset. "
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 8,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# import datasets from scikit-learn\n",
|
||||
@@ -90,22 +97,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 9,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"sklearn.datasets.base.Bunch"
|
||||
]
|
||||
},
|
||||
"execution_count": 9,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"#type 'bunch' of a dataset\n",
|
||||
"type(iris)"
|
||||
@@ -113,80 +107,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 10,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"Iris Plants Database\n",
|
||||
"\n",
|
||||
"Notes\n",
|
||||
"-----\n",
|
||||
"Data Set Characteristics:\n",
|
||||
" :Number of Instances: 150 (50 in each of three classes)\n",
|
||||
" :Number of Attributes: 4 numeric, predictive attributes and the class\n",
|
||||
" :Attribute Information:\n",
|
||||
" - sepal length in cm\n",
|
||||
" - sepal width in cm\n",
|
||||
" - petal length in cm\n",
|
||||
" - petal width in cm\n",
|
||||
" - class:\n",
|
||||
" - Iris-Setosa\n",
|
||||
" - Iris-Versicolour\n",
|
||||
" - Iris-Virginica\n",
|
||||
" :Summary Statistics:\n",
|
||||
"\n",
|
||||
" ============== ==== ==== ======= ===== ====================\n",
|
||||
" Min Max Mean SD Class Correlation\n",
|
||||
" ============== ==== ==== ======= ===== ====================\n",
|
||||
" sepal length: 4.3 7.9 5.84 0.83 0.7826\n",
|
||||
" sepal width: 2.0 4.4 3.05 0.43 -0.4194\n",
|
||||
" petal length: 1.0 6.9 3.76 1.76 0.9490 (high!)\n",
|
||||
" petal width: 0.1 2.5 1.20 0.76 0.9565 (high!)\n",
|
||||
" ============== ==== ==== ======= ===== ====================\n",
|
||||
"\n",
|
||||
" :Missing Attribute Values: None\n",
|
||||
" :Class Distribution: 33.3% for each of 3 classes.\n",
|
||||
" :Creator: R.A. Fisher\n",
|
||||
" :Donor: Michael Marshall (MARSHALL%PLU@io.arc.nasa.gov)\n",
|
||||
" :Date: July, 1988\n",
|
||||
"\n",
|
||||
"This is a copy of UCI ML iris datasets.\n",
|
||||
"http://archive.ics.uci.edu/ml/datasets/Iris\n",
|
||||
"\n",
|
||||
"The famous Iris database, first used by Sir R.A Fisher\n",
|
||||
"\n",
|
||||
"This is perhaps the best known database to be found in the\n",
|
||||
"pattern recognition literature. Fisher's paper is a classic in the field and\n",
|
||||
"is referenced frequently to this day. (See Duda & Hart, for example.) The\n",
|
||||
"data set contains 3 classes of 50 instances each, where each class refers to a\n",
|
||||
"type of iris plant. One class is linearly separable from the other 2; the\n",
|
||||
"latter are NOT linearly separable from each other.\n",
|
||||
"\n",
|
||||
"References\n",
|
||||
"----------\n",
|
||||
" - Fisher,R.A. \"The use of multiple measurements in taxonomic problems\"\n",
|
||||
" Annual Eugenics, 7, Part II, 179-188 (1936); also in \"Contributions to\n",
|
||||
" Mathematical Statistics\" (John Wiley, NY, 1950).\n",
|
||||
" - Duda,R.O., & Hart,P.E. (1973) Pattern Classification and Scene Analysis.\n",
|
||||
" (Q327.D83) John Wiley & Sons. ISBN 0-471-22361-1. See page 218.\n",
|
||||
" - Dasarathy, B.V. (1980) \"Nosing Around the Neighborhood: A New System\n",
|
||||
" Structure and Classification Rule for Recognition in Partially Exposed\n",
|
||||
" Environments\". IEEE Transactions on Pattern Analysis and Machine\n",
|
||||
" Intelligence, Vol. PAMI-2, No. 1, 67-71.\n",
|
||||
" - Gates, G.W. (1972) \"The Reduced Nearest Neighbor Rule\". IEEE Transactions\n",
|
||||
" on Information Theory, May 1972, 431-433.\n",
|
||||
" - See also: 1988 MLC Proceedings, 54-64. Cheeseman et al\"s AUTOCLASS II\n",
|
||||
" conceptual clustering system finds 3 classes in the data.\n",
|
||||
" - Many, many more ...\n",
|
||||
"\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# print descrition of the dataset\n",
|
||||
"print(iris.DESCR)"
|
||||
@@ -194,19 +117,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 11,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# names of the features (attributes of the entities)\n",
|
||||
"print(iris.feature_names)"
|
||||
@@ -214,19 +127,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 12,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"['setosa' 'versicolor' 'virginica']\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"#names of the targets(classes of the classifier)\n",
|
||||
"print(iris.target_names)"
|
||||
@@ -234,22 +137,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 13,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"numpy.ndarray"
|
||||
]
|
||||
},
|
||||
"execution_count": 13,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"#type numpy array\n",
|
||||
"type(iris.data)"
|
||||
@@ -264,168 +154,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 14,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"[[ 5.1 3.5 1.4 0.2]\n",
|
||||
" [ 4.9 3. 1.4 0.2]\n",
|
||||
" [ 4.7 3.2 1.3 0.2]\n",
|
||||
" [ 4.6 3.1 1.5 0.2]\n",
|
||||
" [ 5. 3.6 1.4 0.2]\n",
|
||||
" [ 5.4 3.9 1.7 0.4]\n",
|
||||
" [ 4.6 3.4 1.4 0.3]\n",
|
||||
" [ 5. 3.4 1.5 0.2]\n",
|
||||
" [ 4.4 2.9 1.4 0.2]\n",
|
||||
" [ 4.9 3.1 1.5 0.1]\n",
|
||||
" [ 5.4 3.7 1.5 0.2]\n",
|
||||
" [ 4.8 3.4 1.6 0.2]\n",
|
||||
" [ 4.8 3. 1.4 0.1]\n",
|
||||
" [ 4.3 3. 1.1 0.1]\n",
|
||||
" [ 5.8 4. 1.2 0.2]\n",
|
||||
" [ 5.7 4.4 1.5 0.4]\n",
|
||||
" [ 5.4 3.9 1.3 0.4]\n",
|
||||
" [ 5.1 3.5 1.4 0.3]\n",
|
||||
" [ 5.7 3.8 1.7 0.3]\n",
|
||||
" [ 5.1 3.8 1.5 0.3]\n",
|
||||
" [ 5.4 3.4 1.7 0.2]\n",
|
||||
" [ 5.1 3.7 1.5 0.4]\n",
|
||||
" [ 4.6 3.6 1. 0.2]\n",
|
||||
" [ 5.1 3.3 1.7 0.5]\n",
|
||||
" [ 4.8 3.4 1.9 0.2]\n",
|
||||
" [ 5. 3. 1.6 0.2]\n",
|
||||
" [ 5. 3.4 1.6 0.4]\n",
|
||||
" [ 5.2 3.5 1.5 0.2]\n",
|
||||
" [ 5.2 3.4 1.4 0.2]\n",
|
||||
" [ 4.7 3.2 1.6 0.2]\n",
|
||||
" [ 4.8 3.1 1.6 0.2]\n",
|
||||
" [ 5.4 3.4 1.5 0.4]\n",
|
||||
" [ 5.2 4.1 1.5 0.1]\n",
|
||||
" [ 5.5 4.2 1.4 0.2]\n",
|
||||
" [ 4.9 3.1 1.5 0.1]\n",
|
||||
" [ 5. 3.2 1.2 0.2]\n",
|
||||
" [ 5.5 3.5 1.3 0.2]\n",
|
||||
" [ 4.9 3.1 1.5 0.1]\n",
|
||||
" [ 4.4 3. 1.3 0.2]\n",
|
||||
" [ 5.1 3.4 1.5 0.2]\n",
|
||||
" [ 5. 3.5 1.3 0.3]\n",
|
||||
" [ 4.5 2.3 1.3 0.3]\n",
|
||||
" [ 4.4 3.2 1.3 0.2]\n",
|
||||
" [ 5. 3.5 1.6 0.6]\n",
|
||||
" [ 5.1 3.8 1.9 0.4]\n",
|
||||
" [ 4.8 3. 1.4 0.3]\n",
|
||||
" [ 5.1 3.8 1.6 0.2]\n",
|
||||
" [ 4.6 3.2 1.4 0.2]\n",
|
||||
" [ 5.3 3.7 1.5 0.2]\n",
|
||||
" [ 5. 3.3 1.4 0.2]\n",
|
||||
" [ 7. 3.2 4.7 1.4]\n",
|
||||
" [ 6.4 3.2 4.5 1.5]\n",
|
||||
" [ 6.9 3.1 4.9 1.5]\n",
|
||||
" [ 5.5 2.3 4. 1.3]\n",
|
||||
" [ 6.5 2.8 4.6 1.5]\n",
|
||||
" [ 5.7 2.8 4.5 1.3]\n",
|
||||
" [ 6.3 3.3 4.7 1.6]\n",
|
||||
" [ 4.9 2.4 3.3 1. ]\n",
|
||||
" [ 6.6 2.9 4.6 1.3]\n",
|
||||
" [ 5.2 2.7 3.9 1.4]\n",
|
||||
" [ 5. 2. 3.5 1. ]\n",
|
||||
" [ 5.9 3. 4.2 1.5]\n",
|
||||
" [ 6. 2.2 4. 1. ]\n",
|
||||
" [ 6.1 2.9 4.7 1.4]\n",
|
||||
" [ 5.6 2.9 3.6 1.3]\n",
|
||||
" [ 6.7 3.1 4.4 1.4]\n",
|
||||
" [ 5.6 3. 4.5 1.5]\n",
|
||||
" [ 5.8 2.7 4.1 1. ]\n",
|
||||
" [ 6.2 2.2 4.5 1.5]\n",
|
||||
" [ 5.6 2.5 3.9 1.1]\n",
|
||||
" [ 5.9 3.2 4.8 1.8]\n",
|
||||
" [ 6.1 2.8 4. 1.3]\n",
|
||||
" [ 6.3 2.5 4.9 1.5]\n",
|
||||
" [ 6.1 2.8 4.7 1.2]\n",
|
||||
" [ 6.4 2.9 4.3 1.3]\n",
|
||||
" [ 6.6 3. 4.4 1.4]\n",
|
||||
" [ 6.8 2.8 4.8 1.4]\n",
|
||||
" [ 6.7 3. 5. 1.7]\n",
|
||||
" [ 6. 2.9 4.5 1.5]\n",
|
||||
" [ 5.7 2.6 3.5 1. ]\n",
|
||||
" [ 5.5 2.4 3.8 1.1]\n",
|
||||
" [ 5.5 2.4 3.7 1. ]\n",
|
||||
" [ 5.8 2.7 3.9 1.2]\n",
|
||||
" [ 6. 2.7 5.1 1.6]\n",
|
||||
" [ 5.4 3. 4.5 1.5]\n",
|
||||
" [ 6. 3.4 4.5 1.6]\n",
|
||||
" [ 6.7 3.1 4.7 1.5]\n",
|
||||
" [ 6.3 2.3 4.4 1.3]\n",
|
||||
" [ 5.6 3. 4.1 1.3]\n",
|
||||
" [ 5.5 2.5 4. 1.3]\n",
|
||||
" [ 5.5 2.6 4.4 1.2]\n",
|
||||
" [ 6.1 3. 4.6 1.4]\n",
|
||||
" [ 5.8 2.6 4. 1.2]\n",
|
||||
" [ 5. 2.3 3.3 1. ]\n",
|
||||
" [ 5.6 2.7 4.2 1.3]\n",
|
||||
" [ 5.7 3. 4.2 1.2]\n",
|
||||
" [ 5.7 2.9 4.2 1.3]\n",
|
||||
" [ 6.2 2.9 4.3 1.3]\n",
|
||||
" [ 5.1 2.5 3. 1.1]\n",
|
||||
" [ 5.7 2.8 4.1 1.3]\n",
|
||||
" [ 6.3 3.3 6. 2.5]\n",
|
||||
" [ 5.8 2.7 5.1 1.9]\n",
|
||||
" [ 7.1 3. 5.9 2.1]\n",
|
||||
" [ 6.3 2.9 5.6 1.8]\n",
|
||||
" [ 6.5 3. 5.8 2.2]\n",
|
||||
" [ 7.6 3. 6.6 2.1]\n",
|
||||
" [ 4.9 2.5 4.5 1.7]\n",
|
||||
" [ 7.3 2.9 6.3 1.8]\n",
|
||||
" [ 6.7 2.5 5.8 1.8]\n",
|
||||
" [ 7.2 3.6 6.1 2.5]\n",
|
||||
" [ 6.5 3.2 5.1 2. ]\n",
|
||||
" [ 6.4 2.7 5.3 1.9]\n",
|
||||
" [ 6.8 3. 5.5 2.1]\n",
|
||||
" [ 5.7 2.5 5. 2. ]\n",
|
||||
" [ 5.8 2.8 5.1 2.4]\n",
|
||||
" [ 6.4 3.2 5.3 2.3]\n",
|
||||
" [ 6.5 3. 5.5 1.8]\n",
|
||||
" [ 7.7 3.8 6.7 2.2]\n",
|
||||
" [ 7.7 2.6 6.9 2.3]\n",
|
||||
" [ 6. 2.2 5. 1.5]\n",
|
||||
" [ 6.9 3.2 5.7 2.3]\n",
|
||||
" [ 5.6 2.8 4.9 2. ]\n",
|
||||
" [ 7.7 2.8 6.7 2. ]\n",
|
||||
" [ 6.3 2.7 4.9 1.8]\n",
|
||||
" [ 6.7 3.3 5.7 2.1]\n",
|
||||
" [ 7.2 3.2 6. 1.8]\n",
|
||||
" [ 6.2 2.8 4.8 1.8]\n",
|
||||
" [ 6.1 3. 4.9 1.8]\n",
|
||||
" [ 6.4 2.8 5.6 2.1]\n",
|
||||
" [ 7.2 3. 5.8 1.6]\n",
|
||||
" [ 7.4 2.8 6.1 1.9]\n",
|
||||
" [ 7.9 3.8 6.4 2. ]\n",
|
||||
" [ 6.4 2.8 5.6 2.2]\n",
|
||||
" [ 6.3 2.8 5.1 1.5]\n",
|
||||
" [ 6.1 2.6 5.6 1.4]\n",
|
||||
" [ 7.7 3. 6.1 2.3]\n",
|
||||
" [ 6.3 3.4 5.6 2.4]\n",
|
||||
" [ 6.4 3.1 5.5 1.8]\n",
|
||||
" [ 6. 3. 4.8 1.8]\n",
|
||||
" [ 6.9 3.1 5.4 2.1]\n",
|
||||
" [ 6.7 3.1 5.6 2.4]\n",
|
||||
" [ 6.9 3.1 5.1 2.3]\n",
|
||||
" [ 5.8 2.7 5.1 1.9]\n",
|
||||
" [ 6.8 3.2 5.9 2.3]\n",
|
||||
" [ 6.7 3.3 5.7 2.5]\n",
|
||||
" [ 6.7 3. 5.2 2.3]\n",
|
||||
" [ 6.3 2.5 5. 1.9]\n",
|
||||
" [ 6.5 3. 5.2 2. ]\n",
|
||||
" [ 6.2 3.4 5.4 2.3]\n",
|
||||
" [ 5.9 3. 5.1 1.8]]\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"#Data in the iris dataset. The value of the features of the samples.\n",
|
||||
"print(iris.data)"
|
||||
@@ -433,23 +164,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 15,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"[0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0\n",
|
||||
" 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1\n",
|
||||
" 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 2\n",
|
||||
" 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2\n",
|
||||
" 2 2]\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# Target. Category of every sample\n",
|
||||
"print(iris.target)"
|
||||
@@ -457,19 +174,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 16,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"(150, 4)\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# Iris data is a numpy array\n",
|
||||
"# We can inspect its shape (rows, columns). In our case, (n_samples, n_features)\n",
|
||||
@@ -478,39 +185,19 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 17,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"2\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"#Using numpy, I can print the dimensions (here we are working with 2D matriz)\n",
|
||||
"#Using numpy, I can print the dimensions (here we are working with a 2D matrix)\n",
|
||||
"print(iris.data.ndim)"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 18,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"150\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# I can print n_samples\n",
|
||||
"print(iris.data.shape[0])"
|
||||
@@ -518,19 +205,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 19,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"4\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# ... n_features\n",
|
||||
"print(iris.data.shape[1])"
|
||||
@@ -538,19 +215,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 20,
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# names of the features\n",
|
||||
"print(iris.feature_names)"
|
||||
@@ -560,7 +227,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"In following sessions we will learn how to load a dataset from a file (csv, excel, ...) using the pandas library."
|
||||
"In the following sessions, we will learn how to load a dataset from a file (CSV, Excel, ...) using the pandas library."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -588,9 +255,9 @@
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
@@ -610,9 +277,26 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.5.1+"
|
||||
"version": "3.5.5"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 0
|
||||
"nbformat_minor": 1
|
||||
}
|
||||
|
@@ -4,7 +4,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -18,7 +18,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias"
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -55,7 +55,7 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 1,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
@@ -76,12 +76,12 @@
|
||||
"source": [
|
||||
"A common practice in machine learning to evaluate an algorithm is to split the data at hand into two sets, one that we call the **training set** on which we learn data properties and one that we call the **testing set** on which we test these properties. \n",
|
||||
"\n",
|
||||
"We are going to use *scikit-learn* to split the data into random training and testing sets. We follow the ratio 75% for training and 25% for testing. We use `random_state` to ensure that the result is always the same and it is reproducible. (Otherwise, we would get different training and testing sets every time)."
|
||||
"We will use *scikit-learn* to split the data into random training and testing sets. We follow the ratio 75% for training and 25% for testing. We use `random_state` to ensure that the result is always the same and it is reproducible. (Otherwise, we would get different training and testing sets every time)."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 2,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
@@ -93,17 +93,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 4,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"(112, 4) (38, 4)\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# Dimensions of train and testing\n",
|
||||
"print(x_train.shape, x_test.shape)"
|
||||
@@ -111,54 +103,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 5,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"[[ 5.7 2.9 4.2 1.3]\n",
|
||||
" [ 6.7 3.1 4.4 1.4]\n",
|
||||
" [ 4.7 3.2 1.6 0.2]\n",
|
||||
" [ 6.5 2.8 4.6 1.5]\n",
|
||||
" [ 6.1 2.6 5.6 1.4]\n",
|
||||
" [ 6.3 3.3 6. 2.5]\n",
|
||||
" [ 4.8 3.4 1.9 0.2]\n",
|
||||
" [ 5.1 3.5 1.4 0.3]\n",
|
||||
" [ 6.4 3.1 5.5 1.8]\n",
|
||||
" [ 6.9 3.2 5.7 2.3]\n",
|
||||
" [ 6.8 3.2 5.9 2.3]\n",
|
||||
" [ 4.4 3. 1.3 0.2]\n",
|
||||
" [ 6.3 3.4 5.6 2.4]\n",
|
||||
" [ 6.1 2.9 4.7 1.4]\n",
|
||||
" [ 6.9 3.1 5.1 2.3]\n",
|
||||
" [ 6.4 2.9 4.3 1.3]\n",
|
||||
" [ 6. 3. 4.8 1.8]\n",
|
||||
" [ 5.2 3.5 1.5 0.2]\n",
|
||||
" [ 6.3 3.3 4.7 1.6]\n",
|
||||
" [ 7.2 3.2 6. 1.8]\n",
|
||||
" [ 4.9 3.1 1.5 0.1]\n",
|
||||
" [ 5.7 3.8 1.7 0.3]\n",
|
||||
" [ 6.5 3. 5.8 2.2]\n",
|
||||
" [ 4.8 3. 1.4 0.1]\n",
|
||||
" [ 6. 2.2 5. 1.5]\n",
|
||||
" [ 6.2 2.8 4.8 1.8]\n",
|
||||
" [ 6.1 3. 4.6 1.4]\n",
|
||||
" [ 6.1 2.8 4. 1.3]\n",
|
||||
" [ 6.5 3. 5.2 2. ]\n",
|
||||
" [ 5.9 3. 5.1 1.8]\n",
|
||||
" [ 5.6 2.7 4.2 1.3]\n",
|
||||
" [ 6.7 3.1 4.7 1.5]\n",
|
||||
" [ 5.6 2.8 4.9 2. ]\n",
|
||||
" [ 6.4 3.2 5.3 2.3]\n",
|
||||
" [ 6.7 3.1 5.6 2.4]\n",
|
||||
" [ 6.7 3. 5.2 2.3]\n",
|
||||
" [ 5.8 2.7 5.1 1.9]\n",
|
||||
" [ 5.7 3. 4.2 1.2]]\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"#Test set\n",
|
||||
"print (x_test)"
|
||||
@@ -175,14 +122,14 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Standardization of datasets is a common requirement for many machine learning estimators implemented in the scikit; they might behave badly if the individual features do not more or less look like standard normally distributed data: Gaussian with zero mean and unit variance.\n",
|
||||
"Standardization of datasets is a common requirement for many machine learning estimators implemented in the scikit; they might misbehave if the individual features do not more or less look like standard normally distributed data: Gaussian with zero mean and unit variance.\n",
|
||||
"\n",
|
||||
"The preprocessing module further provides a utility class `StandardScaler` to compute the mean and standard deviation on a training set. Later, the same transformation will be applied on the testing set."
|
||||
"The preprocessing module further provides a utility class `StandardScaler` to compute a training set's mean and standard deviation. Later, the same transformation will be applied on the testing set."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 10,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
@@ -195,54 +142,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 11,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"[[-0.09752318 -0.32858743 0.34599443 0.25682671]\n",
|
||||
" [ 1.06445511 0.09442168 0.45718919 0.39124069]\n",
|
||||
" [-1.25950146 0.30592623 -1.09953753 -1.22172707]\n",
|
||||
" [ 0.83205945 -0.54009199 0.56838396 0.52565467]\n",
|
||||
" [ 0.36726814 -0.9631011 1.12435779 0.39124069]\n",
|
||||
" [ 0.59966379 0.51743079 1.34674732 1.86979447]\n",
|
||||
" [-1.14330363 0.72893534 -0.93274538 -1.22172707]\n",
|
||||
" [-0.79471015 0.9404399 -1.2107323 -1.08731309]\n",
|
||||
" [ 0.71586162 0.09442168 1.06876041 0.92889661]\n",
|
||||
" [ 1.29685076 0.30592623 1.17995517 1.60096651]\n",
|
||||
" [ 1.18065293 0.30592623 1.29114994 1.60096651]\n",
|
||||
" [-1.60809495 -0.11708288 -1.26632968 -1.22172707]\n",
|
||||
" [ 0.59966379 0.72893534 1.12435779 1.73538049]\n",
|
||||
" [ 0.36726814 -0.32858743 0.62398134 0.39124069]\n",
|
||||
" [ 1.29685076 0.09442168 0.84637087 1.60096651]\n",
|
||||
" [ 0.71586162 -0.32858743 0.40159181 0.25682671]\n",
|
||||
" [ 0.25107031 -0.11708288 0.67957873 0.92889661]\n",
|
||||
" [-0.67851232 0.9404399 -1.15513491 -1.22172707]\n",
|
||||
" [ 0.59966379 0.51743079 0.62398134 0.66006865]\n",
|
||||
" [ 1.64544425 0.30592623 1.34674732 0.92889661]\n",
|
||||
" [-1.0271058 0.09442168 -1.15513491 -1.35614105]\n",
|
||||
" [-0.09752318 1.57495356 -1.04394015 -1.08731309]\n",
|
||||
" [ 0.83205945 -0.11708288 1.23555256 1.46655253]\n",
|
||||
" [-1.14330363 -0.11708288 -1.2107323 -1.35614105]\n",
|
||||
" [ 0.25107031 -1.80911932 0.79077349 0.52565467]\n",
|
||||
" [ 0.48346596 -0.54009199 0.67957873 0.92889661]\n",
|
||||
" [ 0.36726814 -0.11708288 0.56838396 0.39124069]\n",
|
||||
" [ 0.36726814 -0.54009199 0.23479966 0.25682671]\n",
|
||||
" [ 0.83205945 -0.11708288 0.90196826 1.19772457]\n",
|
||||
" [ 0.13487248 -0.11708288 0.84637087 0.92889661]\n",
|
||||
" [-0.21372101 -0.75159654 0.34599443 0.25682671]\n",
|
||||
" [ 1.06445511 0.09442168 0.62398134 0.52565467]\n",
|
||||
" [-0.21372101 -0.54009199 0.73517611 1.19772457]\n",
|
||||
" [ 0.71586162 0.30592623 0.95756564 1.60096651]\n",
|
||||
" [ 1.06445511 0.09442168 1.12435779 1.73538049]\n",
|
||||
" [ 1.06445511 -0.11708288 0.90196826 1.60096651]\n",
|
||||
" [ 0.01867465 -0.75159654 0.84637087 1.06331059]\n",
|
||||
" [-0.09752318 -0.11708288 0.34599443 0.12241273]]\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# As we see, the iris dataset is now normalized\n",
|
||||
"print(x_test)"
|
||||
@@ -261,7 +163,6 @@
|
||||
"source": [
|
||||
"* [Feature selection](http://scikit-learn.org/stable/modules/feature_selection.html)\n",
|
||||
"* [Classification probability](http://scikit-learn.org/stable/auto_examples/classification/plot_classification_probability.html)\n",
|
||||
"* [Mastering Pandas](http://proquest.safaribooksonline.com/book/programming/python/9781783981960), Femi Anthony, Packt Publishing, 2015.\n",
|
||||
"* [Matplotlib web page](http://matplotlib.org/index.html)\n",
|
||||
"* [Using matlibplot in IPython](http://ipython.readthedocs.org/en/stable/interactive/plotting.html)\n",
|
||||
"* [Seaborn Tutorial](https://stanford.edu/~mwaskom/software/seaborn/tutorial.html)"
|
||||
@@ -272,9 +173,9 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"### Licences\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
@@ -294,7 +195,24 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.6.3"
|
||||
"version": "3.5.6"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
|
@@ -4,7 +4,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -18,7 +18,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias"
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -53,9 +53,9 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"This is an introduction of general ideas about machine learning and the interface of scikit-learn, taken from the [scikit-learn tutorial](http://www.astroml.org/sklearn_tutorial/general_concepts.html). \n",
|
||||
"This is an introduction to general ideas about machine learning and the interface of scikit-learn, taken from the [scikit-learn tutorial](http://www.astroml.org/sklearn_tutorial/general_concepts.html). \n",
|
||||
"\n",
|
||||
"You can skip it during the lab session and read it later,"
|
||||
"You can skip it during the lab session and read it later."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -69,20 +69,20 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Machine learning algorithms are programs that learn a model from a dataset with the aim of making predictions or learning structures to organize the data.\n",
|
||||
"Machine learning algorithms are programs that learn a model from a dataset to make predictions or learn structures to organize the data.\n",
|
||||
"\n",
|
||||
"In scikit-learn, machine learning algorithms take as an input a *numpy* array (n_samples, n_features), where\n",
|
||||
"* **n_samples**: number of samples. Each sample is an item to process (i.e. classify). A sample can be a document, a picture, a sound, a video, a row in database or CSV file, or whatever you can describe with a fixed set of quantitative traits.\n",
|
||||
"* **n_features**: The number of features or distinct traits that can be used to describe each item in a quantitative manner.\n",
|
||||
"In scikit-learn, machine learning algorithms take as input a *numpy* array (n_samples, n_features), where\n",
|
||||
"* **n_samples**: number of samples. Each sample is an item to process (i.e., classify). A sample can be a document, a picture, a sound, a video, a row in a database or CSV file, or whatever you can describe with a fixed set of quantitative traits.\n",
|
||||
"* **n_features**: The number of features or distinct traits that can be used to describe each item quantitatively.\n",
|
||||
"\n",
|
||||
"The number of features should be defined in advance. There is a specific type of feature sets that are high dimensional (e.g. millions of features), but most of the values are zero for a given sample. Using (numpy) arrays, all those values that are zero would also take up memory. For this reason, these feature sets are often represented with sparse matrices (scipy.sparse) instead of (numpy) arrays.\n",
|
||||
"The number of features should be defined in advance. A specific type of feature set is high-dimensional (e.g., millions of features), but most values are zero for a given sample. Using (numpy) arrays, all those zero values would also take up memory. For this reason, these feature sets are often represented with sparse matrices (scipy.sparse) instead of (numpy) arrays.\n",
|
||||
"\n",
|
||||
"The first step in machine learning is **identifying the relevant features** from the input data, and the second step is **extracting the features** from the input data. \n",
|
||||
"\n",
|
||||
"[Machine learning algorithms](http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/) can be classified according to learning style into:\n",
|
||||
"* **Supervised learning**: input data (training dataset) has a known label or result. Example problems are classification and regression. A model is prepared through a training process where it is required to make predictions and is corrected when those predictions are wrong. The training process continues until the model achieves a desired level of accuracy on the training data.\n",
|
||||
"* **Unsupervised learning**: input data is not labeled. A model is prepared by deducing structures present in the input data. This may be to extract general rules. Example problems are clustering, dimensionality reduction and association rule learning.\n",
|
||||
"* **Semi-supervised learning**:i nput data is a mixture of labeled and unlabeled examples. There is a desired prediction problem but the model must learn the structures to organize the data as well as make predictions. Example problems are classification and regression."
|
||||
"* **Unsupervised learning**: input data is not labeled. A model is prepared by deducing structures present in the input data. This may be to extract general rules. Example problems are clustering, dimensionality reduction, and association rule learning.\n",
|
||||
"* **Semi-supervised learning**: input data is a mixture of labeled and unlabeled examples. There is a desired prediction problem, but the model must learn the structures to organize the data and make predictions. Example problems are classification and regression."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -96,8 +96,8 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"In *supervised machine learning models*, the machine learning algorithm takes as an input a training dataset, composed of feature vectors and labels, and produces a predictive model which is used for make prediction on new data.\n",
|
||||
""
|
||||
"In *supervised machine learning models*, the machine learning algorithm takes as input a training dataset, composed of feature vectors and labels, and produces a predictive model used to predict new data.\n",
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -111,7 +111,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"In *unsupervised machine learning models*, the machine learning model algorithm takes as an input the feature vectors and produces a predictive model that is used to fit its parameters so as to best summarize regularities found in the data.\n",
|
||||
"In *unsupervised machine learning models*, the machine learning model algorithm takes as input the feature vectors. It produces a predictive model that is used to fit its parameters to summarize the best regularities found in the data.\n",
|
||||
""
|
||||
]
|
||||
},
|
||||
@@ -129,15 +129,15 @@
|
||||
"scikit-learn has a uniform interface for all the estimators, some methods are only available if the estimator is supervised or unsupervised:\n",
|
||||
"\n",
|
||||
"* Available in *all estimators*:\n",
|
||||
" * **model.fit()**: fit training data. For supervised learning applications, this accepts two arguments: the data X and the labels y (e.g. model.fit(X, y)). For unsupervised learning applications, this accepts only a single argument, the data X (e.g. model.fit(X)).\n",
|
||||
" * **model.fit()**: fit training data. For supervised learning applications, this accepts two arguments: the data X and the labels y (e.g., model.fit(X, y)). For unsupervised learning applications, this accepts only a single argument, the data X (e.g. model.fit(X)).\n",
|
||||
"\n",
|
||||
"* Available in *supervised estimators*:\n",
|
||||
" * **model.predict()**: given a trained model, predict the label of a new set of data. This method accepts one argument, the new data X_new (e.g. model.predict(X_new)), and returns the learned label for each object in the array.\n",
|
||||
" * **model.predict()**: given a trained model, predict the label of a new dataset. This method accepts one argument, the new data X_new (e.g., model.predict(X_new)), and returns the learned label for each object in the array.\n",
|
||||
" * **model.predict_proba()**: For classification problems, some estimators also provide this method, which returns the probability that a new observation has each categorical label. In this case, the label with the highest probability is returned by model.predict().\n",
|
||||
"\n",
|
||||
"* Available in *unsupervised estimators*:\n",
|
||||
" * **model.transform()**: given an unsupervised model, transform new data into the new basis. This also accepts one argument X_new, and returns the new representation of the data based on the unsupervised model.\n",
|
||||
" * **model.fit_transform()**: some estimators implement this method, which performs a fit and a transform on the same input data.\n",
|
||||
" * **model.fit_transform()**: Some estimators implement this method, which performs a fit and a transform on the same input data.\n",
|
||||
"\n",
|
||||
"\n",
|
||||
""
|
||||
@@ -145,9 +145,7 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"collapsed": false
|
||||
},
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## References"
|
||||
]
|
||||
@@ -156,7 +154,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"* [General concepts of machine learning with scikit-learn](http://www.astroml.org/sklearn_tutorial/general_concepts.html)\n",
|
||||
"* [Python Data Science Handbook](https://jakevdp.github.io/PythonDataScienceHandbook/index.html)\n",
|
||||
"* [A Tour of Machine Learning Algorithms](http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/)"
|
||||
]
|
||||
},
|
||||
@@ -171,15 +169,15 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3",
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
@@ -193,9 +191,26 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.5.1+"
|
||||
"version": "3.8.12"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 0
|
||||
"nbformat_minor": 1
|
||||
}
|
||||
|
@@ -4,7 +4,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
""
|
||||
""
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -18,7 +18,7 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © 2016 Carlos A. Iglesias"
|
||||
"Department of Telematic Engineering Systems, Universidad Politécnica de Madrid, © Carlos A. Iglesias"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -48,29 +48,16 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"The goal of this notebook is to learn how to save a model in the the scikit by using Python’s built-in persistence model, namely pickle\n",
|
||||
"The goal of this notebook is to learn how to save a model in the scikit by using Python’s built-in persistence model, namely pickle\n",
|
||||
"\n",
|
||||
"First we recap the previous tasks: load data, preprocess and train the model."
|
||||
"First, we recap the previous tasks: load data, preprocess, and train the model."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 1,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"Pipeline(steps=[('scaler', StandardScaler(copy=True, with_mean=True, with_std=True)), ('KNN', KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',\n",
|
||||
" metric_params=None, n_jobs=1, n_neighbors=5, p=2,\n",
|
||||
" weights='uniform'))])"
|
||||
]
|
||||
},
|
||||
"execution_count": 1,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# load iris\n",
|
||||
"from sklearn import datasets\n",
|
||||
@@ -106,20 +93,9 @@
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 2,
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"array([0])"
|
||||
]
|
||||
},
|
||||
"execution_count": 2,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"import pickle\n",
|
||||
"s = pickle.dumps(model)\n",
|
||||
@@ -131,19 +107,17 @@
|
||||
"cell_type": "markdown",
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"A more efficient alternative to pickle is joblib, especially for big data problems. In this case the model can only be saved to a file and not to a string."
|
||||
"A more efficient alternative to pickle is joblib, especially for big data problems. In this case, the model can only be saved to a file and not to a string."
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": 3,
|
||||
"metadata": {
|
||||
"collapsed": true
|
||||
},
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# save model\n",
|
||||
"from sklearn.externals import joblib\n",
|
||||
"import joblib\n",
|
||||
"joblib.dump(model, 'filename.pkl') \n",
|
||||
"\n",
|
||||
"#load model\n",
|
||||
@@ -162,7 +136,9 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"* [Tutorial scikit-learn](http://scikit-learn.org/stable/tutorial/basic/tutorial.html)\n",
|
||||
"* [Model persistence in scikit-learn](http://scikit-learn.org/stable/modules/model_persistence.html#model-persistence)"
|
||||
"* [Model persistence in scikit-learn](http://scikit-learn.org/stable/modules/model_persistence.html#model-persistence)\n",
|
||||
"* [scikit-learn : Machine Learning Simplified](https://learning.oreilly.com/library/view/scikit-learn-machine/9781788833479/), Raúl Garreta; Guillermo Moncecchi, Packt Publishing, 2017.\n",
|
||||
"* [Python Machine Learning](https://learning.oreilly.com/library/view/python-machine-learning/9781789955750/), Sebastian Raschka, Packt Publishing, 2019."
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -170,15 +146,24 @@
|
||||
"metadata": {},
|
||||
"source": [
|
||||
"## Licence\n",
|
||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||
"\n",
|
||||
"© 2016 Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
"© Carlos A. Iglesias, Universidad Politécnica de Madrid."
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"datacleaner": {
|
||||
"position": {
|
||||
"top": "50px"
|
||||
},
|
||||
"python": {
|
||||
"varRefreshCmd": "try:\n print(_datacleaner.dataframe_metadata())\nexcept:\n print([])"
|
||||
},
|
||||
"window_display": false
|
||||
},
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3",
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
@@ -192,7 +177,24 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.6.3"
|
||||
"version": "3.8.12"
|
||||
},
|
||||
"latex_envs": {
|
||||
"LaTeX_envs_menu_present": true,
|
||||
"autocomplete": true,
|
||||
"bibliofile": "biblio.bib",
|
||||
"cite_by": "apalike",
|
||||
"current_citInitial": 1,
|
||||
"eqLabelWithNumbers": true,
|
||||
"eqNumInitial": 1,
|
||||
"hotkeys": {
|
||||
"equation": "Ctrl-E",
|
||||
"itemize": "Ctrl-I"
|
||||
},
|
||||
"labels_anchors": false,
|
||||
"latex_user_defs": false,
|
||||
"report_style_numbering": false,
|
||||
"user_envs_cfg": false
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
|
BIN
ml1/filename.pkl
Normal file
BIN
ml1/images/iris-classes.png
Normal file
After Width: | Height: | Size: 1.4 MiB |
BIN
ml1/images/iris-features.png
Normal file
After Width: | Height: | Size: 944 KiB |
Before Width: | Height: | Size: 21 KiB After Width: | Height: | Size: 106 KiB |
@@ -1,7 +1,7 @@
|
||||
import numpy as np
|
||||
import matplotlib.pyplot as plt
|
||||
from matplotlib.colors import ListedColormap
|
||||
from sklearn.cross_validation import train_test_split
|
||||
from sklearn.model_selection import train_test_split
|
||||
from sklearn.preprocessing import StandardScaler
|
||||
from sklearn.datasets import make_moons, make_circles, make_classification
|
||||
from sklearn.neighbors import KNeighborsClassifier
|
||||
|