# Consuming Senpy services

This short tutorial will teach you how to consume services in several ways, taking advantage of the features of the framework.

In particular, we will cover:

* Annotating text with sentiment
* Annotating text with emotion
* Getting results in different formats (Turtle, XML, text...)
* Asking for specific emotion models (automatic model conversion)
* Listing available services in an endpoint
* Switching to different services
* Calling multiple services in the same request (Pipelines)

The latest version of this IPython notebook is available at: https://github.com/gsi-upm/senpy/tree/master/docs/Quickstart.ipynb

## Requirements

For the sake of simplicity, this tutorial will use the demo server: http://senpy.gsi.upm.es:

In [None]:
endpoint = 'http://senpy.gsi.upm.es/api'

This server runs some open source plugins for sentiment and emotion analysis.

The HTTP API of Senpy can be queried with your favourite tool.
This is just an example using curl:

```bash
curl "http://senpy.gsi.upm.es/api/sentiment140" --data-urlencode "input=Senpy is awesome"
```
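
For readers who prefer to stay in Python, here is a minimal sketch of a comparable request using only the standard library. It assumes that `curl --data-urlencode` results in a form-encoded POST, and it only builds the request without sending it; the exact percent-encoding that curl produces may differ slightly.

```python
from urllib.parse import urlencode
from urllib.request import Request

url = "http://senpy.gsi.upm.es/api/sentiment140"

# Form-encode the single "input" field, as curl does with --data-urlencode.
body = urlencode({"input": "Senpy is awesome"}).encode("utf-8")

# Build (but do not send) the POST request; urllib.request.urlopen(req) would send it.
req = Request(url, data=body, method="POST")

print(req.get_method(), req.full_url)
print(body.decode("utf-8"))
```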

For simplicity, this tutorial uses the `requests` library. We will also define a helper that adds syntax highlighting to the JSON-LD/Turtle results:

In [None]:
try:
    from IPython.display import Code
    def pretty(txt, language='json-ld'):
        return Code(txt, language=language)
except ImportError:
    def pretty(txt, **kwargs):
        print(txt)

Once you're familiar with Senpy, you can deploy your own instance quite easily, e.g. using Docker:

```bash
docker run -ti --name 'SenpyEndpoint' -d -p 5000:5000 gsiupm/senpy
```

Then, feel free to change the endpoint variable to run the examples in your own instance.
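
As a rough sketch, switching to a local instance could look like the snippet below. The `SENPY_ENDPOINT` environment variable is a hypothetical name chosen for this example, and the default URL assumes that the container started above listens on port 5000 and serves the API under `/api`, like the demo server.

```python
import os

# SENPY_ENDPOINT is a hypothetical variable name used only in this sketch;
# Senpy itself does not read it.
# The fallback assumes a local container on port 5000 with the API under /api.
endpoint = os.environ.get("SENPY_ENDPOINT", "http://localhost:5000/api")

print(endpoint)
```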

## Sentiment Analysis of Text

To start, let us analyse the sentiment in the following sentence: *Senpy is awesome*.

For now, we will use the [sentiment140](http://www.sentiment140.com/) service, through the sentiment140 plugin.
We will later cover how to use a different service.


In [None]:
import requests
res = requests.get(f'{endpoint}/sentiment140',
                   params={"input": "Senpy is awesome",})
pretty(res.text)

Senpy services always return an object of type `senpy:Results`, with a list of entries.
You can think of an entry as a self-contained textual context (`nif:Context` and `senpy:Entry`).
Entries can be as short as a sentence, or as long as a news article.

Each entry has a `nif:isString` property that contains the original text of the entry, and several other properties that are provided by the plugins.

For instance, sentiment annotations are provided through `marl:hasOpinion`.
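
To make this structure concrete, the following sketch walks a hand-written sample that mimics the shape described above. The sample is an illustration, not real service output, and `polarities` is a hypothetical helper name.

```python
import json

# Hand-written sample that mimics the described shape; not real service output.
sample = json.loads("""
{
  "@type": "Results",
  "entries": [
    {
      "@type": "Entry",
      "nif:isString": "Senpy is awesome",
      "marl:hasOpinion": [
        {"marl:hasPolarity": "marl:Positive"}
      ]
    }
  ]
}
""")


def polarities(results):
    """Return one (text, [polarity, ...]) pair per entry."""
    pairs = []
    for entry in results.get("entries", []):
        text = entry.get("nif:isString")
        labels = [opinion.get("marl:hasPolarity")
                  for opinion in entry.get("marl:hasOpinion", [])]
        pairs.append((text, labels))
    return pairs


print(polarities(sample))
```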

The annotations are semantic.
We can ask Senpy for the expanded JSON-LD output to reveal the full URIs of each property and entity:

In [None]:
import requests
res = requests.get(f'{endpoint}/sentiment140',
                   params={"input": "Senpy is awesome",
                           "expanded": True})
pretty(res.text)

## Other output formats

Senpy supports several semantic formats, such as Turtle and RDF/XML.
You can select the format of the output with the `outformat` parameter:

In [None]:
res = requests.get(f'{endpoint}/sentiment140',
                   params={"input": "Senpy is the best framework for semantic sentiment analysis, and very easy to use",
                            "outformat": "turtle"})
pretty(res.text, language='turtle')

## Selecting fields from the output

The full output in the previous sections is very useful because it is semantically annotated.
However, it is also quite verbose if we only want to label a piece of text, or get a polarity value.

For such simple cases, the API has a special `fields` parameter that you can use to select specific fields from the results, and even to transform them. Senpy uses JMESPath under the hood, which has its own notation.

To illustrate this, let us get only the text (`nif:isString`) from each entry:

In [None]:
res = requests.get(f'{endpoint}/sentiment140',
                   params={"input": "Senpy is a wonderful service",
                            "fields": 'entries[]."nif:isString"'})
print(res.text)

["Senpy is a wonderful service"]


Or we could get both the text and the polarity of the text (assuming there is only one opinion per entry) with a slightly more complicated query:

In [None]:
res = requests.get(f'{endpoint}/sentiment140',
                   params={"input": "Senpy is a service. Wonderful service.",
                           "delimiter": "sentence",
                           "fields": 'entries[0].["nif:isString", "marl:hasOpinion"[0]."marl:hasPolarity"]'})
print(res.text)

["Senpy is a service. Wonderful service.", "marl:Neutral"]


JMESPath is too extensive to cover in full in this tutorial. We will cover only the simplest cases, so you do not need to learn much about the notation.

For more complicated transformations, check out [jmespath](http://jmespath.org).
In addition to fairly complete documentation, the site offers a live environment that you can use to test your queries.
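
If the notation is unfamiliar, it may help to see roughly what the two queries used in this section select, written as plain Python. This sketch does not use JMESPath at all, and the data is a hand-written sample rather than real output.

```python
# Hand-written sample with the same keys used in the queries above; not real output.
results = {
    "entries": [
        {
            "nif:isString": "Senpy is a wonderful service",
            "marl:hasOpinion": [{"marl:hasPolarity": "marl:Positive"}],
        }
    ]
}

# Roughly what  entries[]."nif:isString"  selects: the text of every entry.
texts = [entry["nif:isString"]
         for entry in results["entries"]
         if "nif:isString" in entry]

# Roughly what  entries[0].["nif:isString", "marl:hasOpinion"[0]."marl:hasPolarity"]
# selects: a two-item list built from the first entry only.
first = results["entries"][0]
text_and_polarity = [
    first["nif:isString"],
    first["marl:hasOpinion"][0]["marl:hasPolarity"],
]

print(texts)
print(text_and_polarity)
```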

## Emotion analysis


Senpy uses the `onyx` vocabulary to represent emotions. Onyx incorporates the notion of an `EmotionSet`: an emotion that is composed of several emotions.
In a nutshell, an `Entry` is linked to one or more `EmotionSet`s, each of which is made up of one or more `Emotion`s.

Let's illustrate it with an example, using the `emotion-depechemood` plugin.

In [None]:
res = requests.get(f'{endpoint}/emotion-depechemood',
                   params={"input": "Senpy is a wonderful service"})
pretty(res.text)

As you have probably noticed, there are several emotions in this result, each with a different intensity.

We can also tell Senpy to return only the emotion with the maximum intensity, using the `maxemotion` parameter:

In [None]:
res = requests.get(f'{endpoint}/emotion-depechemood',
                   params={"input": "Senpy is a wonderful service",
                           "maxemotion": True})
pretty(res.text)
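
The same selection can also be done on the client side. The sketch below is only an illustration of the idea, not necessarily how `maxemotion` is implemented in Senpy; the category names and intensities in the sample are made up, and `max_emotion` is a hypothetical helper.

```python
# Hand-written sample following the Entry -> EmotionSet -> Emotion structure.
# The categories and intensities are made up for illustration.
entry = {
    "nif:isString": "Senpy is a wonderful service",
    "onyx:hasEmotionSet": [
        {
            "onyx:hasEmotion": [
                {"onyx:hasEmotionCategory": "example:joy",
                 "onyx:hasEmotionIntensity": 0.6},
                {"onyx:hasEmotionCategory": "example:fear",
                 "onyx:hasEmotionIntensity": 0.1},
                {"onyx:hasEmotionCategory": "example:sadness",
                 "onyx:hasEmotionIntensity": 0.3},
            ]
        }
    ],
}


def max_emotion(emotion_set):
    """Return the emotion with the highest intensity, or None if the set is empty."""
    return max(
        emotion_set.get("onyx:hasEmotion", []),
        key=lambda emotion: emotion.get("onyx:hasEmotionIntensity", 0.0),
        default=None,
    )


strongest = [max_emotion(emotion_set) for emotion_set in entry["onyx:hasEmotionSet"]]
print(strongest)
```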

We can combine this feature with the `fields` parameter to get only the label and the intensity:

In [None]:
res = requests.get(f'{endpoint}/emotion-depechemood',
                   params={"input": "Senpy is a wonderful service",
                           "fields": 'entries[]."onyx:hasEmotionSet"[]."onyx:hasEmotion"[]["onyx:hasEmotionCategory","onyx:hasEmotionIntensity"]',
                           "maxemotion": True})
pretty(res.text)

## Emotion conversion

If the model used by a plugin is not right for your application, you can ask for a specific emotion model in your request.

Senpy ships with emotion conversion capabilities, and it will try to automatically convert the results.

For example, the `emotion-anew` plugin uses the dimensional `pad` (or VAD, valence-arousal-dominance) model, as we can see here:

In [None]:
res = requests.get(f'{endpoint}/emotion-anew',
                   params={"input": "Senpy is a wonderful service and I love it"})
print(res.text)

{
  "@context": "http://senpy.gsi.upm.es/api/contexts/YXBpL2Vtb3Rpb24tYW5ldz9pbnB1dD1TZW5weStpcythK3dvbmRlcmZ1bCtzZXJ2aWNlK2FuZCtJK2xvdmUraXQj",
  "@type": "Results",
  "entries": [
    {
      "@id": "prefix:",
      "@type": "Entry",
      "marl:hasOpinion": [],
      "nif:isString": "Senpy is a wonderful service and I love it",
      "onyx:hasEmotionSet": [
        {
          "@id": "Emotions0",
          "@type": "EmotionSet",
          "onyx:hasEmotion": [
            {
              "@id": "Emotion0",
              "@type": "Emotion",
              "http://www.gsi.dit.upm.es/ontologies/onyx/vocabularies/anew/ns#arousal": 6.44,
              "http://www.gsi.dit.upm.es/ontologies/onyx/vocabularies/anew/ns#dominance": 7.11,
              "http://www.gsi.dit.upm.es/ontologies/onyx/vocabularies/anew/ns#valence": 8.72,
              "prov:wasGeneratedBy": "prefix:Analysis_1554364675.1427004"
            }
          ],
          "prov:wasGeneratedBy": "prefix:Analysis_1554364675.142700

If we need categorical labels instead, we can ask for the equivalent results in the `big6` model:

In [None]:
res = requests.get(f'{endpoint}/emotion-anew',
                   params={"input": "Senpy is a wonderful service and I love it",
                           "emotion-model": "emoml:big6"})
pretty(res.text)

Depending on how much of the original emotion we want to keep, the conversion can be presented in three ways:

* full: the original and converted emotions are included at the same level
* filtered: the original emotion is replaced by the converted emotion
* nested: the original emotion is replaced, but the converted emotion points to it
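
As a small sketch, the three modes listed above can be turned into query strings as follows. The parameter names `emotion-model` and `conversion` are the ones used in the requests of this section; `conversion_query` is a hypothetical helper, and nothing is sent to the server.

```python
from urllib.parse import urlencode

# The three presentation modes listed above.
CONVERSIONS = ("full", "filtered", "nested")


def conversion_query(text, model, conversion):
    """Build the query string for an emotion conversion request."""
    if conversion not in CONVERSIONS:
        raise ValueError(f"unknown conversion mode: {conversion!r}")
    return urlencode({"input": text,
                      "emotion-model": model,
                      "conversion": conversion})


for mode in CONVERSIONS:
    print(conversion_query("Senpy is a wonderful service and I love it",
                           "emoml:big6", mode))
```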

For example, here's what the `nested` structure would look like:

In [None]:
res = requests.get(f'{endpoint}/emotion-anew',
                   params={"input": "Senpy is a wonderful service and I love it",
                           "emotion-model": "emoml:big6",
                          "conversion": "nested"})
pretty(res.text)

Again, for completeness, we could get only the label with the `fields` parameter:

In [None]:
res = requests.get(f'{endpoint}/emotion-anew',
                   params={"input": "Senpy is a wonderful service and I love it",
                           "emotion-model": "emoml:big6",
                           "fields": 'entries[].[["nif:isString","onyx:hasEmotionSet"[]."onyx:hasEmotion"[]."onyx:hasEmotionCategory"][]][]',
                           "conversion": "filtered"})
pretty(res.text)

## Built-in client

The built-in senpy client allows you to query any Senpy endpoint. We will illustrate how to use it with the public demo endpoint, and then show you how to spin up your own endpoint using docker.

## Building pipelines

You can query several senpy services in the same request.
This feature is called pipelining, and the result of combining several plugins in a request is called a pipeline.

The simplest way to use pipelines is to add every plugin you want to use to the URL, separated by either a slash or a comma.

For instance, to get sentiment (`sentiment140`) and emotion (`emotion-depechemood`) annotations at the same time:

In [None]:
res = requests.get(f'{endpoint}/sentiment140/emotion-depechemood',
                   params={"input": "Senpy is a wonderful service"})
pretty(res.text)
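
The URL convention described above (plugin names separated by a slash or a comma) can be sketched as a small helper. `pipeline_url` is a hypothetical function written for this tutorial, not part of Senpy.

```python
def pipeline_url(endpoint, plugins, separator="/"):
    """Join plugin names into a pipeline URL under the given endpoint."""
    if separator not in ("/", ","):
        raise ValueError("plugins must be separated by a slash or a comma")
    if not plugins:
        raise ValueError("a pipeline needs at least one plugin")
    return endpoint.rstrip("/") + "/" + separator.join(plugins)


print(pipeline_url("http://senpy.gsi.upm.es/api",
                   ["sentiment140", "emotion-depechemood"]))
print(pipeline_url("http://senpy.gsi.upm.es/api",
                   ["sentiment140", "emotion-depechemood"], separator=","))
```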

In a Senpy pipeline, the call is processed by each plugin in sequence.
The output of a plugin is used as input for the next one.
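
The toy sketch below illustrates this chaining idea with two stand-in steps. It is not how Senpy implements plugins: `toy_split` splits naively on sentence punctuation, `toy_sentiment` is a keyword lookup rather than a real model, and all three function names are hypothetical.

```python
import re


def toy_split(results):
    """Replace each entry with one entry per sentence (naive punctuation split)."""
    entries = []
    for entry in results["entries"]:
        sentences = re.findall(r"[^.!?]+[.!?]?", entry["nif:isString"])
        entries.extend({"nif:isString": s.strip()} for s in sentences if s.strip())
    return {"entries": entries}


def toy_sentiment(results):
    """Attach a placeholder polarity to every entry (keyword lookup, not a model)."""
    for entry in results["entries"]:
        positive = "awesome" in entry["nif:isString"].lower()
        label = "marl:Positive" if positive else "marl:Neutral"
        entry["marl:hasOpinion"] = [{"marl:hasPolarity": label}]
    return results


def run_pipeline(text, steps):
    """Feed the output of each step into the next one, in order."""
    results = {"entries": [{"nif:isString": text}]}
    for step in steps:
        results = step(results)
    return results


out = run_pipeline("Senpy is awesome. And services are composable.",
                   [toy_split, toy_sentiment])
for entry in out["entries"]:
    print(entry["nif:isString"], entry["marl:hasOpinion"][0]["marl:hasPolarity"])
```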

Pipelines take the same parameters as the plugins they are made of.
For example, if we want to split the original sentence before analysing its sentiment, we can use a pipeline made out of the `split` and the `sentiment140` plugins.

`split` takes an extra parameter (`delimiter`) to select the type of splitting (by sentence or by paragraph), and `sentiment140` takes a `language` parameter.

This is what the request looks like:

In [None]:
res = requests.get(f'{endpoint}/split/sentiment140',
                 params={"input": "Senpy is awesome. And services are composable.", 
                         "delimiter": "sentence",
                         "language": "en",
                         "outformat": "json-ld"})
pretty(res.text)

As you can see, `split` creates two new entries, which are also annotated by `sentiment140`.

Once again, we could use the `fields` parameter to get a list of strings and labels:

In [None]:
res = requests.get(f'{endpoint}/split/sentiment140',
                 params={"input": "Senpy is awesome. And services are composable.", 
                         "delimiter": "sentence",
                         "fields": 'entries[].[["nif:isString","marl:hasOpinion"[]."marl:hasPolarity"][]][]',
                         "language": "en",
                         "outformat": "json-ld"})
pretty(res.text)

## Evaluation

Sentiment analysis plugins can also be evaluated on a series of pre-defined datasets, using the `gsitk` tool.

For instance, to evaluate the `sentiment-vader` plugin on the `vader` and `sts` datasets, we would simply call:

In [None]:
res = requests.get(f'{endpoint}/evaluate',
                   params={"algo": "sentiment-vader",
                           "dataset": "vader,sts",
                           'outformat': 'json-ld'
                          })
pretty(res.text)

The same results can be visualized as a table in the Web interface:

![](evaluation-results.png)

**Note**: to evaluate a plugin on a dataset, Senpy needs to predict the labels of the entries using the plugin.
This process might take a long time for plugins that use an external service, such as `sentiment140`.

## Advanced topics

### Verbose output

By default, Senpy does not include information that might be too verbose, such as the parameters that were used in the analysis.

You can instruct Senpy to provide more verbose output with the `verbose` parameter:

In [None]:
import requests
res = requests.get(f'{endpoint}/sentiment140',
                   params={
                       "input": "Senpy is the best framework for semantic sentiment analysis, and very easy to use",
                       "verbose": True}).text
pretty(res)

### Getting help

In [None]:
import requests
res = requests.get(f'{endpoint}/',
                   params={
                       "help": True}).text
pretty(res)

### Ignoring the context

In [None]:
import requests
res = requests.get(f'{endpoint}/',
                   params={
                       "input": "This will tell senpy to only include the context in the headers",
                       "inheaders": True})
pretty(res.text)

To retrieve the context URI, use the `Link` header:

In [None]:
print(res.headers['Link'])

<http://senpy.gsi.upm.es/api/contexts/YXBpLz9pbnB1dD1UaGlzK3dpbGwrdGVsbCtzZW5weSt0bytvbmx5K2luY2x1ZGUrdGhlK2NvbnRleHQraW4rdGhlK2hlYWRlcnMmaW5oZWFkZXJzPVRydWUj>;rel="http://www.w3.org/ns/json-ld#context"; type="application/ld+json"
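
To extract the bare URI from a header of this shape, a small parser such as the one below could be used. This is a minimal sketch with a regular expression; `context_from_link` is a hypothetical helper, and the header in the example is a shortened, made-up value in the same format as the output above.

```python
import re

JSONLD_CONTEXT_REL = 'rel="http://www.w3.org/ns/json-ld#context"'


def context_from_link(link_header):
    """Return the URI whose rel is the JSON-LD context relation, or None."""
    # Each link looks like: <uri>; param="value"; param="value"
    for match in re.finditer(r"<([^>]*)>([^<]*)", link_header):
        uri, params = match.group(1), match.group(2)
        if JSONLD_CONTEXT_REL in params:
            return uri
    return None


# Shortened, made-up header in the same shape as the output above.
header = ('<http://senpy.gsi.upm.es/api/contexts/EXAMPLE>;'
          'rel="http://www.w3.org/ns/json-ld#context"; type="application/ld+json"')
print(context_from_link(header))
```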
