This section talks about the entries of a dictionary.
Entries are the elements that define each one of the lexical forms that are going to constitute a dictionary and complement the basic resources used by MeaningCloud's APIs. With a new dictionary you can add semantic information to entities/concepts already detected by an API or complete the resources to improve the recall of the application in specific domains.
When you access a dictionary, you will be able to add new entries just by introducing their form
, which is the only required field for the element. There are two other fields that can be specified: entry type
and ontology type
.
The first, entry type
defines if the entry is going to be an entity or a concept. Entries with no entry type
will be considered entities by default.
The difference between these two elements is directly related to the interpretation we are going to do of the data and how it relates to our ontology. In general, you could say that entities correspond to named entities (proper nouns, etc.) while concepts are keywords of the domain. With relation to the ontology, entites are instances of a type defined in it, while concepts are subclasses of a type of the ontology.
If we use the banking domain as an example, we would define "savings account" and "current account" as concepts of the domain, while specific names of products provided by different banks would be considered entities, for instance "Cuenta Nómina", "ISA" or "Cash Card Account".
Another difference between entities and concepts is how they are detected by the APIs, more specifically, the basic variants used to detect their appearances in the text. The variants associated to concepts have more to do with morphology while entities are related to common heuristics associated to the most common types of named entities.
The second field, ontology type
, selects the node in the ontology the entry will belong to. By default, every entry is inside Top, the parent node of the ontology. As can be seen in the image, there are several default values that can be chosen. Those values are the most common ones from our ontology.
There's also the possibility ofwriting your own ontology node through the 'Write your own value' option inside My Types. Every node created by the user will be added as a child of the node Top. Hierarchy is represented by the character >
. For instance, to define a node Characters inside the node Person, you would have to create an entry as follows:
Once you have created an entry with your own ontology_type
value, these values will be listed in the menu so you can select them when you create more entries in the dictionary. This way you will avoid having to write the new ontology_type
value every time you add a new entry.
These two fields are a way to edit the semantic information associated to the entry. The information saved in the dictionary is the semantic information; these fields provide shortcuts to edit easily the most used fields in it.
This is what you will see when you add a new entry:
If one of the values specified is not correct, you will see an error message specifying why the entry cannot be created. There are two types of errors:
form
which cannot be empty and is limited to 255 characters.
For each entry created you will see its basic information: its form, entry type, last node of the ontology type and when it was last edited.
In the first column of the table there are two actions associated to the entry:
Every entry created will be shown in this table, where you will be able to select how many entries to show in each page, order them by any of the columns and filter them dynamically by text appearance. This dynamic filtering will also include the aliases
field, even though it is not shown in the summary.
To modify any field of an entry, you will have to access its editing view. From there, you can modify all of its information.
Both the sidebar and the main panel feature a question mark, , which shows a small tour that explains briefly each field.
The fields are divided in two sections:
The basic fields of an entry are ID
, Form
, Aliases
and Sense ID
, and only Form
is mandatory when creating an entry.
ID
value.
There's an additional option associated to this field that allows you to choose to use only the senses specified in the user dictionary.
If an entry has a sense in the user dictionary and several other senses in the basic resources provided, if the check is selected those senses from the basic resources will be ignored, and only the ones from user dictionary will be used.
This option is coded in the dictionary by adding a dash to the sense ID. This dash will not be show in the editing interface, but it will appear in the result of exporting a dictionary.
At the bottom of the Basic information there's a section with some advanced settings that can be added to the entry. This section will appear collapsed if the default values are used.
There are three fields available in this section: