This section talks about the categories of a model, the elements that define your taxonomy, and the one with which you will work more often.
Categories are the elements that determine how a text is going to be classified once the model is completely defined, and what you should modify and tune when any mistake is detected during the quality assessment process.
When you access a model, you will be able to add new categories by introducing a code and a label. The category created will be empty, except for the two fields defined. This is what you will see when you add a category with "001" as code and "My first category" as label:
If one of the values specified is not correct, you will see an error message specifying why the category could not be created. There are three types of errors:
For each category created you will see a small summary of its contents: the number of each type of terms in it, and the number of words in the training text. The status included in the last column will indicate if the category was correctly defined the last time the model was built.
In the first column of the table there are two actions associated to the category:
Every category created will be shown in this table, where you will be able to select how many categories to show in each page, order them by code or by label and filter them dynamically by text appearance.
This view also shows different statistics of the model's categories, like the number of each type of terms or the number of words in the training text.
To modify any field of a category, you will have to access its editing view. From there, you can modify its basic attributes, define rules for classification and add training texts for the statistical classification.
The eight fields available will enable you to completely define every category of a model. The fields are divided in three sections:
The basic fields of a category are Code and Label, and they are the only mandatory ones when creating a category.
There's a third field, Description, which allows you to add free text to describe the criteria defined for the category definition. It's limited to 1024 characters.
The fields related to rules definition are explained in the term rules section. The fields related to models with a statistical component are detailed in the combine training text and rules section.