CER / WER metrics

The atr-ner-eval cer command can be used to compute the Character and Word Error Rates of your automatic workflow globally, or for each semantic category.

Metric description

The Character Error Rate and Word Error Rate are well known metrics for Automatic Text Recognition (ATR) and Automatic Speech Recognition (ASR), based on the Levenshtein edit distance.

The Character Error Rate is the percentage of characters that have been transcribed incorrectly by the automatic workflow. A 5% CER means that the 95 characters out of 100 are correctly transcribed, while 5 are misread (insertion, deletion, substitution).
The Word Error Rate is the percentage of words that have been transcribed incorrectly by the automatic workflow. A 5% WER means that the 95 words out of 100 are correctly transcribed, while 5 are misread (insertion, deletion, substitution).

Parameters

Here are the available parameters for this metric:

Parameter	Description	Type	Default
`--label-dir`	Path to the directory containing BIO label files.	`pathlib.Path`
`--prediction-dir`	Path to the directory containing BIO prediction files.	`pathlib.Path`
`--by-category`	Whether to display CER and WER for each category.	`bool`	`False`

The parameters are also described when running atr-ner-eval cer --help.

Examples

Global evaluation

Use the following command to compute the overall CER and WER:

atr-ner-eval cer --label-dir Simara/labels/ \
                 --prediction-dir Simara/predictions/

It will output the results in Markdown format:

2023-12-13 16:35:05,598 INFO/atr_ner_eval.utils: The dataset is complete.
2023-12-13 16:35:05,828 INFO/atr_ner_eval.cer_wer: CER / WER evaluation table:

| Category | CER (%) | WER (%) | Support |
|:---------|:-------:|:-------:|--------:|
| total    |   6.37  |  17.26  |     804 |

Evaluation for each category

Use the following command to compute CER and WER for each semantic category:

atr-ner-eval cer --label-dir Simara/labels/ \
                 --prediction-dir Simara/predictions/ \
                 --by-category

It will output the results in Markdown format:

2023-12-13 16:35:05,598 INFO/atr_ner_eval.utils: The dataset is complete.
2023-12-13 16:35:05,828 INFO/atr_ner_eval.cer_wer: CER / WER evaluation table:

| Category        | CER (%) | WER (%) | Support |
|:----------------|:-------:|:-------:|--------:|
| analysis        |   9.42  |  23.36  |     780 |
| arrangement     |   8.32  |  15.82  |      79 |
| article_number  |   3.14  |   4.2   |     676 |
| date            |   2.18  |   3.58  |     757 |
| series          |   3.07  |   3.22  |     676 |
| title           |   7.67  |  21.04  |     804 |
| volume_number   |  12.38  |   14.1  |     675 |
| total           |   6.37  |  17.26  |     804 |