Before diving into the details of this doc, we strongly recommend that you learn some important concepts about system analyses.
Grammatical error correction is the task of correcting different kinds of errors in text, such as spelling errors. If you're interested in what datasets for this task look like, you can run the following code after installing DataLab:
```python
from datalabs import load_dataset

dataset = load_dataset("gaokao2018_np1", "writing_grammar")
print(dataset['test'][0])
```
In what follows, we will describe how to analyze grammatical error correction systems.
- (1) `datalab`: if your dataset is already supported by DataLab, you fortunately don't need to prepare the dataset yourself.
In order to perform an analysis of your results, your system outputs should be arranged into the following format:
```json
[
  {
    "predicted_edits": {
      "start_idx": [8, 17, 39],
      "end_idx": [8, 18, 40],
      "corrections": [
        ["the"],
        ["found"],
        ["other"]
      ]
    }
  }
]
```
where
- `len(start_idx) == len(end_idx) == len(corrections)`, representing how many corrections your system thinks should be made
- the pair (`start_idx[i]`, `end_idx[i]`) indicates where in the original text the i-th correction should be applied
- the value `corrections[i]` tells what correction should be made
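To make the meaning of these fields concrete, here is a minimal Python sketch of how such edits could be applied to a tokenized sentence. This is not part of ExplainaBoard; the example sentence, the whitespace tokenization, and the half-open span convention (where `start_idx[i] == end_idx[i]` denotes an insertion) are assumptions for illustration only.

```python
# A minimal sketch (not part of ExplainaBoard) of applying predicted edits to
# a tokenized sentence. The example sentence, whitespace tokenization, and the
# half-open span convention (start == end denotes an insertion) are assumed.

def apply_edits(tokens, start_idx, end_idx, corrections):
    """Apply corrections from right to left so earlier indices stay valid."""
    tokens = list(tokens)
    for s, e, c in sorted(zip(start_idx, end_idx, corrections), reverse=True):
        # Replace the token span tokens[s:e] with the correction tokens;
        # when s == e, nothing is removed and the correction is inserted.
        tokens[s:e] = c
    return tokens

tokens = "I have founded a solution".split()
fixed = apply_edits(tokens, start_idx=[2], end_idx=[3], corrections=[["found"]])
print(" ".join(fixed))  # -> I have found a solution
```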
Let's say we have one system output file: `./integration_tests/artifacts/gaokao/rst_2018_quanguojuan1_gec.json`.
The below example loads the `gaokao2018_np1` dataset (with the sub-dataset name `writing-grammar`) from DataLab:
```shell
explainaboard --task grammatical-error-correction --dataset gaokao2018_np1 --sub-dataset writing-grammar --metrics SeqCorrectScore --system-outputs ./integration_tests/artifacts/gaokao/rst_2018_quanguojuan1_gec.json > report.json
```
where
- `--task`: denotes the task name; you can find all supported task names here
- `--system-outputs`: denotes the path of system outputs; multiple ones should be separated by spaces, for example, system1 system2
- `--dataset`: denotes the dataset name
- `--sub-dataset`: denotes the sub-dataset name
- `--metrics`: represents the metrics being used for evaluation
- `report.json`: the generated analysis file in JSON format. Tip: use a JSON viewer like this one for better interpretation.
Alternatively, you can load the dataset from an existing file using the `--custom-dataset-paths` option.
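For example, such an invocation might look like the following. This is only a hedged sketch: the file paths are hypothetical placeholders, and the expected format of the custom dataset file should be checked against the ExplainaBoard documentation.

```shell
# Hypothetical paths for illustration; substitute your own dataset and output files.
explainaboard --task grammatical-error-correction --custom-dataset-paths ./my_gec_dataset.json --system-outputs ./my_gec_output.json --metrics SeqCorrectScore > report.json
```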