Next: Introduction
A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors
To appear in:
Proceedings of the joint conference on Empirical Methods in Natural
Language Processing (EMNLP) and on Computational Natural Language
Learning (CoNLL) 2007,
Prague, June 28-30, 2007
--> Resources
Abstract:
This paper compares a deep and a shallow processing approach to the
problem of classifying a sentence as grammatically well-formed or
ill-formed.
The deep processing approach uses the XLE LFG
parser and English grammar:
two versions are
presented, one which uses the XLE directly to perform the classification,
and another one which uses a decision tree trained on features consisting
of the XLE's output statistics.
The shallow processing approach predicts grammaticality based on
n-gram
frequency statistics:
we present two versions, one
which uses frequency thresholds and one which uses a decision tree
trained on the frequencies of the rarest n-grams in the input sentence.
We find that the use of a decision tree improves on the basic approach
only for the deep parser-based approach.
We also show that combining both the shallow and deep decision tree features
is effective.
Our evaluation is carried out using a
large test set of
grammatical and ungrammatical sentences.
The ungrammatical test set is generated automatically
by inserting grammatical errors
into well-formed BNC sentences.
Footnotes
- ... Genabith1
- Also affiliated to IBM CAS, Dublin.
Next: Introduction
jwagner@computing.dcu.ie