next up previous
Next: Introduction

A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors

Joachim Wagner, Jennifer Foster, and Josef van Genabith1

National Centre for Language Technology

School of Computing, Dublin City University, Dublin 9, Ireland

To appear in: Proceedings of the joint conference on Empirical Methods in Natural Language Processing (EMNLP) and on Computational Natural Language Learning (CoNLL) 2007, Prague, June 28-30, 2007

--> Resources

Abstract:

This paper compares a deep and a shallow processing approach to the problem of classifying a sentence as grammatically well-formed or ill-formed. The deep processing approach uses the XLE LFG parser and English grammar: two versions are presented, one which uses the XLE directly to perform the classification, and another one which uses a decision tree trained on features consisting of the XLE's output statistics. The shallow processing approach predicts grammaticality based on n-gram frequency statistics: we present two versions, one which uses frequency thresholds and one which uses a decision tree trained on the frequencies of the rarest n-grams in the input sentence. We find that the use of a decision tree improves on the basic approach only for the deep parser-based approach. We also show that combining both the shallow and deep decision tree features is effective. Our evaluation is carried out using a large test set of grammatical and ungrammatical sentences. The ungrammatical test set is generated automatically by inserting grammatical errors into well-formed BNC sentences.



Footnotes

... Genabith1
Also affiliated to IBM CAS, Dublin.



next up previous
Next: Introduction
jwagner@computing.dcu.ie