Joachim Wagner
CNGL system administrator,
postgraduate research student
School of Computing
Dublin City University
+353 (0)1 700 6915
jwagner@computing.DCU.IE
My office is L2.04 facing the PostDoc lab in the new CNGL area.
My current research is on the probabilistic detection of ungrammatical sentences for computer-assisted language learning (CALL). I aim to develop a new computational linguistics (CL) technology that judges the grammaticality of input using existing probabilistic parsers. In this way, I hope to bring the advantages of these parsers (broad coverage and high robustness) to CALL.
My supervisors are
Past co-supervisor:
Collaborators
The first three years of this research were supported by the Embark Initiative.
Joachim Wagner (to appear 2012, pending approval by the faculty awards board): Detecting Grammatical Errors with Treebank-Induced, Probabilistic Parsers. PhD Thesis, Dublin City University, Dublin, Ireland. Download thesis [PDF, 5.5 MB]
Jennifer Foster, Ozlem Cetinoglu, Joachim Wagner, Joseph Le Roux, Joakim Nivre, Deirdre Hogan and Josef van Genabith (2011): From News to Comment: Resources and Benchmarks for Parsing the Language of Web 2.0. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand.
Jennifer Foster, Ozlem Cetinoglu, Joachim Wagner and Josef van Genabith (to appear Oct 2011): Comparing the use of edited and unedited text in parser self-training. In Proceedings of the 12th International Conference on Parsing Technologies (IWPT 2011), Dublin, Ireland
Jennifer Foster, Ozlem Cetinoglu, Joachim Wagner, Joseph Le Roux and Stephen Hogan (2011): #hardtoparse: POS Tagging and Parsing the Twitterverse. In Proceedings of the Workshop on Analyzing Microtext at the Twenty-Fifth Conference on Artificial Intelligence (AAAI-11), 8 August 2011, Hyatt Regency Hotel, San Francisco
Joachim Wagner and Jennifer Foster (2009): The effect of correcting grammatical errors on parse probabilities. In Proceedings of the 11th International Conference on Parsing Technologies (IWPT'09), Paris, France, 7th-9th October, 2009
Joachim Wagner, Jennifer Foster and Josef van Genabith (2009): Judging Grammaticality: Experiments in Sentence Classification. In CALICO Journal, pages 474-490, volume 26, number 3
Jennifer Foster, Joachim Wagner, and Josef van Genabith (2008): Adapting a WSJ-Trained Parser to Grammatically Noisy Text. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Short Papers, pages 221-224, Columbus, OH, June 15-20, 2008
Deirdre Hogan, Jennifer Foster, Joachim Wagner and Josef van Genabith (2008): Parser-Based Retraining for Domain Adaptation of Probabilistic Generators (Title of early draft: Investigating the Effect of Domain Variation on Generation Performance). In Proceedings of the 5th International Natural Language Generation Conference (INLG08), Salt Fork Park, Ohio, June 12-14, 2008
Jennifer Foster, Joachim Wagner, and Josef van Genabith (2008): Using Decision Trees to Detect and Classify Grammatical Errors. Talk presented jointly by Jennifer and me at the Calico '08 Workshop on Automatic Analysis of Learner Language: Bridging Foreign Language Teaching Needs and NLP Possibilities, University of San Francisco, March 18 and 19, 2008, PDF
Joachim Wagner (2008): Nadja Nesselhauf, Collocations in a Learner Corpus. Book review in Machine Translation Vol 20, No 4, March 2006 [sic], pages 301-303, DOI: 10.1007/s10590-007-9028-8, Draft PDF
Joachim Wagner, Djamé Seddah, Jennifer Foster and Josef van Genabith (2007): C-Structures and F-Structures for the British National Corpus. In Proceedings of the Twelfth International Lexical Functional Grammar Conference (LFG07), pages 418-438, CSLI Publications, Stanford University, July 28-30, 2007, PDF from publisher website, DORAS repository
Joachim Wagner, Jennifer Foster and Josef van Genabith (2007): A Comparative Evaluation of Deep and Shallow Approaches to the Automatic Detection of Common Grammatical Errors. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, June 28-30, 2007 (Extended version presented at the Summer 2007 ParGram meeting in Palo Alto.)
Jennifer Foster, Joachim Wagner, Djamé Seddah and Josef van Genabith (2007): Adapting WSJ-Trained Parsers to the British National Corpus using In-Domain Self-Training. In Proceedings of the 10th International Conference on Parsing Technologies (IWPT 2007), Prague, June 23-24, 2007
Joachim Wagner, Jennifer Foster and Josef van Genabith (2006): Detecting Grammatical Errors Using Probabilistic Parsing. Talk presented by Jennifer at the Workshop on Interfaces of Intelligent Computer-Assisted Language Learning, Ohio State University, December 17, 2006
Gareth J. F. Jones, Michael Burke, John Judge, Anna Khasin, Adenike Lam-Adesina and Joachim Wagner (2005): Dublin City University at CLEF 2004: Experiments in Monolingual, Bilingual and Multilingual Retrieval. In Multilingual Information Access for Text, Speech and Images: 5th Workshop of the Cross-Language Evaluation Forum, Carol Peters, Paul Clough, Julio Gonzalo, G. Jones, M. Kluck and B. Magnini (Eds.), Volume 3491 of Lecture Notes in Computer Science, pages 207-220, Springer, Heidelberg, Germany (in print), 2005.
Petra Ludewig and Joachim Wagner (2004): Collocations - mediating between lexical abstractions and textual concretions. In Proc. of the sixth TALC conference, pages 32-33, Granada, Spain - Handout
Cara Greene, Katrina Keogh, Thomas Koller, Joachim Wagner, Monica Ward and Josef van Genabith (2004): Using NLP Technology in CALL. In NLP and Speech Technologies in Advanced Language Learning Systems - Proc. of InSTIL/ICALL2004 Symposium on Computer Assisted Language Learning, ed. Rodolfo Delmonte, Philippe Delcloque and Sara Tonelli, pages 55-58, Venice, Italy - Handout, more
Joachim Wagner (2004): A false friend exercise with authentic material retrieved from a corpus. In NLP and Speech Technologies in Advanced Language Learning Systems - Proc. of InSTIL/ICALL2004 Symposium on Computer Assisted Language Learning, pages 115-118, Venice, Italy - Poster, more
Monica Ward, Thomas Koller and Joachim Wagner (2003): Integrating Techniques from computational Linguistics into Computer-Assisted Language Learning. Poster presented at the Annual IRCSET Symposium 2003, Dublin, Ireland
Joachim Wagner (2003): Datengesteuerte maschinelle Übersetzung mit flachen Analysestrukturen, Master's thesis, Universität Osnabrück, Germany
Jahn-Takeshi Saito, Joachim Wagner, Graham Katz, Philip Reuter, Michael Burke, and Sabine Reinhard (2002): Evaluation of GermaNet: Problems Using GermaNet for Automatic Word Sense Disambiguation. In Proc. of the LREC Workshop on WordNet Structure and Standardization and how These Affect WordNet Applications and Evaluation, pages 14-29, Las Palmas de Gran Canaria
Norman Kummer and Joachim Wagner (2002): Phrase processing for detecting collocations with KoKS. In online Proc. of Colloc02 Workshop on Computational Approaches to Collocations, http://www.ai.univie.ac.at/colloc02/, Vienna, Austria - more
Arno Erpenbeck, Britta Koch, Norman Kummer, Philip Reuter, Patrick Tschorn and Joachim Wagner (2002): KOKS - Korpusbasierte Kollokationssuche, technical report (Abschlussbericht), Universität Osnabrück, Germany
I was involved in the work on corpus preprocessing in several projects:
| Project | Corpus | Year |
|---|---|---|
| LogoTax | Spiegel | 1999 |
| KoKS | De-News and EuroParl (project's own download) | 2001 |
| My M.A. | Harry Potter 1-4 | 2003 |
| PhD research | leser-service.de (book excerpts) | 2004 |
| DCU CLEF | Newspaper | 2004 |
| PhD research | Glasgow Herald (sample) | 2005 |
| PhD research | EuroParl (OPUS) | 2005 |
| PhD research | Jennifer Foster's error corpus | 2006 |
| PhD research | BNC 1.0 | 2006 |
| PhD research | JPU learner corpus (sample) | 2007 |
| PhD research | PELCRA learner corpus (sample) | 2007 |
| PhD research | Microsoft "ESL 123 Mass Noun Examples" | 2007 |
| PhD research | ICLE learner corpus | 2008 |
| PhD research | WSJ raw sections 26-60 (PTB v0.75) | 2008 |
| PhD research | Gonzaga learner corpus (sample) | 2008 |
2012-01-12T15:47:03+0000
© 2004 - 2012
Joachim Wagner