Hany Hassan

 

 

 

 

Research

 

Publications

 

Personal

 

Contact

 

 

 

 

 

I moved to Microsoft Research, Redmond, WA.

This site is no longer updated, my contact is: hany dot hassan2 at mail dot dcu dot ie

I have finished my PhD at Dublin City University with my advisors   Prof. Andy Way. and   Dr. Khalil Sima'an. My Thesis is entitled "Lexical Syntax for Statistical Machine Translation". The thesis has been selected as the winner of the 2009 LRC Best Thesis Award.

.

My Resume is here

 Research Interests

My research interests are broadly in Knowledge Management and Natural Language Processing. In Particular, I am interested in developing Machine Learning and Statistical Modeling techniques for advancing the technologies of Statistical Machine Translation, Information Extraction, Unstructured Data Management and Business Intelligence using unstructured data. .

Patents and Publications:

Thesis:

Lexical Syntax for Statistical Machine Translation, PhD thesis. pdf 

Patents Pending :

Detecting and Predicting Anomalous Events, Patent Pending. Filed by IBM in 2008

Method and System for Access of Multilingual Textual Resources using Conceptual Representation Matching , Patent Pending. Filed by IBM in 2007

Method and System for Detecting Anomalous Behavior in Business Process Performance, Patent pending. Filed by IBM in 2007

Method and System for Extracting and Visualizing Graph-Structured Relations from Unstructured Text, Patent pending. Filed by IBM in 2005

Method and System for Automatically Generating Multilingual Electronic Content from Unstructured Data, Patent pending. Filed by IBM in 2005

Publications:

Journal Papers:

Syntactically Lexicalized Phrase-Based Statistical Translation. Hany Hassan, Khalil Sima'an and Andy Way, In IEEE Transactions on Audio, Speech and Language Processing. September 2008.

Conference Papers:

Hany Hassan, Khalil Sima'an and Andy Way. A Syntactified Direct Translation Model with Linear-time Decoding. To appear  in EMNLP 2009

Hany Hassan, Khalil Sima'an and Andy Way. A Syntactic Language Model based on Incremental CCG Parsing. In Proceedings IEEE Workshop on Spoken Language Technology (SLT) 2008, Goa, India

Language Independent Text Correction using Finite State Automata . Ahmed Hassan, Sara Noeman, and Hany Hassan, Proceedings of the 2008 International Joint Conference on Natural Language Processing (IJCNLP, 2008).

Improving Named Entity Translation by Exploiting Comparable and Parallel Corpora . Ahmed Hassan, Haytham Fahmy, and Hany Hassan. Proceedings of the 2007 Conference on Recent Advances in Natural Language Processing (RANLP, 2007), AMML Workshop.

MaTrEx: the DCU Machine Translation System for IWSLT 2007. Hassan, H., Y. Ma and A. Way. 2007. In Proceedings of the International Workshop on Spoken Language Translation, Trento, Italy

Supertagged Phrase-Based Statistical Machine Translation , Hany Hassan , Khalil Sima'an and Andy Way , ACL 2007 , Prague

Arabic Cross-Document Person Name Normalization , Walid Magdy, Kareem Darwish, Ossama Emam and Hany Hassan , Semitic Languages workshop - ACL 2007 , Prague

BioNoculars: Extracting Protein-Protein Interactions from Biomedical Text , Amgad Madkour, Kareem Darwish, Hany Hassan, Ahmed Hassan, Ossama Emam , BioNLP workshop - ACL 2007 , Prague

Syntactic Phrase-Based Statistical Machine Translation, Hany Hassan, Mary Hearne, Andy Way and Khalil Sima'an. 2006 , In Proceedings of the IEEE 2006 Workshop on Spoken Language Translation, Palm Beach, Aruba (to appear).

Unsupervised Information Extraction Approach Using Graph Mutual Reinforcement, Hany Hassan , Ahmed Hassan and Ossama Emam , EMNLP 2006

Graph Based Semi-Supervised Approach for Information Extraction, Hany Hassan , Ahmed Hassan and Sara Noeman , TextGraphs Workshop - HLT/NAACL 2006

An Integrated Approach for Arabic-English Named Entity Translation, ACL 2005, Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages

Hany Hassan and Jeffrey Sorensen:  

Examining the Effect of Improved Context Sensitive Morphology on Arabic Information Retrieval, Kareem Darwish, Hany Hassan and Ossama Emam: , ACL 2005, Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages  

A Statistical Model for Multilingual Entity Detection and Tracking. , Radu Florian , Hany Hassan, Abraham Ittycheriah, Hongyan Jing, Xiaoqiang Luo, Nicolas Nicolov and Salim Roukos , HLT-NAACL 2004: 1-8 

Language Model Based Arabic Word Segmentation. Young-Suk Lee, Kishore Papineni, Salim Roukos, Ossama Emam, Hany Hassan:, ACL 2003: 399-406

TIPS: A Translingual Information Processing System. Yaser Al-Onaizan, Radu Florian, Martin Franz, Hany Hassan, Young-Suk Lee, J. Scott McCarley, Kishore Papineni, Salim Roukos, Jeffrey Sorensen, Christoph Tillmann, Todd Ward, Fei Xia, HLT-NAACL 2003