Welcome to Wei-Yun Ma's Homepage!                                                                             

                                      

 

                                       Wei-Yun Ma

                                       Ph.D. Candidate

           Computer Science at Columbia University

                                New York City, U.S.A.

 

 

 

I am a fourth-year Ph.D. student in Computer Science at Columbia University.

I work with Prof. Kathy McKeown in the Natural Language Processing Group.

My current research topic is Multi-Engine Machine Translation (MEMT), focusing on

fusing the outputs of MT systems based on syntactic and semantic analyses of the

source and the target language to produce a new, better translation.
 

The papers that I covered for my candidacy exam on MEMT are available here.

 

My research interests include Natural Language Processing (NLP), Machine Translation,
Semantic and Knowledge Representation, Information Retrieval, Speech Processing and
Machine Learning. I can be reached by e-mail at wm2174@columbia.edu.

Background

I come from Taiwan. I received a Bachelor's degree in Computer Science and Engineering from 
Yuan Ze University (YZU), and a Master's degree in Computer Science and Information Engineering from 
National Chiao Tung University (NCTU), where I focused on improving the speed of 
continuous speech recognition. In 2006, I came to the Computer Science department at Columbia University, where
I earned another Master's degree and am now pursuing my Ph.D.
 
I interned at the Industrial Technology Research Institute (ITRI) for one year, developing a 
speech-command-recognition 8051 chip, and worked for five years in the Chinese Knowledge Information Processing Group (CKIP)
at the Institute of Information Science, Academia Sinica, participating in several Chinese NLP projects.

Research Publications

Ma, Wei-Yun and Kathleen McKeown. 2011. System Combination for Machine Translation Based on Text-to-Text Generation.
Proc. Machine Translation Summit XIII.
 
Kristen Parton, Wei-Yun Ma, Kathleen McKeown, and James Allan. 2011. Using Query Time Information to Improve Multilingual 
Search and Response Generation. In Handbook of Natural Language Processing and Machine Translation: 
DARPA Global Autonomous Language Exploitation. Joseph Olive (ed.)
 
Ma, Wei-Yun and Kathleen McKeown. 2009. Where's the Verb? Correcting Machine Translation During Question Answering.
Proc. ACL-IJCNLP.
 
Parton, Kristen, Kathleen R. McKeown, Bob Coyne, Mona T. Diab, Ralph Grishman, 
Dilek Hakkani-Tür, Mary Harper, Heng Ji, Wei-Yun Ma, Adam Meyers, Sara Stolbach, 
Ang Sun, Gokhan Tur, Wei Xu and Sibel Yaman. 2009. 
Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task, 
Proc. ACL-IJCNLP  
 
Huang, Chu-Ren, Wei-Yun Ma, Yi-Ching Wu, and Chih-Ming Chiu, 2006, Knowledge-Rich Approach to Automatic
Grammatical Information Acquisition: Enriching Chinese Sketch Engine with a Lexical Grammar,
presented at The 20th Pacific Asia Conference on Language, Information and Computation. 
 
Ma, Wei-Yun and Chu-Ren Huang, 2006, 中文動詞名物化判斷的統計式模型設計 (The Identification of Nominalizations in
Mandarin Chinese Using Corpus-based Models), Proceedings of Conference on 
Computational Linguistics and Speech Processing (ROCLING 2006).
 
Ma, Wei-Yun and Chu-Ren Huang, 2006, Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus,
presented at Language Resources and Evaluation Conference (LREC 2006).
   
Ma, Wei-Yun and Keh-Jiann Chen, 2005, Design of CKIP Chinese Word Segmentation System,
Chinese and Oriental Languages Information Processing Society, Vol. 14, No. 3, pp. 235-249.
 
Hong, Jia-Fei, Chu-Ren Huang, and Wei-Yun Ma, 2006, 語料庫為本的的兩岸對應詞彙發掘 (Corpus-based Extraction 
of Cross-strait Corresponding Words), presented at the Seventh Chinese Lexical Semantics Workshop (CLSW-7)

Ma, Wei-Yun and Keh-Jiann Chen, 2003, A Bottom-up Merging Algorithm for Chinese Unknown Word Extraction,
Proceedings of ACL, Second SIGHAN Workshop on Chinese Language Processing, pp. 31-38.
   
Ma, Wei-Yun and Keh-Jiann Chen, 2003, Introduction to CKIP Chinese Word Segmentation System for 
the First International Chinese Word Segmentation Bakeoff, Proceedings of ACL, Second SIGHAN Workshop
on Chinese Language Processing, pp. 168-171.
   
Chen, Keh-Jiann, Wei-Yun Ma, 2002, Unknown Word Extraction for Chinese Documents,
Proceedings of COLING 2002, pp. 169-175.
   
Ma, Wei-Yun, Chen Keh-Jiann, 2001, 中文語料庫構建及管理系統設計 (Construction and Management for 
Chinese Corpus), Proceedings of Conference on Computational Linguistics and Speech Processing (ROCLING 2001), 
pp. 175-191.