<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4671480241746391796</id><updated>2011-09-20T10:37:42.615-07:00</updated><category term='system'/><category term='LM'/><category term='sttk'/><category term='asr'/><category term='mteval'/><category term='speech'/><category term='TM'/><category term='language'/><category term='parsing'/><category term='ebmt'/><category term='syntax'/><category term='M45'/><category term='mt'/><category term='meteor'/><category term='xfer'/><category term='discriminative'/><category term='memt'/><title type='text'>MT Lunch Seminar (LTI, CMU)</title><subtitle type='html'>An informal discussion series for researchers from the Machine Translation groups at the Language Technologies Institute, Carnegie Mellon University</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>54</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-3833236152394934287</id><published>2011-09-07T16:35:00.000-07:00</published><updated>2011-09-07T16:35:29.124-07:00</updated><title type='text'>Training Machine Translation with a Second-Order Taylor Approximation of Weighted Translation Instances</title><content type='html'>Title: Training Machine Translation with a Second-Order Taylor Approximation of Weighted Translation Instances&lt;br /&gt;Speaker:   Aaron Phillips&lt;br /&gt;When:   Tuesday, September 13, 12:00 Noon to 1:00pm.&lt;br /&gt;Where:   GHC 6501&lt;br /&gt;&lt;br /&gt;Abstract: The Cunei Machine Translation Platform is an open-source MT system designed to model instances of translation. One of the challenges to this approach is effective training. We describe two techniques that improve the training procedure and allow us to leverage the strengths of instance-based modeling. First, during training we approximate our model with a second-order Taylor series. Second, we discount models based on the magnitude of their approximation. By reducing error in training, our model now consistently outperforms the standard SMT model with gains ranging from 0.51 to 3.77 BLEU on German-English and Czech-English test sets.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-3833236152394934287?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/3833236152394934287/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=3833236152394934287' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/3833236152394934287'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/3833236152394934287'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2011/09/training-machine-translation-with.html' title='Training Machine Translation with a Second-Order Taylor Approximation of Weighted Translation Instances'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6237485178259201445</id><published>2011-05-16T13:19:00.001-07:00</published><updated>2011-05-16T13:19:45.739-07:00</updated><title type='text'>Syntax-to-Morphology Mapping in Factored Phrase-Based SMT (English and Turkish)</title><content type='html'>Title: Syntax-to-Morphology Mapping in Factored Phrase-Based&lt;br /&gt;Statistical Machine Translation between English and Turkish&lt;br /&gt;&lt;br /&gt;Speaker: Reyyan Yeniterzi&lt;br /&gt;&lt;br /&gt;When: Tuesday, May 17 at 12:15pm&lt;br /&gt;Where: GHC 6501&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;&lt;br /&gt;Motivated by the observation that many local and some nonlocal&lt;br /&gt;syntactic structures in English essentially map to morphologically&lt;br /&gt;complex words in Turkish, a new approach which is called&lt;br /&gt;syntax-to-morphology mapping was introduced recently (Yeniterzi and&lt;br /&gt;Oflazer, 2010). This approach maps syntactic structures in English to&lt;br /&gt;complex words in Turkish directly. It mainly recognizes certain local&lt;br /&gt;and nonlocal syntactic structures on the English side and packages&lt;br /&gt;those structures and attach to heads to obtain parallel morphological&lt;br /&gt;structures.&lt;br /&gt;&lt;br /&gt;With the help of this method, one can identify and reorganize phrases&lt;br /&gt;on the English side, to align English syntax to Turkish morphology.&lt;br /&gt;Furthermore with this method, continuous and discontinuous variants of&lt;br /&gt;certain (syntactic) source phrases can be conflated during the SMT&lt;br /&gt;phrase extraction process. Since most function words encoding syntax&lt;br /&gt;are now abstracted into complex tags, the length of the English&lt;br /&gt;sentences can be dramatically reduced.&lt;br /&gt;&lt;br /&gt;The initial experiments were performed on English-to-Turkish SMT&lt;br /&gt;system. In this project, we built upon this initial system by doing&lt;br /&gt;lexical reordering and data augmentation. Furthermore we also applied&lt;br /&gt;syntax-to-morphology mapping to a Turkish-to-English SMT system for&lt;br /&gt;the first time.&lt;br /&gt;&lt;br /&gt;This is joint work with Kemal Oflazer from Qatar CMU. It was presented&lt;br /&gt;in the Machine Translation and Morphologically-rich Languages Research&lt;br /&gt;Workshop at Haifa, Israel in January, 2011.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6237485178259201445?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6237485178259201445/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6237485178259201445' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6237485178259201445'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6237485178259201445'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2011/05/syntax-to-morphology-mapping-in.html' title='Syntax-to-Morphology Mapping in Factored Phrase-Based SMT (English and Turkish)'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-5537654194372941145</id><published>2011-05-03T13:41:00.001-07:00</published><updated>2011-05-03T13:41:34.457-07:00</updated><title type='text'>Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability</title><content type='html'>Title: Better Hypothesis Testing for Statistical Machine Translation:&lt;br /&gt;Controlling for Optimizer Instability&lt;br /&gt;Speaker: Jonathan Clark&lt;br /&gt;When: Tuesday, 4/19 at Noon&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;In statistical machine translation, a researcher seeks to determine&lt;br /&gt;whether some innovation (e.g., a new feature, model, or inference&lt;br /&gt;algorithm) improves translation quality in comparison to a baseline&lt;br /&gt;system. To answer this question, he runs an experiment to evaluate the&lt;br /&gt;behavior of the two systems on held-out data.  In this paper, we&lt;br /&gt;consider how to make such experiments more statistically reliable. We&lt;br /&gt;provide a systematic analysis of the effects of optimizer instability&lt;br /&gt;(an extraneous variable that is seldom controlled for) on experimental&lt;br /&gt;outcomes, and make recommendations for reporting results more&lt;br /&gt;accurately.&lt;br /&gt;&lt;br /&gt;This is joint work with Chris Dyer, Alon Lavie, and Noah Smith. It was&lt;br /&gt;recently accepted for publication as an ACL short paper.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-5537654194372941145?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/5537654194372941145/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=5537654194372941145' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5537654194372941145'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5537654194372941145'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2011/05/better-hypothesis-testing-for.html' title='Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-2974850443635082565</id><published>2011-03-02T12:29:00.000-08:00</published><updated>2011-03-02T12:29:36.636-08:00</updated><title type='text'>Qin Gao: Expanding parallel corpora for machine translation</title><content type='html'>Speaker: Qin Gao &lt;br /&gt;When: at noon, March 8, 2011 &lt;br /&gt;Where: GHC 4405 &lt;br /&gt;&lt;br /&gt;We present an approach of expanding parallel corpora for machine translation. By utilizing Semantic role labeling (SRL) on one side of the language pair, we extract SRL substitution rules from existing parallel corpus. The rules are then used for generating new sentence pairs. An SVM classifier is built to filter the generated sentence pairs. The filtered corpus is used for training phrase-based translation models, which can be used directly in translation tasks or combined with baseline models. Experiment results on Chinese-English machine translation tasks show an average improvement of 0.45 BLEU and 1.22 TER points across 5 different NIST test sets.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-2974850443635082565?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/2974850443635082565/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=2974850443635082565' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2974850443635082565'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2974850443635082565'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2011/03/qin-gao-expanding-parallel-corpora-for.html' title='Qin Gao: Expanding parallel corpora for machine translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-1873236110830335539</id><published>2011-01-13T17:34:00.000-08:00</published><updated>2011-01-13T17:34:53.027-08:00</updated><title type='text'>Machine Translation and Computer-Assisted Translation</title><content type='html'>Title: Prospects for Integrating Machine Translation and Computer-Assisted Translation in the Translation Industry&lt;br /&gt;&lt;br /&gt;Speaker: Gregory M. Shreve from the Department of Modern and Classical Language Studies at Kent State University and colleagues&lt;br /&gt;Location: GHC 6115 &lt;br /&gt;Time: 12:30 pm, 14 Jan 2011&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The speaker's CV can be found at http://www.kent.edu/mcls/faculty/mcls_shreve.cfm.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-1873236110830335539?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/1873236110830335539/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=1873236110830335539' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1873236110830335539'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1873236110830335539'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2011/01/machine-translation-and-computer.html' title='Machine Translation and Computer-Assisted Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6893994870959396855</id><published>2010-12-13T12:21:00.000-08:00</published><updated>2010-12-13T12:21:12.248-08:00</updated><title type='text'>Efficient Language Model Inference - Kenneth Heafield</title><content type='html'>Title: Efficient Language Model Inference&lt;br /&gt;Who?     Kenneth Heafield&lt;br /&gt;When?   Tuesday, December 21 @ Noon&lt;br /&gt;Where?  GHC 4405&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;In GHC 4405 at noon on Tuesday Dec 21, I will give a speaking&lt;br /&gt;requirement talk on Efficient Language Model Inference.  As this is also&lt;br /&gt;a MT Lunch, there will be free lunch.&lt;br /&gt;&lt;br /&gt;If you're using SRILM, come to my talk to reduce your memory consumption&lt;br /&gt;by 86% while reducing CPU time by 16%.  Users of IRSTLM should come for&lt;br /&gt;the same reason; the code uses 42% less memory and 19% less CPU.&lt;br /&gt;&lt;br /&gt;Language models are an important feature in speech, translation,&lt;br /&gt;generation, IR, and other technologies.  More training data and less&lt;br /&gt;pruning generally lead to higher quality, but RAM is a limiting factor.&lt;br /&gt; Further, systems consult language models so frequently that lookups&lt;br /&gt;dominate CPU time.&lt;br /&gt;&lt;br /&gt;This talk presents language modeling code with several optimizations to&lt;br /&gt;improve time and space performance.  Storing backoff information in&lt;br /&gt;feature state reduces redundant lookups.  Constructing known&lt;br /&gt;distributions and biasing binary search speeds search and reduces page&lt;br /&gt;faults.  Memory mapping reduces load time.  Bit level packing increases&lt;br /&gt;locality.  Stronger filtering removes n-grams that cannot be assembled&lt;br /&gt;during decoding due to phrase and sentence constraints.  The code is&lt;br /&gt;currently integrated into Moses and being integrated into cdec and&lt;br /&gt;Joshua.  I will cover how my code works and how to use it in other&lt;br /&gt;decoders.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6893994870959396855?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6893994870959396855/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6893994870959396855' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6893994870959396855'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6893994870959396855'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/12/efficient-language-model-inference.html' title='Efficient Language Model Inference - Kenneth Heafield'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6497143934409340256</id><published>2010-10-06T12:15:00.001-07:00</published><updated>2010-10-06T12:15:57.400-07:00</updated><title type='text'>Choosing the Right Evaluation for Machine Translation</title><content type='html'>Time: Noon on Tuesday, October 12&lt;br /&gt;Place: GHC 6501 (usual location)&lt;br /&gt;&lt;br /&gt;Title: Choosing the Right Evaluation for Machine Translation: an Examination of Annotator and Automatic Metric Performance on Human Judgment Tasks&lt;br /&gt;Authors: Michael Denkowski and Alon Lavie&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;This work examines the motivation, design, and practical results of several types of human evaluation tasks for machine translation.  In addition to considering annotator performance and task informativeness over multiple evaluations, we explore the practicality of tuning automatic evaluation metrics to each judgment type in a comprehensive experiment using the METEOR metric.  We present results showing clear advantages of tuning to certain types of judgments and discuss causes of inconsistency when tuning to various judgment data, as well as sources of difficulty in the human evaluation tasks themselves.&lt;br /&gt;&lt;br /&gt;This work will be presented at AMTA 2010.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6497143934409340256?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6497143934409340256/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6497143934409340256' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6497143934409340256'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6497143934409340256'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/10/choosing-right-evaluation-for-machine.html' title='Choosing the Right Evaluation for Machine Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-5268930858608212781</id><published>2010-09-13T14:51:00.000-07:00</published><updated>2010-09-13T14:53:31.464-07:00</updated><title type='text'>Models for Synchronous Grammar Induction for Statistical Machine Translation</title><content type='html'>&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, sans-serif; font-size: small; line-height: 18px; white-space: pre-wrap; -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; "&gt;Title:  Models for Synchronous Grammar Induction for Statistical Machine Translation  &lt;/span&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Arial, sans-serif; font-size: small; line-height: 18px; white-space: pre-wrap; -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; "&gt;Presenters: Chris Dyer, LTI&amp;amp;  Desai Chen, CSD undergraduate  &lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Arial, sans-serif; font-size: small; line-height: 18px; white-space: pre-wrap; -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; "&gt;Tuesday, September 14, at Noon to 1:30pm in GHC 6501. &lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Arial, sans-serif; font-size: small; line-height: 18px; white-space: pre-wrap; -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; "&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Arial, sans-serif; font-size: small; line-height: 18px; white-space: pre-wrap; -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; "&gt;Abstract: The last decade of research in Statistical Machine Translation (SMT) has seen rapid progress. The most successful methods have been based on synchronous context free grammars (SCFGs), which encode translational equivalences and license reordering between tokens in the source and target languages. Yet, while closely related language pairs can be translated with a high degree of precision now, the result for distant pairs is far from acceptable. In theory, however, the "right"' SCFG is capable of handling most, if not all, structurally divergent language pairs. This talk will report on the results of the 2010 Language Engineering Workshop held at Johns Hopkins University that the goal to focus on the crucial practical aspects of acquiring such SCFGs from bilingual, but otherwise unannotated, text. We started with existing algorithms for inducing unlabeled SCFGs (e.g. the popular Hiero model) and then used unsupervised learning methods to refine the syntactic constituents used in the translation rules of the grammar. &lt;/span&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Arial, sans-serif; font-size: small; line-height: 18px; white-space: pre-wrap; -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; "&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-5268930858608212781?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/5268930858608212781/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=5268930858608212781' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5268930858608212781'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5268930858608212781'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/09/chris-dyer-lti-desai-chen-csd.html' title='Models for Synchronous Grammar Induction for Statistical Machine Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-7029003710336643767</id><published>2010-06-28T13:50:00.000-07:00</published><updated>2010-06-28T13:51:45.772-07:00</updated><title type='text'>Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical Machine Translation from English to Turkish</title><content type='html'>&lt;span style="font-weight: bold;"&gt; Tuesday, June 29 at Noon, in GHC 6501&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Title:  Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical&lt;br /&gt;Machine Translation from English to Turkish&lt;br /&gt;&lt;br /&gt;Authors: Reyyan Yeniterzi and Kemal Oﬂazer&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;&lt;br /&gt;We present a novel scheme to apply factored phrase-based SMT to a&lt;br /&gt;language pair with very disparate morphological structures. Our&lt;br /&gt;approach relies on syntactic analysis on the source side (English) and&lt;br /&gt;then encodes a wide variety of local and non-local syntactic&lt;br /&gt;structures as complex structural tags which appear as additional&lt;br /&gt;factors in the training data. On the target side (Turkish), we only&lt;br /&gt;perform morphological analysis and disambiguation but treat the&lt;br /&gt;complete complex morphological tag as a factor, instead of separating&lt;br /&gt;morphemes. We incrementally explore capturing various syntactic&lt;br /&gt;substructures as complex tags on the English side, and evaluate how&lt;br /&gt;our translations improve in BLEU scores. Our maximal set of source and&lt;br /&gt;target side transformations, coupled with some additional techniques,&lt;br /&gt;provide an 39\% relative improvement from a baseline 17.08 to 23.78&lt;br /&gt;BLEU, all averaged over 10 training and test sets. Now that the&lt;br /&gt;syntactic analysis on the English side is available, we also&lt;br /&gt;experiment with more long distance constituent reordering to bring the&lt;br /&gt;English constituent order close to Turkish, but ﬁnd that these&lt;br /&gt;transformations do not provide any additional consistent tangible&lt;br /&gt;gains when averaged over the 10 sets.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-7029003710336643767?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/7029003710336643767/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=7029003710336643767' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7029003710336643767'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7029003710336643767'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/06/syntax-to-morphology-mapping-in.html' title='Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical Machine Translation from English to Turkish'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4154403036728638065</id><published>2010-05-12T13:52:00.000-07:00</published><updated>2010-06-28T13:52:55.809-07:00</updated><title type='text'>Chunk-Based EBMT</title><content type='html'>When: noon on May 18&lt;br /&gt;Where: GHC 6501&lt;br /&gt;&lt;br /&gt;Speaker: Jaedy Kim&lt;br /&gt;&lt;br /&gt;Topic: Chunk-Based EBMT&lt;br /&gt;&lt;br /&gt;Abstract: Corpus driven machine translation approaches such as Phrase-Based Statistical Machine Translation and Example-Based Machine Translation have been successful by using word alignment to find translation fragments for matched source parts in a bilingual training corpus.&lt;br /&gt;However, they still cannot properly deal with systematic translation for insertion or deletion words between two distant languages.&lt;br /&gt;In this work, we used syntactic chunks as translation units to alleviate this problem, improve alignments and show improvement in BLEU for Korean to English and Chinese to English translation tasks.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4154403036728638065?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4154403036728638065/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4154403036728638065' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4154403036728638065'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4154403036728638065'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/05/chunk-based-ebmt.html' title='Chunk-Based EBMT'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-5858463200938809488</id><published>2010-04-12T15:26:00.000-07:00</published><updated>2010-04-12T15:27:30.103-07:00</updated><title type='text'>Generalized templates for EBMT</title><content type='html'>&lt;span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; "&gt;Speaker: Rashmi Gangadharaiah&lt;br /&gt;Location: GHC 6501 &lt;/span&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; "&gt;&lt;br /&gt;Topic: Generalized templates for EBMT&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;-----------&lt;br /&gt;Example-Based Machine Translation (EBMT), like other corpus based methods, requires substantial parallel training data. One way to reduce data requirements and improve translation quality is to generalize parts of the parallel corpus into translation templates. This automated generalization process requires clustering. In most clustering approaches the optimal number of clusters (N) is found empirically on a development set which often takes several days. We introduce a spectral clustering framework that automatically estimates the optimal N and removes unstable oscillating points. The new framework produces significant improvements in low-resource EBMT settings for English-to-French (~1.4 BLEU points), English-to-Chinese (~1 BLEU point), and English-to-Haitian (~2 BLEU points). The translation quality with templates created using automatically and empirically found best N were almost the same. By discarding “incoherent” points, a further boost in translation scores is observed, even above the empirically found N.&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-5858463200938809488?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/5858463200938809488/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=5858463200938809488' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5858463200938809488'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5858463200938809488'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/04/generalized-templates-for-ebmt.html' title='Generalized templates for EBMT'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-7557049743321563683</id><published>2010-03-15T09:27:00.001-07:00</published><updated>2010-03-15T09:27:31.160-07:00</updated><title type='text'>Two talks</title><content type='html'>(1) Greg Hanneman:&lt;br /&gt;Title:  The Stat-XFER Group Submission for WMT '10&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;Each year, the Workshop in Statistical Machine Translation collects state-of-the-art MT results for a variety of European language pairs via a shared translation task.  In this talk, I will describe the CMU's Stat-XFER MT group submission to this year's WMT French--English track, our third submission to the WMT series, using the Joshua decoder.  A large focus will be on new modeling decisions or system-building techniques that have changed from eariler submissions based on new research carried out in our group.  I will also present some open questions facing builders of large-scale hierarchcial MT systems in general.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;(2) Vamshi Ambati:&lt;br /&gt;Title: Making sense of Crowd data for Machine Translation&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;Quality of crowd data is a common concern in crowd-sourcing approaches to data collection. When working with crowd data, the objectives are two-fold - maximizing the quality of data from non-experts, and minimizing the cost of annotation by pruning noisy annotators.&lt;br /&gt;I will discuss our recent experiments in Machine Translation for selection of high quality crowd translations by explicitly modeling annotator reliability based on agreement with other  submissions. I will also present some preliminary results in cost minimization and report their adaptation and feasibility to machine translation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-7557049743321563683?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/7557049743321563683/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=7557049743321563683' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7557049743321563683'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7557049743321563683'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/03/two-talks.html' title='Two talks'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-9182032282781319156</id><published>2010-02-18T22:43:00.000-08:00</published><updated>2010-02-18T22:44:21.707-08:00</updated><title type='text'>Nonparametric Word Segmentation for Machine Translation</title><content type='html'>Speaker: Thuylinh Nguyen&lt;br /&gt;Title: Nonparametric &lt;span class="il"&gt;Word&lt;/span&gt; &lt;span class="il"&gt;Segmentation&lt;/span&gt; for Machine Translation&lt;br /&gt;Thursday 18 Feb 2010. 12-1:30pm in GHC 4405.&lt;br /&gt;&lt;br /&gt;In this talk we present an unsupervised &lt;span class="il"&gt;word&lt;/span&gt; &lt;span class="il"&gt;segmentation&lt;/span&gt; for machine&lt;br /&gt;translation. The model utilizes existing nonparametric monolingual&lt;br /&gt;segmentations. The monolingual &lt;span class="il"&gt;segmentation&lt;/span&gt; model and the bilingual &lt;span class="il"&gt;word&lt;/span&gt;&lt;br /&gt;alignment model are coupled so that source text &lt;span class="il"&gt;segmentation&lt;/span&gt; optimizes&lt;br /&gt;the one-to-one mapping with the target text. Often, there are words in&lt;br /&gt;the source language that do not appear in target language and vise&lt;br /&gt;versa. Our model therefore models source language &lt;span class="il"&gt;word&lt;/span&gt; deletion and &lt;span class="il"&gt;word&lt;/span&gt;&lt;br /&gt;insertion. The experiments show improvements on Arabic-English and&lt;br /&gt;Chinese-English translation tasks.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-9182032282781319156?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/9182032282781319156/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=9182032282781319156' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/9182032282781319156'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/9182032282781319156'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/02/nonparametric-word-segmentation-for.html' title='Nonparametric Word Segmentation for Machine Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4122805570030089124</id><published>2010-01-13T20:16:00.001-08:00</published><updated>2010-01-13T20:18:21.062-08:00</updated><title type='text'>LoonyBin: Making Empirical MT Reproducible, Efficient, and Less Annoying</title><content type='html'>Speaker: Jonathan Clark&lt;br /&gt;When:  Tuesday, January 19 at Noon&lt;br /&gt;Where: GHC 6501&lt;br /&gt;What:  Free Knowledge and Free Food&lt;br /&gt;Title:   LoonyBin: Making Empirical MT Reproducible, Efficient, and&lt;br /&gt;Less Annoying&lt;br /&gt;&lt;br /&gt;Abstract: Construction of machine translation systems has evolved into&lt;br /&gt;a multi-stage workﬂow involving many complicated dependencies. Many&lt;br /&gt;decoder distributions have addressed this by including monolithic&lt;br /&gt;training scripts – &lt;a href="http://train-factored-model.pl/" target="_blank"&gt;train-factored-model.pl&lt;/a&gt; for Moses and &lt;a href="http://mr_runmer.pl/" target="_blank"&gt;mr_runmer.pl&lt;/a&gt;&lt;br /&gt;for SAMT. However, such scripts can be tricky to modify for novel&lt;br /&gt;experiments and typically have limited support for the variety of job&lt;br /&gt;schedulers found on academic and commercial computer clusters. Further&lt;br /&gt;complicating these systems are hyperparameters, which often cannot be&lt;br /&gt;directly optimized by conventional methods requiring users to&lt;br /&gt;determine which combination of values is best via trial and error. The&lt;br /&gt;recently-released LoonyBin open-source workﬂow management tool&lt;br /&gt;addresses these issues by providing: 1) a visual interface for the&lt;br /&gt;user to create and modify workﬂows; 2) a well-deﬁned logging&lt;br /&gt;mechanism; 3) a script generator that compiles visual workﬂows into&lt;br /&gt;shell scripts, and 4) the concept of Hyperworkﬂows, which intuitively&lt;br /&gt;and succinctly encodes small experimental variations within a larger&lt;br /&gt;workﬂow. We also describe the Machine Translation Toolpack for&lt;br /&gt;LoonyBin, which exposes state-of-the-art machine translation tools as&lt;br /&gt;drag-and-drop components within LoonyBin.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4122805570030089124?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4122805570030089124/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4122805570030089124' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4122805570030089124'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4122805570030089124'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2010/01/loony-bin.html' title='LoonyBin: Making Empirical MT Reproducible, Efficient, and Less Annoying'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-7476521973942618061</id><published>2009-12-09T14:39:00.000-08:00</published><updated>2009-12-09T14:40:30.069-08:00</updated><title type='text'>MEMT and METEOR</title><content type='html'>Kenneth Heafield and Michael Denkowski:  Features for System Combination&lt;br /&gt;(This is work done as an MT lab project.)&lt;br /&gt;&lt;br /&gt;Michael will give an update on his recent work on the METEOR MT evaluation matrix.&lt;br /&gt;&lt;br /&gt;10 Dec 2009, Thursday, 12:00-1:30, in GHC 6501&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-7476521973942618061?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/7476521973942618061/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=7476521973942618061' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7476521973942618061'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7476521973942618061'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/12/memt-and-meteor.html' title='MEMT and METEOR'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-464866176337538128</id><published>2009-11-09T18:24:00.000-08:00</published><updated>2009-11-09T18:29:30.362-08:00</updated><title type='text'>Lori's talk</title><content type='html'>&lt;span style="font-size:100%;"&gt;Speaker: Lori Levin&lt;br /&gt;Where: GHC 6501&lt;br /&gt;When: Nov 09, 2009 - Tuesday - Noon&lt;br /&gt;&lt;span&gt;&lt;/span&gt;&lt;span&gt;&lt;span style="font-family:Arial;"&gt;Title:   A Pendulum Swung Too Far&lt;br /&gt;Abstract: &lt;/span&gt;&lt;/span&gt;&lt;span&gt;&lt;span style="font-family:Arial;"&gt; This paper by Ken Church  &lt;/span&gt;&lt;/span&gt;&lt;span&gt;&lt;span style="font-family:Arial;"&gt;deals with the never ending battle between Empiricism and Rationalism,&lt;br /&gt;esp. its incarnation in NLP.&lt;/span&gt;&lt;/span&gt;&lt;span&gt;&lt;span style="font-family:Arial;"&gt; Lori will summarize and present the arguments formulated in the&lt;br /&gt;paper.  She will then continue with her own views on why linguistics&lt;br /&gt;needs to be brought back into NLP and MT in particular.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-464866176337538128?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/464866176337538128/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=464866176337538128' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/464866176337538128'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/464866176337538128'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/11/loris-talk.html' title='Lori&apos;s talk'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-1107120892679605509</id><published>2009-08-10T12:52:00.000-07:00</published><updated>2009-08-10T12:53:32.876-07:00</updated><title type='text'>Two talks</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Talk 1:&lt;/span&gt;&lt;br /&gt;Nguyen Bach:  Source-side Dependency Tree Reordering Models with Subtree Movements and Constraints&lt;br /&gt;&lt;br /&gt;Abstract:  We propose a novel source-side dependency tree reordering model for statistical machine translation, in which subtree movements and constraints are represented as reordering events associated with the widely used lexicalized reordering models. This model allows us to not only efficiently capture the statistical distribution of the subtree-to-subtree transitions in training data, but also utilize it directly at the decoding time to guide the search process. Using subtree movements and constraints as features in a log-linear model, we are able to help the reordering models make better selections.  It also allows the subtle importance of monolingual syntactic movements to be learned alongside other reordering features. We show  improvements in translation quality in English-Spanish and English-Iraqi translation tasks.&lt;br /&gt;&lt;br /&gt;This is joint work with Qin Gao and Stephan Vogel.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Talk 2:&lt;/span&gt;&lt;br /&gt;Francisco (Paco) Guzman:  Reassessment of the Role of Phrase Extraction in SMT&lt;br /&gt;&lt;br /&gt;Abstract:  In this paper we study in detail the relation between word alignment and phrase extraction.  First, we analyze different word alignments according to several characteristics and compare them to hand-aligned data. Then, we analyze the phrase-pairs generated by these alignments.  We observed that the number of unaligned words has a large impact on the characteristics of the phrase table.  A manual evaluation of phrase pair quality showed that the increase in the number of unaligned words results in a lower quality.  Finally, we present translation results from using the number of unaligned words as features from which we obtain up to 2BP of improvement.&lt;br /&gt;&lt;br /&gt;This is joint work with Qin Gao and Stephan Vogel.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-1107120892679605509?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/1107120892679605509/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=1107120892679605509' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1107120892679605509'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1107120892679605509'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/08/two-talks.html' title='Two talks'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-635729775899092349</id><published>2009-06-15T08:55:00.000-07:00</published><updated>2009-06-15T08:56:08.264-07:00</updated><title type='text'>Making Disfluent Output Slightly Less So:  MT System Combination Search Spaces and Optimization</title><content type='html'>Speaker: Kenneth Heafield &lt;br /&gt;&lt;br /&gt;Title:   Making Disfluent Output Slightly Less So:&lt;br /&gt;        MT System Combination Search Spaces and Optimization&lt;br /&gt;&lt;br /&gt;Abstract:  System combination merges several machine translation outputs&lt;br /&gt;into a single improved sentence.  This talk starts by summarizing the&lt;br /&gt;approach including, a search space derived from the alignments, and&lt;br /&gt;hypothesis scoring.  The current search space focuses on picking words&lt;br /&gt;in a roughly word synchronous way.  Another search space under development&lt;br /&gt;builds a directed graph in which aligned words correspond to a vertex and&lt;br /&gt;each bigram corresponds to a directed edge.  Search is conducted much like&lt;br /&gt;a left-to-right MT decoder.  Speed optimizations, which allow decoding at&lt;br /&gt;5.5 sentences per second, apply to other MT systems in the areas of&lt;br /&gt;duplicate handling, language model state, and multithreading.  This speed&lt;br /&gt;allows me to find hyperparameters by searching hundreds of parameter&lt;br /&gt;combinations, each with a full round of tuning.  In preparation for&lt;br /&gt;last Friday's NIST submission, system combination improved 2.4 BLEU&lt;br /&gt;points over the best component system for Urdu to English translation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-635729775899092349?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/635729775899092349/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=635729775899092349' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/635729775899092349'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/635729775899092349'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/06/making-disfluent-output-slightly-less.html' title='Making Disfluent Output Slightly Less So:  MT System Combination Search Spaces and Optimization'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-1395037045746888847</id><published>2009-04-28T16:38:00.000-07:00</published><updated>2009-05-01T06:28:22.054-07:00</updated><title type='text'>EBMT with external word alignment and chunk alignment</title><content type='html'>Title:  EBMT with external word alignment and chunk alignment.&lt;br /&gt;&lt;br /&gt;Who: Jae Dong Kim&lt;br /&gt;When: Tuesday May 12, 12:00pm&lt;br /&gt;Where: NSH 3305&lt;br /&gt;&lt;br /&gt;Abstract:  Since both EBMT and SMT are data driven methods, more accurate word alignment improves system performance in EBMT as in SMT. However, EBMT has focused on finding analogous examples while SMT has achieved plausibly accurate word alignment. For this reason, it is natural that one thinks that EBMT can benefit from using SMT word alignment. In this talk, I am going to talk about our approach to make use of more accurate external word alignment from SMT in our EBMT system. I am also going to talk about my preliminary results with chunk alignment for translation in EBMT.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-1395037045746888847?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/1395037045746888847/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=1395037045746888847' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1395037045746888847'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1395037045746888847'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/04/ebmt-with-external-word-alignment-and.html' title='EBMT with external word alignment and chunk alignment'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-9214956001134013657</id><published>2009-04-13T08:12:00.000-07:00</published><updated>2009-04-13T08:13:31.005-07:00</updated><title type='text'>Language Model Adaptation for Difficult to Translate Phrases</title><content type='html'>Presenter: Behrang Mohit&lt;br /&gt;Title: Language Model Adaptation for Difficult to Translate Phrases&lt;br /&gt;Date: Tuesday 12:30pm,  14 April 2009&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;We investigate the idea of adapting language models for phrases that&lt;br /&gt;have poor translation quality. We apply a selective adaptation&lt;br /&gt;criterion which uses a classifier to locate the most difficult phrase&lt;br /&gt;of each source language sentence. A special adapted language model is&lt;br /&gt;constructed for the highlighted phrase. Our adaptation heuristic uses&lt;br /&gt;lexical features of the phrase to locate the relevant parts of the&lt;br /&gt;parallel corpus for language model training. As we vary the&lt;br /&gt;experimental setup by changing the size of the SMT training data, our&lt;br /&gt;adaptation method consistently shows strong improvements over the&lt;br /&gt;baseline systems.&lt;br /&gt;This is a joint work with Frank Liberato and Rebecca Hwa.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-9214956001134013657?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/9214956001134013657/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=9214956001134013657' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/9214956001134013657'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/9214956001134013657'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/04/language-model-adaptation-for-difficult.html' title='Language Model Adaptation for Difficult to Translate Phrases'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6176224139972502750</id><published>2009-03-12T09:06:00.001-07:00</published><updated>2009-03-12T09:08:26.273-07:00</updated><title type='text'>Moving Beyond Phrase-Pairs: Dynamically Scoring Collections of Translation Examples</title><content type='html'>&lt;pre style="font-family: arial;"&gt;Moving Beyond Phrase-Pairs: Dynamically Scoring Collections of&lt;br /&gt;Translation Examples&lt;br /&gt;&lt;br /&gt;Who: Aaron B. Phillips&lt;br /&gt;When: Friday Mar 13, 12:00pm&lt;br /&gt;Where: NSH 1507&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;&lt;br /&gt;Statistical Machine Translation has prospered because it is based on&lt;br /&gt;models that are consistent and straightforward to optimize. The&lt;br /&gt;log-linear model in particular allows the researcher to exploit&lt;br /&gt;numerous, possibly dependent, features. However, the modeling approach&lt;br /&gt;taken by SMT enforces a particular top-down view of the data using&lt;br /&gt;phrase-pairs that does not easily allow for the integration of features&lt;br /&gt;that may change from example to example. What I propose is a shift in&lt;br /&gt;how the model is built. Inspired by Example-Based Machine Translation, I&lt;br /&gt;calculate features for each example separately, but like SMT this&lt;br /&gt;information is collected into a single log-linear model that is&lt;br /&gt;straightforward to optimize. This is accomplished by identifying at&lt;br /&gt;run-time the most appropriate collection of translation examples instead&lt;br /&gt;of using precomputed phrase-pairs. A search is performed over each&lt;br /&gt;example-specific feature such as the alignment quality, genre, or&lt;br /&gt;context to determine a collection that maximizes the score. The weights&lt;br /&gt;for each example-specific feature are adjustable during optimization and&lt;br /&gt;allow for a trade-off between forming collections over all the examples&lt;br /&gt;and forming collections that consist of a few high-quality examples.&lt;br /&gt;This framework seeks to unify the approaches of EBMT and SMT. It results&lt;br /&gt;in a model that is straightforward to optimize &lt;b&gt;&lt;span&gt;*&lt;/span&gt;and&lt;span&gt;*&lt;/span&gt;&lt;/b&gt; allows the&lt;br /&gt;integration of novel example-specific features.&lt;br /&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6176224139972502750?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6176224139972502750/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6176224139972502750' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6176224139972502750'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6176224139972502750'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/03/moving-beyond-phrase-pairs-dynamically.html' title='Moving Beyond Phrase-Pairs: Dynamically Scoring Collections of Translation Examples'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-7455386167768639968</id><published>2009-02-03T10:27:00.000-08:00</published><updated>2009-02-03T10:28:11.620-08:00</updated><title type='text'>An Overview of Tree-to-String Translation Models: Yang Liu</title><content type='html'>Speaker: Yang Liu&lt;br /&gt;Title: &lt;span lang="EN-US"&gt;An Overview of Tree-to-String Translation Models&lt;/span&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;/p&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt;Abstract: &lt;/span&gt;&lt;/p&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt;Recent research on statistical machine translation has lead to the rapid development of syntax-based translation models, in which syntactic information can be exploited to direct translation. In this talk, I will give an overview of tree-to-string translation models, one of the state-of-the-art syntax-based models. In a tree-to-string model, the source side is a phrase structure parse tree and the target side is a string. This talk includes the following topics: (1) naive tree-to-string model, (2) tree-sequence based tree-to-string model, (3) context-aware tree-to-string model, and (4) forest-based tree-to-string model. Experimental results show that forest-based tree-to-string model outperforms hierarchical phrase-based model significantly.&lt;/span&gt;&lt;/p&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;/p&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt;Short Bio: &lt;/span&gt;&lt;/p&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt;Yang Liu is an Assistant Researcher at Institute of Computing Technology, Chinese Academy of Sciences. He graduated in Computer Science from Wuhan University in 2002. He received his PhD degree in Computer Science from Institute of Computing Technology, Chinese Academy of Sciences. His major research interests include statistical machine translation and Chinese information processing. His publications on discriminative word alignment and tree-to-string models have received wide attention. He served as PC member/Reviewer for TALIP, ACL, EMNLP, AMTA, and SSST.&lt;/span&gt;&lt;/p&gt; &lt;p style="margin: 0cm 0cm 0pt;"&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-7455386167768639968?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/7455386167768639968/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=7455386167768639968' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7455386167768639968'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7455386167768639968'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/02/overview-of-tree-to-string-translation.html' title='An Overview of Tree-to-String Translation Models: Yang Liu'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-9114473705124385163</id><published>2009-01-13T08:21:00.000-08:00</published><updated>2009-01-13T08:23:04.033-08:00</updated><title type='text'>Parallel Treebanks in Machine Translation</title><content type='html'>Title:  Parallel Treebanks in Machine Translation&lt;br /&gt;Speaker: John Tinsley, Ph.D. student at the National Centre for Language Technology in DCU&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-9114473705124385163?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/9114473705124385163/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=9114473705124385163' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/9114473705124385163'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/9114473705124385163'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2009/01/parallel-treebanks-in-machine.html' title='Parallel Treebanks in Machine Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4966752806794186721</id><published>2008-12-08T13:36:00.000-08:00</published><updated>2008-12-08T13:37:22.528-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='M45'/><category scheme='http://www.blogger.com/atom/ns#' term='system'/><title type='text'>Fast MT Pipeline: Introduction to tools you can use</title><content type='html'>Date: 09-Dec-2008&lt;br /&gt;&lt;br /&gt;Qin and Alok will report on their work to speed up some&lt;br /&gt;of the MT processing by using parallel processing, with&lt;br /&gt;an emphasis on the tools they have developed, to do this&lt;br /&gt;kind of work.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Title: Fast MT Pipeline: Introduction to tools you can use.&lt;br /&gt;&lt;br /&gt;Abstract: In this talk, we would like to introduce you to some&lt;br /&gt;recently developed tools available for you to use, in order to&lt;br /&gt;speed up the MT pipeline. The tools of focus are:&lt;br /&gt;(i) multi-threaded giza: faster word alignment.&lt;br /&gt;(ii) chaksi: Phrase-Extraction on the M45 cluster.&lt;br /&gt;(iii) trambo: Decoding/MERT on the M45 cluster.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4966752806794186721?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4966752806794186721/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4966752806794186721' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4966752806794186721'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4966752806794186721'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/12/fast-mt-pipeline-introduction-to-tools.html' title='Fast MT Pipeline: Introduction to tools you can use'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-3325745479404514674</id><published>2008-11-16T00:13:00.000-08:00</published><updated>2008-11-16T00:15:35.095-08:00</updated><title type='text'>Presentations</title><content type='html'>Date: 11 Nov 2008&lt;br /&gt;Time: 12-1:30&lt;br /&gt;Room: NSH 3305&lt;br /&gt;&lt;br /&gt;Presentations:&lt;br /&gt;&lt;br /&gt;Andreas Zollmann: Wider Pipelines: N-Best Alignments and Parses in MT Training&lt;br /&gt;&lt;br /&gt;State-of-the-art statistical machine translation systems use hypotheses from several maximum a posteriori inference steps, including word alignments and parse trees, to identify translational structure and estimate the parameters of translation models. While this approach leads to a modular pipeline of independently developed components, errors made in these “single-best” hypotheses can propagate to downstream estimation steps that treat these inputs as clean, trustworthy training data. In this work we integrate N-best alignments and parses by using a probability distribution over these alternatives to generate posterior fractional counts for use in downstream estimation. Using these fractional counts in a DOPinspired syntax-based translation system, we show significant improvements in translation quality over a single-best trained baseline.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Silja Hildebrand: Combination of Machine Translation Systems via Hypothesis Selection from Combined N-Best Lists&lt;br /&gt;&lt;br /&gt;Different approaches in machine translation achieve similar translation quality with a variety of translations in the output. Recently it has been shown, that it is possible to leverage the individual strengths of various systems and improve the overall translation quality by combining translation outputs. In this paper we present a method of hypothesis selection which is relatively simple compared to system combination methods which construct a synthesis of the input hypotheses. Our method uses information from n-best lists from several MT systems and features on the sentence level which are independent from the MT systems involved to improve the translation quality.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-3325745479404514674?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/3325745479404514674/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=3325745479404514674' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/3325745479404514674'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/3325745479404514674'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/11/presentations.html' title='Presentations'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4254301704114025051</id><published>2008-09-08T13:20:00.001-07:00</published><updated>2008-09-08T14:46:47.405-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='LM'/><title type='text'>Bilingual-LSA based adaptation for statistical machine translation</title><content type='html'>&lt;span class="Apple-style-span" style="border-collapse: collapse;font-family:arial;font-size:100%;"  &gt;Date: 9 Sept 2008&lt;br /&gt;Speaker: Wilson Tam&lt;br /&gt;TITLE: Bilingual-LSA based adaptation for statistical machine translation&lt;br /&gt;&lt;br /&gt;ABSTRACT:&lt;br /&gt;We propose a novel approach to crosslingual language model (LM) and translation lexicon adaptation for statistical machine translation based on bilingual Latent Semantic Analysis (bLSA). bLSA enables latent topic distributions to be efficiently transferred across languages by enforcing a one-to-one topic correspondence during training. Using the proposed bLSA framework, model adaptation can be performed by, first, inferring the topic posterior distribution of the source text and then applying the inferred distribution to an N-gram LM of the target language and translation lexicon via marginal adaptation. The background phrase table is then enhanced with the additional phrase scores computed using the adapted translation lexicon.&lt;br /&gt;&lt;br /&gt;The proposed framework also features rapid bootstrapping of LSA models for new languages based on a source LSA model of another language. Our approach was evaluated on the Chinese-to-English MT06 test set. Improvement in BLEU was observed when the adapted LM and the adapted translation lexicon were applied individually. When the adapted LM and the adapted lexicon were applied simultaneously, the gain in BLEU was additive yielding 28.91% in BLEU which is statistically significant at the 95% confidence interval with respect to the unadapted baseline with 28.06% in BLEU.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4254301704114025051?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4254301704114025051/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4254301704114025051' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4254301704114025051'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4254301704114025051'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/09/bilingual-lsa-based-adaptation-for.html' title='Bilingual-LSA based adaptation for statistical machine translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6463923631239541157</id><published>2008-08-12T10:42:00.000-07:00</published><updated>2008-08-22T14:08:49.727-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='speech'/><title type='text'>Learning from Human Interpreter Speech</title><content type='html'>Speaker:  Matthias Paulik&lt;br /&gt;&lt;br /&gt;Title:  "Learning from Human Interpreter Speech"&lt;br /&gt;Date: 12 August 2008&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;Can spoken language translation (SLT) profit from human interpreter speech? In this talk, we explore scenarios which involve live human interpretation, off-line transcription and off-line translation on a massive scale. We consider the deployment of machine translation (&lt;span class="nfakPe"&gt;MT&lt;/span&gt;) and automatic speech recognition (ASR) for the off-line transcription and translation tasks; our systems are trained on 80+ hours of audio data and on parallel text corpora of ~40 million words. To improve performance, we use the available human interpreter speech as an auxiliary information source to bias ASR and &lt;span class="nfakPe"&gt;MT&lt;/span&gt; language models. We evaluate this approach on European Parliament Plenary Session (EPPS) data in three languages (English, Spanish and German), and report preliminary improvements in translation and transcription performance.&lt;br /&gt;&lt;span style="color: rgb(136, 136, 136);"&gt; &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6463923631239541157?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6463923631239541157/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6463923631239541157' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6463923631239541157'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6463923631239541157'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/08/learning-from-human-interpreter-speech.html' title='Learning from Human Interpreter Speech'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-2785256316284338104</id><published>2008-07-15T10:40:00.000-07:00</published><updated>2008-08-22T14:09:57.110-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TM'/><category scheme='http://www.blogger.com/atom/ns#' term='syntax'/><title type='text'>Improving Lexical Coverage of Syntax-driven MT by Re-structuring Non-isomorphic Trees</title><content type='html'>Speaker: Vamshi Ambati&lt;br /&gt;Date: 15 July 2008&lt;br /&gt;&lt;br /&gt;Abtract:&lt;br /&gt;Syntax-based approaches to statistical &lt;span class="nfakPe"&gt;MT&lt;/span&gt; require syntax-aware methods for acquiring their underlying translation models from parallel data.  This acquisition process can be driven by syntactic trees for either the source or target language, or by trees on both sides.  Work to date has demonstrated that using trees for both sides suffers from severe coverage problems. Approaches that project from trees on one side, on the other hand, have higher levels of recall, but suffer from lower precision, due to the lack of syntactically-aware word alignments.&lt;br /&gt;&lt;br /&gt;In this talk I first discuss extraction and the lexical coverage of the translation models learned in both of these scenarios. We will specifically look at how the non-isomorphic nature of the parse trees for the two languages effects recall and coverage.  I will then discuss a novel technique for restructuring target parse trees, that generates highly isomorphic target trees that preserve the syntactic boundaries of constituents that were aligned in the original parse trees. I will conclude by discussing some experimental evaluation with an English-French &lt;span class="nfakPe"&gt;MT&lt;/span&gt; System.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-2785256316284338104?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/2785256316284338104/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=2785256316284338104' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2785256316284338104'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2785256316284338104'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/07/improving-lexical-coverage-of-syntax.html' title='Improving Lexical Coverage of Syntax-driven MT by Re-structuring Non-isomorphic Trees'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4443753666812896090</id><published>2008-06-10T10:59:00.000-07:00</published><updated>2008-08-22T11:00:50.980-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='discriminative'/><title type='text'>Scalable Decoding for Syntax based SMT</title><content type='html'>Title: Scalable Decoding for Syntax based SMT&lt;br /&gt;&lt;br /&gt;Speaker: Abhaya Agarwal&lt;br /&gt;Joint work with Vamshi Ambati&lt;br /&gt;&lt;br /&gt;Date: June 10, 2008&lt;br /&gt;Time: 12am - 1:30pm&lt;br /&gt;Location: NSH 3305&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;Most large scale SMT systems these days use a hybrid model, combining a handful of generative models in a discriminative log-linear framework whose weights are trained on a small amount of development data. Attempts to train fully discriminative &lt;span class="nfakPe"&gt;MT&lt;/span&gt; models that use large number of features, face difficult scalability challenges because it requires repeated decoding of a large training set. In this talk, I will describe a decoder that aims to be efficient enough to allow such training and some initial experiments on training with large number of features.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4443753666812896090?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4443753666812896090/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4443753666812896090' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4443753666812896090'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4443753666812896090'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/08/scalable-decoding-for-syntax-based-smt.html' title='Scalable Decoding for Syntax based SMT'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6312637190608557722</id><published>2008-05-13T14:36:00.000-07:00</published><updated>2008-08-22T14:38:10.011-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='system'/><category scheme='http://www.blogger.com/atom/ns#' term='xfer'/><title type='text'>Statistical Transfer MT Systems for French and German</title><content type='html'>Speaker: Greg Hanneman&lt;br /&gt;&lt;br /&gt;Date: Tuesday, May 13, at noon&lt;br /&gt;Location: Wean Hall 4623&lt;br /&gt;&lt;br /&gt;Title: Statistical Transfer MT Systems for French and German&lt;br /&gt;&lt;br /&gt;Abstract: The AVENUE research group's statistical transfer system is a general framework for creating hybrid machine translation systems.  It uses two main resources: a weighted synchronous context-free grammar, and a probabilistic bilingual lexicon of syntax-based word- and phrase-level translations.  Over the last six months, we have developed new methods for extracting these resources automatically from parsed and word-aligned parallel corpora.  In this talk, I will describe the resource-extraction process as it was applied to new French--English and German--English systems for the upcoming ACL workshop on statistical machine translation.  Preliminary evaluation results --- both automatic and human-assessed --- will also be reviewed.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6312637190608557722?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6312637190608557722/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6312637190608557722' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6312637190608557722'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6312637190608557722'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/05/statistical-transfer-mt-systems-for.html' title='Statistical Transfer MT Systems for French and German'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-121574644200631740</id><published>2008-04-22T10:56:00.000-07:00</published><updated>2008-09-18T13:01:11.802-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='LM'/><title type='text'>Simulating Sentence Pairs Sampling Process via Source and Target Language Models</title><content type='html'>Speaker: Ngyuen Bach&lt;br /&gt;&lt;div&gt;&lt;br /&gt;Abstract: &lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;  In a traditional word alignment process, each sentence pair is equally assigned an occurrence number, which is normalized during the training to produce the empirical probability. However, some sentences could be more valuable, reliable and appropriate than others. These sentences should therefore have a higher weight in the training.  To solve this problem, we explored methods of resampling sentence pairs. We investigated three sets of features: sentence pair confidence (/sc/), genre-dependent sentence pair confidence (/gdsc/) and sentence-dependent phrase alignment confidence (/sdpc/) scores. These features were calculated over an entire training corpus and could easily be integrated into the phrase-based machine translation system.  &lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-121574644200631740?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/121574644200631740/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=121574644200631740' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/121574644200631740'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/121574644200631740'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/04/simulating-sentence-pairs-sampling.html' title='Simulating Sentence Pairs Sampling Process via Source and Target Language Models'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6446152338001417579</id><published>2008-03-19T11:04:00.000-07:00</published><updated>2008-08-22T14:55:11.495-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sttk'/><title type='text'>Communicating Unknown Words in Machine Translation</title><content type='html'>&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;Speaker: &lt;/span&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;Matthias Eck&lt;br /&gt;&lt;br /&gt;Title: &lt;/a&gt;&lt;/span&gt;&lt;a style="font-family: arial;" name="schedule"&gt;&lt;/a&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;Communicating Unknown Words in Machine Translation&lt;br /&gt;&lt;br /&gt;&lt;/a&gt;&lt;/span&gt;&lt;a style="font-family: arial;" name="schedule"&gt;Abstract:&lt;br /&gt;Unknown words are a major problem for every machine translation system. Regular evaluations and demos do not always show this very well, but in actual communication the lack of specialty vocabulary and named entity translations can seriously affect the communication ability.&lt;br /&gt;&lt;br /&gt;A new approach is presented that uses monolingual encyclopedias and dictionaries to "communicate" unknown words. Instead of the actual unknown word, its definition is extracted and translated, which leads to considerable improvements in translation quality.&lt;br /&gt;&lt;/a&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;&lt;/a&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6446152338001417579?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6446152338001417579/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6446152338001417579' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6446152338001417579'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6446152338001417579'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/03/communicating-unknown-words-in-machine.html' title='Communicating Unknown Words in Machine Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-2562270368431970652</id><published>2007-11-13T11:07:00.000-08:00</published><updated>2008-08-22T14:43:30.294-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='syntax'/><title type='text'>Trees that can help</title><content type='html'>Speaker: Alok Parlikar&lt;br /&gt;&lt;br /&gt;Title: (S (NP (NP Trees) (SBAR (WHNP that) (S (VP can)))) (VP help))&lt;br /&gt;&lt;br /&gt;Summary:&lt;br /&gt;&lt;br /&gt;For the past two months, I have been working with Alon Lavie and Stephan&lt;br /&gt;Vogel, on Chinese and English parse-trees, to investigate answers to the&lt;br /&gt;following questions:&lt;br /&gt;&lt;br /&gt;(a) Can constituency information and word level alignments be used to&lt;br /&gt;   align nodes in trees of parallel sentences? How precisely matched&lt;br /&gt;   (in meaning) are the yields of these aligned nodes?&lt;br /&gt;(b) Can the parse trees and word-level alignments be used for learning&lt;br /&gt;   reordering rules? If we use these rules to reorder source sentences,&lt;br /&gt;   can we do any better at translation?&lt;br /&gt;&lt;br /&gt;The current results show that:&lt;br /&gt;&lt;br /&gt;(a) - Node Alignments from hand-aligned data are very precise.&lt;br /&gt;   - Using automatic word-alignments to align nodes gives over 70%&lt;br /&gt;     precision and over 40% recall.&lt;br /&gt;(b) Using a 10-best reordering of words in the source sentences, with&lt;br /&gt;   a "dumb" reordering strategy has shown a 0.005 improvement in BLEU&lt;br /&gt;   score.&lt;br /&gt;&lt;br /&gt;I would like to talk about the approaches that we have taken here, and to&lt;br /&gt;discuss about strategies for improving these results.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-2562270368431970652?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/2562270368431970652/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=2562270368431970652' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2562270368431970652'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2562270368431970652'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/09/trees-that-can-help.html' title='Trees that can help'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-809611244356321304</id><published>2007-10-09T11:09:00.000-07:00</published><updated>2008-08-22T13:41:00.614-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ebmt'/><title type='text'>Sub-Phrasal Matching and Structural Templates in Example-Based MT</title><content type='html'>&lt;span style="font-family: arial;font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;Speaker: Aaron Phillips&lt;br /&gt;&lt;/a&gt;&lt;/span&gt;&lt;a style="font-family: arial;" name="schedule"&gt;&lt;br /&gt;Title: Sub-Phrasal Matching and Structural Templates in Example-Based &lt;span class="nfakPe"&gt;MT&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Example-Based Machine Translation (EBMT) encompasses many different&lt;br /&gt;approaches to data-driven &lt;span class="nfakPe"&gt;MT&lt;/span&gt;. In this work I first look at two different&lt;br /&gt;paradigms of EBMT. I then combine the strengths of these two systems and&lt;br /&gt;build a new engine that combines sub-phrasal matching with structural&lt;br /&gt;templates. The end result is a melding of ideas from EBMT, SMT, and&lt;br /&gt;Xfer. This synthesis results in higher translation quality and more&lt;br /&gt;graceful degradation, yielding 1.5% to 7.5% relative improvement in BLEU&lt;br /&gt;scores.&lt;br /&gt;&lt;br /&gt;This work was recently presented at TMI. The full paper can be found&lt;br /&gt;here:&lt;br /&gt;&lt;br /&gt;&lt;/a&gt;&lt;a style="font-family: arial;" href="http://dustoftheground.net/techne/research/sub-phrasal_matching_and_structural_templates_in_example-based_mt.pdf" target="_blank"&gt;http://dustoftheground.net/&lt;wbr&gt;techne/research/sub-phrasal_&lt;wbr&gt;matching_and_structural_&lt;wbr&gt;templates_in_example-based_mt.&lt;wbr&gt;pdf&lt;/a&gt;&lt;br /&gt; &lt;a style="font-family: arial;" name="schedule"&gt;&lt;br /&gt;&lt;/a&gt;&lt;span style="font-family: arial;font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;&lt;/a&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-809611244356321304?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/809611244356321304/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=809611244356321304' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/809611244356321304'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/809611244356321304'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/10/sub-phrasal-matching-and-structural.html' title='Sub-Phrasal Matching and Structural Templates in Example-Based MT'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-7589267336754503430</id><published>2007-08-14T13:41:00.000-07:00</published><updated>2008-08-22T13:42:28.092-07:00</updated><title type='text'>Experiments with a Noun-Phrase driven Statistical Machine Translation System</title><content type='html'>Title: Experiments with a Noun-Phrase driven Statistical Machine&lt;br /&gt;Translation System&lt;br /&gt;&lt;br /&gt;Speaker: Sanjika Hewavitharana&lt;br /&gt;Date: Aug 14, 2007&lt;br /&gt;Time: 12:00pm&lt;br /&gt;Place: NSH 3305  [During &lt;span class="nfakPe"&gt;MT&lt;/span&gt; Group Monthly &lt;span class="nfakPe"&gt;Lunch&lt;/span&gt;]&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;&lt;br /&gt;Hierarchical translation models that use phrases with words as well as&lt;br /&gt;sub-phrases have shown better performance than standard phrase based&lt;br /&gt;systems. In this talk I will present a noun-phrase (NP) driven&lt;br /&gt;statistical  machine translation system. Using noun-phrases as the&lt;br /&gt;decomposition unit, we build a two-level hierarchy of phrases. We first&lt;br /&gt;identify noun-phrases in the data and replace them with a tag to produce&lt;br /&gt;an NP tagged corpus. This corpus is then used to extract NP-tagged&lt;br /&gt;phrase translation pairs. Both noun-phrases and NP-tagged phrases are&lt;br /&gt;used in a two-level translation decoder. The two-level system shows&lt;br /&gt;significant improvements over a baseline phrase-based SMT system. It&lt;br /&gt;also produces longer matching phrases due to the generalization&lt;br /&gt;introduced by tagging noun-phrases.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-7589267336754503430?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/7589267336754503430/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=7589267336754503430' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7589267336754503430'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7589267336754503430'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/08/experiments-with-noun-phrase-driven.html' title='Experiments with a Noun-Phrase driven Statistical Machine Translation System'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-1862613425526662839</id><published>2007-05-22T13:45:00.000-07:00</published><updated>2008-08-22T15:13:56.571-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mteval'/><category scheme='http://www.blogger.com/atom/ns#' term='meteor'/><title type='text'>In Search of Better MT Evaluation Metric : Some Experiments</title><content type='html'>&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;Date: May 22, 2007&lt;br /&gt;Presenter: Abhaya Agarwal&lt;br /&gt;Title: In Search of Better MT Evaluation Metric : Some Experiments&lt;br /&gt;&lt;br /&gt;&lt;/a&gt;&lt;/span&gt;&lt;a name="schedule"&gt;Abstract: Area of automatic metrics for MT Evaluation has seen a lot of&lt;br /&gt;activity in last 4-5 years. Starting with BLEU, many such metrics have&lt;br /&gt;been proposed over the years including METEOR, HTER and ROUGE. While&lt;br /&gt;these metrics achieve good correlation with human judgments at the&lt;br /&gt;system level, situation remains bleak at individual sentence level. In&lt;br /&gt;this talk, I will talk about some work that we have been doing towards&lt;br /&gt;developing metrics with improved correlation with human judgments at the&lt;br /&gt;sentence level. I will present the current results and some possible&lt;br /&gt;future directions.&lt;br /&gt;&lt;/a&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;&lt;/a&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-1862613425526662839?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/1862613425526662839/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=1862613425526662839' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1862613425526662839'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1862613425526662839'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/05/in-search-of-better-mt-evaluation.html' title='In Search of Better MT Evaluation Metric : Some Experiments'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4545905318645364085</id><published>2007-04-17T13:46:00.000-07:00</published><updated>2008-08-22T13:51:39.012-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='language'/><title type='text'>An Assessment of Language Elicitation without the Supervision of a Linguist</title><content type='html'>&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;Date: Apr 17, 2007&lt;br /&gt;Presenter: Lori Levin and Alison Alvarez&lt;br /&gt;Title: An Assessment of Language Elicitation without the Supervision of a Linguist&lt;br /&gt;&lt;br /&gt;&lt;/a&gt;&lt;/span&gt;&lt;a name="schedule"&gt;We created an elicitation corpus designed to elicit the morphosyntactic&lt;br /&gt;features of a target language without the supervision of a linguist.&lt;br /&gt;The corpus is composed of approximately 3200 English source sentences&lt;br /&gt;that are then translated by a native speaker into the target language.&lt;br /&gt;The design of our corpus was driven by our need to elicit morphosyntactic&lt;br /&gt;language features without the supervision of a linguist.  In a previous&lt;br /&gt;paper we reported on a reverse Treebank and that was a deep&lt;br /&gt;morphosyntactic tree with two parallel human language sentences.  The&lt;br /&gt;first is provided by reverse annotation and the second is acquired&lt;br /&gt;through elicitation. This presentation will focus on the extent to which&lt;br /&gt;we able to acquire our morphosyntactic information from our translated&lt;br /&gt;corpora and the types of errors we encountered, both from the perspective&lt;br /&gt;of the translator and the corpus itself.&lt;br /&gt;&lt;br /&gt;&lt;/a&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;a name="schedule"&gt;&lt;/a&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4545905318645364085?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4545905318645364085/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4545905318645364085' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4545905318645364085'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4545905318645364085'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2008/08/assessment-of-language-elicitation.html' title='An Assessment of Language Elicitation without the Supervision of a Linguist'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-1347993234516840567</id><published>2007-02-20T14:03:00.000-08:00</published><updated>2008-08-22T14:05:36.692-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TM'/><title type='text'>Translation Model Pruning</title><content type='html'>&lt;a name="schedule"&gt;Date: Feb 20, 2007&lt;br /&gt;&lt;br /&gt;Presenter: Matthias Eck&lt;br /&gt;&lt;br /&gt;Title: Translation Model Pruning&lt;/a&gt;&lt;a name="schedule"&gt;&lt;br /&gt;&lt;/a&gt;&lt;span style=";font-family:helvetica,ariel,'sans serif';font-size:100%;"  &gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-1347993234516840567?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/1347993234516840567/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=1347993234516840567' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1347993234516840567'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1347993234516840567'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/02/translation-model-pruning.html' title='Translation Model Pruning'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-750677169701216022</id><published>2007-01-16T14:05:00.000-08:00</published><updated>2008-08-22T14:06:41.225-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='LM'/><title type='text'>SALM: Suffix Array and its Applications in Empirical Language Processing</title><content type='html'>&lt;a name="schedule"&gt;&lt;br /&gt;Date: Jan 16, 2007&lt;br /&gt;&lt;br /&gt;Presenter: Ying(Joy) Zhang&lt;br /&gt;&lt;br /&gt;Title: SALM: Suffix Array and its Applications in Empirical Language Processing&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-750677169701216022?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/750677169701216022/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=750677169701216022' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/750677169701216022'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/750677169701216022'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/01/salm-suffix-array-and-its-applications.html' title='SALM: Suffix Array and its Applications in Empirical Language Processing'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-5018263036218710210</id><published>2006-11-21T15:36:00.000-08:00</published><updated>2008-08-22T15:37:24.678-07:00</updated><title type='text'>Simulating Multiple Translations and ASR Transcripts for Applications in  Multilingual Spoken Document Classification</title><content type='html'>Title: Simulating Multiple Translations and ASR Transcripts for Applications in  Multilingual Spoken Document Classification&lt;br /&gt;&lt;br /&gt;Speaker: Wei-Hao Lin from the Informedia group&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;  We propose a statistical model to simulate multiple documents and&lt;br /&gt;  their translations (e.g. Chinese documents and their English&lt;br /&gt;  translations), and apply the model in the task of classifying&lt;br /&gt;  multilingual documents.  The model, based on a frequency matching&lt;br /&gt;  principle, predicts that previous approaches to building classifiers&lt;br /&gt;  from a common language (e.g., English) are not optimal for&lt;br /&gt;  multilingual collections with unbalanced numbers of documents, and a&lt;br /&gt;  proposed multilingual representation can outperform the mono-lingual&lt;br /&gt;  bag-of-words representation.  We also investigate the possibility of&lt;br /&gt;  combining multiple ASR transcripts and translations through&lt;br /&gt;  re-weighting.  The validity of our model is strongly supported by&lt;br /&gt;  the close match between predictions of the simulation model and the&lt;br /&gt;  empirical results of classifying multilingual spoken documents from&lt;br /&gt;  broadcast news in three languages.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-5018263036218710210?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/5018263036218710210/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=5018263036218710210' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5018263036218710210'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5018263036218710210'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2006/11/simulating-multiple-translations-and.html' title='Simulating Multiple Translations and ASR Transcripts for Applications in  Multilingual Spoken Document Classification'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-1061032727327169699</id><published>2006-10-17T15:08:00.000-07:00</published><updated>2008-08-22T15:17:54.694-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mt'/><category scheme='http://www.blogger.com/atom/ns#' term='asr'/><category scheme='http://www.blogger.com/atom/ns#' term='speech'/><title type='text'>Coupling of ASR+MT: Initial Experiments &amp; Future Directions</title><content type='html'>Speaker:  Ian Lane&lt;br /&gt;&lt;br /&gt;Title: Tighter Coupling of ASR+MT: Initial Experiments &amp;amp; Future Directions&lt;br /&gt;&lt;br /&gt;Abstract:&lt;br /&gt;In this talk, I will first give a brief overview of my PhD work entitled "Flexible Spoken Language Understanding based on Topic Classification and Domain Detection", and describe how the proposed approaches can be applied to applications other than speech-to-speech translation. I will&lt;br /&gt;then describe my current work which focuses on improving coupling between ASR and Machine-Translation Systems, specifically, when applied to conversational speech. Finally, I will propose future directions for which I hope to receive a large amount of feedback.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-1061032727327169699?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/1061032727327169699/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=1061032727327169699' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1061032727327169699'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/1061032727327169699'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2007/10/coupling-of-asrmt-initial-experiments.html' title='Coupling of ASR+MT: Initial Experiments &amp; Future Directions'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-2261146898926419807</id><published>2006-05-23T15:18:00.000-07:00</published><updated>2008-08-22T15:19:37.174-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ebmt'/><title type='text'>Anchor-Based Symmetric Probabilistic Alignment</title><content type='html'>Date: May 23, 2006&lt;br /&gt;&lt;br /&gt;Presenter: Jae Dong Kim,&lt;br /&gt;&lt;br /&gt;Title: Anchor-Based Symmetric Probabilistic Alignment&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt; &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-2261146898926419807?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/2261146898926419807/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=2261146898926419807' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2261146898926419807'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2261146898926419807'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2006/05/anchor-based-symmetric-probabilistic.html' title='Anchor-Based Symmetric Probabilistic Alignment'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4664186209331322030</id><published>2006-04-18T15:19:00.000-07:00</published><updated>2008-08-22T15:20:26.319-07:00</updated><title type='text'>Can the Internet help improve Machine Translation?</title><content type='html'>Date: April 18, 2006&lt;br /&gt;&lt;br /&gt;Presenter: Ari Font-Llitjos&lt;br /&gt;&lt;br /&gt;Title: Can the Internet help improve Machine Translation?&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;  &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4664186209331322030?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4664186209331322030/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4664186209331322030' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4664186209331322030'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4664186209331322030'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2006/04/can-internet-help-improve-machine.html' title='Can the Internet help improve Machine Translation?'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-3959358419938951765</id><published>2006-02-21T15:21:00.000-08:00</published><updated>2008-08-22T15:22:13.281-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='language'/><title type='text'>The MILE Corpus for Less Commonly Taught Languages</title><content type='html'>Date: February 21, 2006&lt;br /&gt;&lt;br /&gt;Presenter: Alison Alvarez,&lt;br /&gt;&lt;br /&gt;Title: The MILE (Minor Language Elicitation) Corpus for Less Commonly Taught Languages&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt; &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-3959358419938951765?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/3959358419938951765/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=3959358419938951765' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/3959358419938951765'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/3959358419938951765'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2006/02/mile-corpus-for-less-commonly-taught.html' title='The MILE Corpus for Less Commonly Taught Languages'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4042908488642595716</id><published>2005-12-08T15:22:00.000-08:00</published><updated>2008-08-22T15:22:51.675-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ebmt'/><title type='text'>Generalization and Context-Sensitivity for Example-Based Machine Translation</title><content type='html'>Date: December 08, 2005&lt;br /&gt;&lt;br /&gt;Presenter: Ralf Brown&lt;br /&gt;&lt;br /&gt;Title: Generalization and Context-Sensitivity for Example-Based Machine Translation&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;  &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4042908488642595716?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4042908488642595716/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4042908488642595716' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4042908488642595716'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4042908488642595716'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/12/generalization-and-context-sensitivity.html' title='Generalization and Context-Sensitivity for Example-Based Machine Translation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4624972800092142281</id><published>2005-11-15T15:24:00.000-08:00</published><updated>2008-08-22T15:25:06.133-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='speech'/><title type='text'>Speech Translation with Multiple Speech and Translation Hypotheses</title><content type='html'>Date: November 15, 2005&lt;br /&gt;&lt;br /&gt;Presenter: Chiori Hori&lt;br /&gt;&lt;br /&gt;Title: Speech Translation with Multiple Speech and Translation Hypotheses&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4624972800092142281?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4624972800092142281/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4624972800092142281' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4624972800092142281'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4624972800092142281'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/11/speech-translation-with-multiple-speech.html' title='Speech Translation with Multiple Speech and Translation Hypotheses'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6849555114209138540</id><published>2005-10-18T15:25:00.000-07:00</published><updated>2008-08-22T15:26:23.585-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='parsing'/><title type='text'>Adapting Resources for Parsing Arabic Dialects</title><content type='html'>Date: October 18, 2005&lt;br /&gt;&lt;br /&gt;Presenters: Rebecca Hwa and Carol Nichols from the NLP Lab at Pitt&lt;br /&gt;&lt;br /&gt;Title:Adapting Resources for Parsing Arabic Dialects&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6849555114209138540?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6849555114209138540/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6849555114209138540' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6849555114209138540'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6849555114209138540'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/10/adapting-resources-for-parsing-arabic.html' title='Adapting Resources for Parsing Arabic Dialects'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-2106471834100407241</id><published>2005-09-13T15:26:00.001-07:00</published><updated>2008-08-22T15:27:33.103-07:00</updated><title type='text'>Named Entity Extraction and Translation from Multimedia Documents</title><content type='html'>Date: September 13, 2005.&lt;br /&gt;&lt;br /&gt;Presenter: Fei Huang,&lt;br /&gt;&lt;br /&gt;Title: "Named Entity Extraction and Translation from Multimedia Documents".&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-2106471834100407241?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/2106471834100407241/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=2106471834100407241' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2106471834100407241'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2106471834100407241'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/09/named-entity-extraction-and-translation.html' title='Named Entity Extraction and Translation from Multimedia Documents'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-5064697273077956622</id><published>2005-08-23T15:28:00.000-07:00</published><updated>2008-08-22T15:29:07.516-07:00</updated><title type='text'>David Chiang's ACL-2005 paper</title><content type='html'>Date: August 23, 2005.&lt;br /&gt;&lt;br /&gt;Presenter: Joy (Ying Zhang)&lt;br /&gt;&lt;br /&gt;Title: "David Chiang's ACL-2005 paper : A Hierarchical Phrase-Based Model for Statistical Machine Translation".&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-5064697273077956622?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/5064697273077956622/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=5064697273077956622' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5064697273077956622'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/5064697273077956622'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/08/david-chiangs-acl-2005-paper.html' title='David Chiang&apos;s ACL-2005 paper'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-7173779905756238017</id><published>2005-06-21T15:29:00.000-07:00</published><updated>2008-08-22T15:29:57.654-07:00</updated><title type='text'>Semi-Automated Elicitation Corpus Generation</title><content type='html'>Date: June 21, 2005.&lt;br /&gt;&lt;br /&gt;Presenter: Alison Alvarez,&lt;br /&gt;&lt;br /&gt;Title: "Semi-Automated Elicitation Corpus Generation". &lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-7173779905756238017?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/7173779905756238017/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=7173779905756238017' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7173779905756238017'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/7173779905756238017'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/06/semi-automated-elicitation-corpus.html' title='Semi-Automated Elicitation Corpus Generation'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-6933058286032184649</id><published>2005-05-10T15:30:00.000-07:00</published><updated>2008-08-22T15:30:48.900-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mteval'/><title type='text'>Using Collocations to Assess MT Quality</title><content type='html'>Date: May 10, 2005.&lt;br /&gt;&lt;br /&gt;Presenter: Benjamin Han,&lt;br /&gt;&lt;br /&gt;Title: Using Collocations to Assess MT Quality&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-6933058286032184649?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/6933058286032184649/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=6933058286032184649' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6933058286032184649'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/6933058286032184649'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/05/using-collocations-to-assess-mt-quality.html' title='Using Collocations to Assess MT Quality'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-2951993451225973675</id><published>2005-04-19T15:31:00.000-07:00</published><updated>2008-08-22T15:31:56.494-07:00</updated><title type='text'>Mining Translations for Key Phrases from Web Corpora</title><content type='html'>Date: April 19, 2005.&lt;br /&gt;&lt;br /&gt;Presenters: Joy (Ying Zhang) and Fei Huang,&lt;br /&gt;&lt;br /&gt;Title: "Mining Translations for Key Phrases from Web Corpora".&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt; &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-2951993451225973675?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/2951993451225973675/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=2951993451225973675' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2951993451225973675'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/2951993451225973675'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/04/mining-translations-for-key-phrases.html' title='Mining Translations for Key Phrases from Web Corpora'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4613241126346931705</id><published>2005-03-22T15:31:00.000-08:00</published><updated>2008-08-22T15:33:29.433-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='memt'/><title type='text'>A New Approach for Multi-Engine MT</title><content type='html'>Date: March 22, 2005.&lt;br /&gt;&lt;br /&gt;Presenter: Shyamsundar Jayaraman,&lt;br /&gt;&lt;br /&gt;Title: "A New Approach for Multi-Engine MT" (MEMT) &lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4613241126346931705?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4613241126346931705/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4613241126346931705' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4613241126346931705'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4613241126346931705'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/03/new-approach-for-multi-engine-mt.html' title='A New Approach for Multi-Engine MT'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4671480241746391796.post-4492265800416128796</id><published>2005-02-22T15:33:00.000-08:00</published><updated>2008-08-22T15:34:08.684-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mteval'/><title type='text'>The METEOR metric</title><content type='html'>February 22, 2005.&lt;br /&gt;&lt;br /&gt;Presenter: Alon Lavie,&lt;br /&gt;&lt;br /&gt;Title: "The METEOR metric" (MT Evaluation)&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:helvetica, ariel, 'sans serif';font-size:100%;"&gt; &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4671480241746391796-4492265800416128796?l=cmu-mt.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://cmu-mt.blogspot.com/feeds/4492265800416128796/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4671480241746391796&amp;postID=4492265800416128796' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4492265800416128796'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4671480241746391796/posts/default/4492265800416128796'/><link rel='alternate' type='text/html' href='http://cmu-mt.blogspot.com/2005/02/meteor-metric.html' title='The METEOR metric'/><author><name>Vamshi Ambati</name><uri>http://www.blogger.com/profile/13989477522359132038</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
