Hybridization of MT (AMTA 2006)
August 29, 2006
Nizar Habash (Columbia University)’s contribution to the AMTA Hybird MT Panel.
The Intuition: StatMT and RuleMT have complementary advantages:
Syntactic structure produces better global target linguistic structure,
Statistical phrase-based translation is more robust locally.
The Resource Challenge
Parallel corpora as models of performance vs. Dictionaries/analyzers as models of competence
“More is better” is true for both approaches
Parallel corpora are domain/genre specific
Dictionaries and parsers can be domain/genre specific
Hybrids may need more data: Annotated resources.