Yuhuan Jiang edited Word Based Models.md  over 10 years ago

Commit id: 1996c5f9f0736e96e8ebaf45de83033f34871a97

deletions | additions      

       

It is very intuitive to think of translation as a word-to-word process. Think of a Chinese person who does not know any English traveling in New York (or you can think about Tom Hanks in the movie _The Terminal_). When he looks at the sentence: .  The idea behind Word-Based Models lies in the common situation described above.  However, the reality is always a lot crueler. If you take this verbatim approach every time you come across a foreign sentence, you'll easily end up with translations that are completely not understandable. There are several reasons for this: 1. **Word Order Issues**: The word order in one language may be different from that in another. For example, in Japanese the predicate is always at the end, while English has predicates in the middle of sentences between the subject and the object. If translates Japanese word by word with a Japanese-English dictionary, you'll probably hear sentences like these: "_I Today apple eat._", "_I the beach to want to go._", etc.   2. **Multiple Translation Possibilities**: It is common for a foreign word to have many possible translations. The Chinese word "智" could both mean _intelligence_ or the country _Chile_. Choosing the correct translation that fits the context and reflects the meaning of the foreign sentence is no easy task for Tom Hanks in that movie.   3. **Missing Words**: This is commonly seen when you translate from English to Chinese. One kind of the determiners, definite articles (the word _the_) is usually ignored, because there is no equivalence to such words in Chinese. Examples are ubiquitous. ## The Model