WORLD TRANSLATION CENTER IST EIN FÜHRENDER ANBIETER VON SPRACHÜBERSETZUNGEN UND SPRACHAUFNAHMEN DURCH PROFIS ÜBERALL AUF DER WELT.

Articles and Stories

Polish Autotranslation

Blog post written by our Polish team leader, Albin W, PhD., Poland

Both: Machine Translation and the ‘Twin Peaks’ TV series

What a strange title, you may think. Yes, it is a bit weird, but we will dive into the complicated Polish grammar here and you will see the connection.

In Polish you will find plurality of words for ‘both’. To choose a proper one you need to know what genders both subjects have. The form of ‘both’ depends also on the grammatical case used. I have counted eight basic forms of ‘both’ in Polish (obaj, obie, oboje, obu, obiema, oboma, obojgu, obojgiem) and even more longer synonyms (obydwie, obydwu, and so on).

Yet another difficulty is that in Polish the words ‘obaj, obie’ are used as adjectives or pronouns but never as a conjunction. Then, if you had to translate this title into Polish, you would need to reword it, for example, into ‘Two things:…’.

It can be one of many illustrations why an accurate machine translation could be considered only a dream in the near future. Machine translation may be an effective tool for acceleration translator’s work especially for schematic texts. I have personally written a macro in Microsoft Word, which gives me suggestions for English phrases containing from 1 to 5 words (the higher number of words having higher priority). It makes sense, because often few words constitute a meaningful phrase, while single words are meaningless. This macro used by me while translating patent documents (with many words or short phrases repeating) allows me to accelerate work as well as preserve consistency (the last being especially important in patents). My dictionary gets larger with every new document. So, in my opinion there are great possibilities to improve machine translation, but still I am very skeptical about ever seeing a high quality output even using my macro or any similar approach.

If you remember the ‘Twin Peaks’ series there is a shot (Episode 1, the first episode after pilot) with Bobby Briggs having dinner with his parents. At the end of his monologue major Briggs says: ‘To have his path made clear is the aspiration of every human being in our beclouded and tempestuous existence. Robert, you and I are going to work to make yours real clear.’ In the Polish version of the script the last sentence reads: ‘Obaj dołożymy starań, by twoje ścieżki były naprawdę proste’ which translates to: ‘We will both make efforts for your paths to be really clear.’ In this situation using ‘both’ in English could lead to ambiguity (maybe major Briggs will make efforts together with his wife?) while in Polish it is unambiguous (because ‘obaj’ is used only when two subjects have masculine gender).

Translation in this form is more attractive for a movie than for literal translation, because it is shorter and less complicated. But it is improbable to be a result of a machine translation, as it requires the knowledge of the gender of both persons involved and while this knowledge may be readily available for a human it may be not so easily accessible for a machine. If the computer does not understand all the context related to culture, human interactions, laws of physics and so on, it probably cannot produce superior results. And such an understanding requires the computer to think like a human, which is an idea put forward many years ago with actually no essential progress until now, despite the fact that we have lots of so-called artificial intelligence systems, but we are still a long way away from creating a perfect machine translation, and we might never get there when a translation like Polish has such complicated grammatical structure.

 

|
Standorte
Sprachen
Aguacateco | Amuzgo | Aymara | Chalchiteko | Chatino | Ch’orti’ | Chol | Huichol | Jakalteco | Kanjobal | Kaqchikel | Mopan | Nahuatl | Otomí | Poqomam | Poqomchi | Purépecha | Sakapulteco | Sipakapense | Tarasco | Tektiteko | Uspanteko | Acadian French | Acateco | Accented English | Accented French | Achi | Acholi | Afar | African French | Afrikaans | Akan | Albanian | Alur | Alutiiq | Amharic | Amish German | Angolan French | Angolan Portuguese | Algerian Arabic | Arabic Bahrain | Arabic | Egyptian Arabic | Jordanian Arabic | Arabic Kuwait | Arabic Lebanon | Arabic Modern | Moroccan Arabic | Arabic Oman | Palestinian Arabic | Arabic Qatar | Saudi Arabian Arabic | Syrian Arabic | Tunisian Arabic | Arabic (UAE) | Arabic Yemen | Armenian | Assamese | Awakatek | Azeri... | Bambara | Basque | Belarusian | Bemba | Bengali | Berber | Bikol | Bislama | Bodo | Bosnian | Bulgarian | Burmese | Burundi | Cajun French | Cambodian | Cantonese (Guangdong) | Cape Verdian | Carolinian | Catalan | Cebuano | Chamorro | Chechua | Cherokee | Chin | Cantonese (China) | Mandarin | Traditional Mandarin | Chinese (Singapore) | Chinese (Taiwan) | Chuj | Chuukese | Cree | Croatian | Czech | Dagbani | Danish | Dari | Darija | Dinka | Dutch | Dzongkha | English | African English | Australian English | British English | Canadian English | Indian English | Irish English | New Zealand English | Scottish English | South African English | American English | Estonian | Ewe | Fante | Faroese | Farsi | Fijian | Finnish | Flemish | French Belgian | Canadian French | French Congo | French | Moroccan French | Swiss French | Tunisian French | Frisian | Fukienese | Fula | Futunan | Ga | Galician | Garifuna | Garo | Georgian | Austrian German | German | Swiss German | Greek | Greek Cyprus | Guarani | Gujarati | Haitian Creole | Hakka Chin | Hausa | Hawaiian | Hawaiian | Hazaragi | Hebrew | Hiligaynon | Hindi | Hmong | Hopi | Hungarian | Icelandic | Igbo | Ilocano | Indonesian | Inuit | Inuktitut | Italian | Swiss Italian | Ixil | Jamaican Creole | Japanese | Javanese | Kannada | Kapampangan | Kaqchikel | Karen | Kashmiri | Kasmiri | Kazakh | Khasi | Khmer | Kiche | Kinyarwanda | Kiribatese | Kirundi | Konkani | Korean | Kosraean | Krio | Kurdish | Kyrgyz | Laotian | Latvian | Lebanese | Lingala Congo | Lithuanian | Lu Mien | Luganda | Luxembourgish | Maasai | Maay Maay | Macedonian | Malagasy | Malay | Malayalam | Maltese | Mam | Mandinka | Manipuri | Maori | Marathi | Marshallese | Maya | Mende | Miskito | Mixe | Mixteco | Mizo | Mongolian | Montenegrin | Nagamese | Navajo | Ndebele | Nepali | Nigerian Pidgin | Niuean | Norwegian | Nuer | Ojibway | Ojibwe | Oriya | Oromo | Palauan | Papiamento | Papiamentu | Pashto | Pohnpeian | Pokot | Polish | Angolan Portuguese | Brazilian Portuguese | Portuguese Creole | European Portuguese | Portuguese Mozambique | Punjabi | Quechua | Quiche | Rapa Nui | Rarotongan | Rohingya | Romani | Romanian | Rotooro | Rundi | Russian | Samoan | Sanskrit | Sámi | Scottish Gaelic | Sepedi | Serbian | Sesotho | Setswana | Shona | Sinhala | Siswati | Slovak | Slovenian | Soloman Islands Pidgin | Somali | Soninke | Soninke | Sotho | Spanish | Argentinian Spanish | Chilean Spanish | Colombian Spanish | Costa Rican Spanish | Cuban Spanish | Dominican Republic Spanish | Ecuadorian Spanish | Salvadorian Spanish | Guatemalan Spanish | Spanish Honduras | Mexican Spanish | Neutral Spanish | Spanish Panama | Paraguayan Spanish | Peruvian Spanish | Puerto Rican Spanish | Spanish (Spain) | Uruguayan Spanish | Venezuelan Spanish | Swahili | Swazi | Swedish | Tachaouit | Tagalog | Tahitian | Taiwanese | Tajik | Takbalit | Tamazight | Tamil | Tartarian | Tatar | Telugu | Temne | Thai | Tibetan | Tigrinya | Tojolobal | Tok Pisan | Tongan | Trique | Tshivenda | Tsonga | Tswana | Tulu | Turkana | Turkish | Turkish Cyprus | Tuvaluan | Twi | Tzeltal | Tzutujil | Ukrainian | Urdu | Uzbek | Valencian | North Vietnamese | South Vietnamese | Wallisian | Waray | Welsh | Wolof | Xhosa | Yapese | Yiddish | Yoruba | Yupik | Zapoteco | Zoque | ZuluMehr [+]
Sprachtalente
Acadian French Sprecher | Accented English Sprecher | African French Sprecher | Afrikaans Sprecher | Albanian Sprecher | Amharic Sprecher | Angolan Portuguese Sprecher | Algerian Arabic Sprecher | Arabic Bahrain Sprecher | Arabic Sprecher | Egyptian Arabic Sprecher | Jordanian Arabic Sprecher | Arabic Kuwait Sprecher | Arabic Lebanon Sprecher | Moroccan Arabic Sprecher | Arabic Oman Sprecher | Palestinian Arabic Sprecher | Arabic Qatar Sprecher | Saudi Arabian Arabic Sprecher | Syrian Arabic Sprecher | Tunisian Arabic Sprecher | Arabic (UAE) Sprecher | Armenian Sprecher | Assamese Sprecher | Azeri Sprecher | Bambara Sprecher | Basque Sprecher | Belarusian Sprecher | Bemba Sprecher | Bengali Sprecher | Bosnian Sprecher | Bulgarian Sprecher | Burmese Sprecher | Cajun French Sprecher | Cambodian Sprecher | Cantonese (Guangdong) Sprecher | Catalan Sprecher | Chin Sprecher | Cantonese (China) Sprecher | Mandarin Sprecher... | Traditional Mandarin Sprecher | Chinese (Singapore) Sprecher | Chinese (Taiwan) Sprecher | Chuukese Sprecher | Croatian Sprecher | Czech Sprecher | Dagbani Sprecher | Danish Sprecher | Dari Sprecher | Dinka Sprecher | Dutch Sprecher | Dzongkha Sprecher | African English Sprecher | Australian English Sprecher | British English Sprecher | Canadian English Sprecher | Indian English Sprecher | Irish English Sprecher | New Zealand English Sprecher | Scottish English Sprecher | South African English Sprecher | American English Sprecher | Estonian Sprecher | Ewe Sprecher | Farsi Sprecher | Finnish Sprecher | Flemish Sprecher | French Belgian Sprecher | Canadian French Sprecher | French Congo Sprecher | French Sprecher | Moroccan French Sprecher | Swiss French Sprecher | Tunisian French Sprecher | Ga Sprecher | Galician Sprecher | Georgian Sprecher | Austrian German Sprecher | German Sprecher | Swiss German Sprecher | Greek Sprecher | Gujarati Sprecher | Haitian Creole Sprecher | Hausa Sprecher | Hawaiian Sprecher | Hebrew Sprecher | Hindi Sprecher | Hmong Sprecher | Hungarian Sprecher | Icelandic Sprecher | Igbo Sprecher | Ilocano Sprecher | Indonesian Sprecher | Italian Sprecher | Swiss Italian Sprecher | Japanese Sprecher | Kannada Sprecher | Karen Sprecher | Kashmiri Sprecher | Kazakh Sprecher | Khasi Sprecher | Khmer Sprecher | Kinyarwanda Sprecher | Kirundi Sprecher | Konkani Sprecher | Korean Sprecher | Krio Sprecher | Kurdish Sprecher | Kyrgyz Sprecher | Laotian Sprecher | Latvian Sprecher | Lebanese Sprecher | Lingala Congo Sprecher | Lithuanian Sprecher | Luxembourgish Sprecher | Macedonian Sprecher | Malagasy Sprecher | Malay Sprecher | Malayalam Sprecher | Maltese Sprecher | Mandinka Sprecher | Manipuri Sprecher | Maori Sprecher | Marathi Sprecher | Marshallese Sprecher | Mizo Sprecher | Mongolian Sprecher | Montenegrin Sprecher | Nagamese Sprecher | Navajo Sprecher | Nepali Sprecher | Nigerian Pidgin Sprecher | Norwegian Sprecher | Nuer Sprecher | Oriya Sprecher | Oromo Sprecher | Papiamento Sprecher | Pashto Sprecher | Polish Sprecher | Angolan Portuguese Sprecher | Brazilian Portuguese Sprecher | European Portuguese Sprecher | Portuguese Mozambique Sprecher | Punjabi Sprecher | Rohingya Sprecher | Romanian Sprecher | Russian Sprecher | Samoan Sprecher | Serbian Sprecher | Sesotho Sprecher | Shona Sprecher | Sinhala Sprecher | Slovak Sprecher | Slovenian Sprecher | Somali Sprecher | Sotho Sprecher | Argentinian Spanish Sprecher | Chilean Spanish Sprecher | Colombian Spanish Sprecher | Costa Rican Spanish Sprecher | Cuban Spanish Sprecher | Dominican Republic Spanish Sprecher | Ecuadorian Spanish Sprecher | Salvadorian Spanish Sprecher | Guatemalan Spanish Sprecher | Mexican Spanish Sprecher | Neutral Spanish Sprecher | Spanish Panama Sprecher | Puerto Rican Spanish Sprecher | Spanish (Spain) Sprecher | Uruguayan Spanish Sprecher | Venezuelan Spanish Sprecher | Swahili Sprecher | Swedish Sprecher | Tagalog Sprecher | Taiwanese Sprecher | Tajik Sprecher | Tamazight Sprecher | Tamil Sprecher | Tartarian Sprecher | Telugu Sprecher | Thai Sprecher | Tibetan Sprecher | Tigrinya Sprecher | Tongan Sprecher | Tswana Sprecher | Turkish Sprecher | Twi Sprecher | Ukrainian Sprecher | Urdu Sprecher | Uzbek Sprecher | North Vietnamese Sprecher | South Vietnamese Sprecher | Welsh Sprecher | Xhosa Sprecher | Yoruba Sprecher | Zulu SprecherMehr [+]