Language Translation: A Problem of Vector Space Mathematics

To translate one language into another, find the linear transformation that maps one to the other. Simple, if you are part of an elite team of Google engineers.

A new translation technique being created by Google does not rely on versions of the same document in different languages, the old dictionary approach. Instead, it uses data mining techniques to model the structure of a single language and then compares this to the structure of another language. The new approach relies on the notion that every language must describe a similar set of ideas, so the words that do this must also be similar. For example, most languages will have words for common animals such as cat, dog, cow and so on. And these words are probably used in the same way in sentences such as “a cat is an animal that is smaller than a dog.”

The same is true of numbers. The image above shows the vector representations of the numbers one to five in English and Spanish and demonstrates how similar they are. The set of all the relationships, the so-called “language space”, can be thought of as a set of vectors that each point from one word to another. And in recent years, linguists have discovered that it is possible to handle these vectors mathematically. For example, the operation ‘king’ – ‘man’ + ‘woman’ results in a vector that is similar to ‘queen’.

Citation: Tomas Mikolov, Quoc V. Le, Ilya Sutskever, 'Exploiting Similarities among Languages for Machine Translation', arXiv:1309.4168

Link: How Google Converted Language Translation Into a Problem of Vector Space Mathematics - Technology Review

bizdean
Good to see comments and discussion coming back to Science20.com.

How Trump Is Making Taiwan Safe(r) · 3 days ago
Hank Campbell
I didn't mention the left, I said China hasn't done anything important, nor will they. I'm right. It was not about you, nor is it about RFK II. I have criticized him for 18 years...

How Trump Is Making Taiwan Safe(r) · 3 days ago
John
I highly recommend getting the vaccine if you are over 50 years old. Before I got the vaccine, I'd come down with pneumonia every year; since then, nothing. There are currently four vaccines...

New Vaccine For 21 Strains Of Pneumococcal Disease · 1 week ago
John
Hank is just ashamed that he doesn't know science. At all. If he did, he'd know that the Chinese have been monitoring earthquakes since 132 CE, have been investigating flight since 1300...

How Trump Is Making Taiwan Safe(r) · 1 week ago
John H.
You are always attacking the left and Democrats so it is remarkable that you make that charge against me. You started a science forum and use it to push your political agenda. Remarkable hypocrisy...

How Trump Is Making Taiwan Safe(r) · 1 week ago

Science 2.0 Links

Comments

Know Science And Want To Write?

Donate or Buy SWAG