Document Type

Article

Publication Date

7-1-2021

Abstract

The alignment of word embedding spaces in different languages into a common crosslingual space has recently been in vogue. Strategies that do so compute pairwise alignments and then map multiple languages to a single pivot language (most often English). These strategies, however, are biased towards the choice of the pivot language, given that language proximity and the linguistic characteristics of the target language can strongly impact the resultant crosslingual space in detriment of topologically distant languages. We present a strategy that eliminates the need for a pivot language by learning the mappings across languages in a hierarchicalway. Experiments demonstrate that our strategy significantly improves vocabulary induction scores in all existing benchmarks, as well as in a new non-English–centered benchmark we built, which we make publicly available.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Publication Information

Madrazo Azpiazu, Ion and Pera, Maria Soledad. (2021). "Hierarchical Mapping for Crosslingual Word Embedding Alignment". Transactions of the Association for Computational Linguistics, 8, 361-376. https://doi.org/10.1162/tacl_a_00320

Download

Included in

Computer Sciences Commons

COinS

ScholarWorks

Computer Science Faculty Publications and Presentations

Hierarchical Mapping for Crosslingual Word Embedding Alignment

Document Type

Publication Date

Abstract

Creative Commons License

Publication Information

Included in

Browse

Links

Search

Author Corner

ScholarWorks

Computer Science Faculty Publications and Presentations

Hierarchical Mapping for Crosslingual Word Embedding Alignment

Authors

Document Type

Publication Date

Abstract

Creative Commons License

Publication Information

Included in

Share

Browse

Links

Search

Author Corner