Task-Based MoE for Multitask Multilingual Machine Translation


Authors

Hai Pham, Young Jin Kim, Subhabrata Mukherjee, David P. Woodruff, Barnabas Poczos, Hany Hassan Awadalla

Abstract

The mixture-of-experts (MoE) architecture has proven to be a powerful method for training deep models across diverse tasks and applications. However, current MoE implementations are task-agnostic, treating all tokens from different tasks in the same manner. In this work, we instead design a novel method that incorporates task information into MoE models at different levels of granularity with shared dynamic task-based adapters. Our experiments and analysis show the advantages of our approach over dense and canonical MoE models on multitask multilingual machine translation. With task-specific adapters, our models can additionally generalize to new tasks efficiently.
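The abstract does not spell out the routing mechanism, but the core idea of making MoE routing task-aware can be illustrated with a short sketch. The snippet below is a minimal, hypothetical PyTorch example, not the paper's implementation: the class name `TaskConditionedMoE`, the task-embedding lookup, and the top-1 gating are all illustrative assumptions.

```python
# Minimal sketch of a task-conditioned MoE layer in PyTorch.
# NOT the paper's implementation: class/parameter names and the
# task-embedding + top-1 gating scheme are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TaskConditionedMoE(nn.Module):
    def __init__(self, d_model: int, num_experts: int, num_tasks: int):
        super().__init__()
        # One small feed-forward network per expert.
        self.experts = nn.ModuleList([
            nn.Sequential(
                nn.Linear(d_model, 4 * d_model),
                nn.ReLU(),
                nn.Linear(4 * d_model, d_model),
            )
            for _ in range(num_experts)
        ])
        # A learned task embedding lets the router condition on which
        # task a token belongs to, instead of routing task-agnostically.
        self.task_emb = nn.Embedding(num_tasks, d_model)
        self.router = nn.Linear(d_model, num_experts)

    def forward(self, x: torch.Tensor, task_id: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model); task_id: (batch,) integer task labels
        t = self.task_emb(task_id).unsqueeze(1)   # (batch, 1, d_model)
        logits = self.router(x + t)               # task-aware gate logits
        gates = F.softmax(logits, dim=-1)         # (batch, seq, num_experts)
        top1 = logits.argmax(dim=-1)              # top-1 expert per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = top1 == e                      # tokens routed to expert e
            if mask.any():
                out[mask] = gates[mask, e].unsqueeze(-1) * expert(x[mask])
        return out


# Toy usage: two sequences from different tasks share one MoE layer.
moe = TaskConditionedMoE(d_model=64, num_experts=4, num_tasks=3)
x = torch.randn(2, 10, 64)
task_id = torch.tensor([0, 2])
y = moe(x, task_id)                               # (2, 10, 64)
```

In this sketch the only change from a canonical task-agnostic MoE layer is that the router scores the token representation plus a learned task embedding, so tokens from different tasks can be dispatched to different experts even when their contents are similar.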
