Monolingual and cross-lingual knowledge transfer for topic classification
Chatbots and virtual assistants are on the rise, but building them is not easy. Collecting and labelling conversational data takes effort, and cross-lingual topical datasets are scarce. We address this by training a multilingual model on a monolingual topical dataset. The resulting model transfers topical knowledge across multiple languages, and its accuracy in each language correlates with the amount of pre-training data available for that language.