AI Researcher – Multilingual Data
About This Gig
About the Role
We’re looking for an AI Researcher focused on multilingual data to help us build and scale next-generation language models across diverse languages and domains. You’ll own research and execution around data sourcing, curation, evaluation, and training strategies for multilingual and low-resource languages, with a strong emphasis on publishing high-quality research and translating it into production systems.
This role is ideal for someone who enjoys working close to the frontier: balancing papers, prototypes, and real-world impact in a fast-moving startup environment.

What You’ll Do
Design and execute research on multilingual datasets, including data collection, filtering, deduplication, and quality measurement
Develop strategies for low-resource and long-tail languages (sampling, augmentation, curriculum design)
Research and improve cross-lingual transfer, alignment, and robustness in large language models
Build and maintain evaluation benchmarks for multilingual perf
About the Seller
Featherless AI
on Himalayas