Transform Language Models into Real-World ApplicationsWe’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.This is a remote role based in Dublin, working closely with our HQ in Malaysia and cross-functional regional teams. You’ll operate across the stack, from backend logic and integration to frontend delivery, building intelligent systems that scale fast and matter deeply.Why This Role MattersYou’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.What You’ll DoCollect, clean, and preprocess user-generated text and image data for fine-tuning large modelsDesign and manage scalable data labeling pipelines, leveraging both crowdsourcing and in-house labeling teamsBuild and maintain automated datasets for content moderation (e.g., safe vs unsafe content)Collaborate with researchers and engineers to ensure datasets are high-quality, diverse, and aligned with model training needsWhat Is It LikeLikes ownership and independenceBelieve clarity comes from action - prototype, test, and iterate without waiting for perfect plans.Stay calm and effective in startup chaos - shifting priorities and building from zero doesn’t faze you.Bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.See feedback and failure as part of growth - you’re here to level up.Possess humility, hunger, and hustle, and lift others up as you go.RequirementsProven experience preparing datasets for machine learning or fine-tuning large modelsStrong skills in data cleaning, preprocessing, and transformation for both text and image dataHands-on experience with data labeling workflows and quality assurance for labeled dataFamiliarity with building and maintaining moderation datasets (safety, compliance, and filtering)Proficiency in scripting (Python, SQL) and working with large-scale data pipelinesWhat You’ll GetFlat structure & real ownershipFull involvement in direction and consensus decision makingFlexibility in work arrangementHigh-impact role with visibility across product, data, and engineeringTop-of-market compensation and performance-based bonusesGlobal exposure to product developmentLots of perks - housing rental subsidies, a quality company cafeteria, and overtime mealsHealth, dental & vision insuranceGlobal travel insurance (for you & your dependents)Unlimited, flexible time offOur Team & CultureWe’re a densed, high-performance team focused on high quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.About BjakBJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through . We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.