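Transform Language Models into Real-World Applications

We are building AI systems for users worldwide. This project team focuses on developing applications that can be deployed in practice and deliver impact at scale.

What You’ll Be Responsible For

- Run and manage open-source models efficiently, balancing cost and reliability
- Ensure performance and stability across GPU, CPU, and memory resources
- Monitor and troubleshoot inference issues to maintain low latency and high throughput
- Work with engineers to build scalable, reliable model serving solutions

Requirements

- Experience with model serving platforms such as vLLM and HuggingFace TGI
- Familiarity with GPU orchestration tools such as Kubernetes, Ray, Modal, RunPod, and LambdaLabs
- Ability to monitor latency and cost, and to scale systems efficiently with traffic demand
- Experience setting up inference API endpoints for backend engineers

What You’ll Get

- Flat organization and real ownership
- Full involvement in product direction and consensus decision-making
- Flexible hybrid work arrangement
- High-impact role collaborating across product, data, and engineering
- Top-of-market compensation and performance bonuses
- Global product development experience and exposure
- A range of perks: housing subsidies, a quality staff cafeteria, and overtime meal allowances
- Health, dental, and vision insurance
- Global travel insurance (for you and your dependents)
- Unlimited, flexible paid time off

Team and Culture

We are a high-density, high-performance team focused on high-quality work and global impact. We act like owners and value speed, clarity, and thorough accountability. If you are eager to grow and pursue excellence, we sincerely invite you to join us!

About Bjak

BJAK is Southeast Asia’s largest insurance aggregation platform, with more than 8 million users, and is wholly owned by its employees. Headquartered in Malaysia, it operates in Thailand, Taiwan, and Japan. We help millions of people obtain transparent and affordable financial protection. Using APIs, automation, and cutting-edge AI technology, we simplify complex financial products and are working to build the next generation of intelligent financial systems.

Transform Language Models into Real-World Applications

We’re building AI systems for a global audience. We are living through an AI transition, and this new project team will focus on building applications that deliver real-world impact and reach the widest possible usage.

This is a global role with a hybrid work arrangement that combines flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure, and data to build and scale impactful AI solutions.

Why This Role Matters

You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production.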
Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.

What You’ll Do

- Run and manage open-source models efficiently, optimizing for cost and reliability
- Ensure high performance and stability across GPU, CPU, and memory resources
- Monitor and troubleshoot model inference to maintain low latency and high throughput
- Collaborate with engineers to implement scalable and reliable model serving solutions

What Is It Like

- You like ownership and independence.
- You believe clarity comes from action: you prototype, test, and iterate without waiting for perfect plans.
- You stay calm and effective in startup chaos: shifting priorities and building from zero don’t faze you.
- You have a bias for speed: you believe it’s better to deliver something valuable now than a perfect version much later.
- You see feedback and failure as part of growth: you’re here to level up.
- You possess humility, hunger, and hustle, and you lift others up as you go.

Requirements

- Experience with model serving platforms such as vLLM or HuggingFace TGI
- Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, and LambdaLabs
- Ability to monitor latency and costs, and to scale systems efficiently with traffic demand
- Experience setting up inference endpoints for backend engineers

What You’ll Get

- Flat structure and real ownership
- Full involvement in direction and consensus decision-making
- Flexibility in work arrangement
- High-impact role with visibility across product, data, and engineering
- Top-of-market compensation and performance-based bonuses
- Global exposure to product development
- Lots of perks: housing rental subsidies, a quality company cafeteria, and overtime meals
- Health, dental, and vision insurance
- Global travel insurance (for you and your dependents)
- Unlimited, flexible time off

Our Team & Culture

We’re a high-density, high-performance team focused on high-quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership.
If you’re hungry to grow and care deeply about excellence, join us.

About Bjak

BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.

If you’re excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.