Data Engineer

Remote

Job Description

Transform Language Models into Real-World Applications

We’re building AI systems for a global audience. We are living in an era of AI transition, and this new project team will focus on building applications that deliver real-world impact and reach the widest possible usage worldwide.

This is a global role with a hybrid work arrangement that combines flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure, and data to build and scale impactful AI solutions.

Why This Role Matters

You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.

What You’ll Do

Collect, clean, and preprocess user-generated text and image data for fine-tuning large models
Design and manage scalable data labeling pipelines, leveraging both crowdsourcing and in-house labeling teams
Build and maintain automated datasets for content moderation (e.g., safe vs unsafe content)
Collaborate with researchers and engineers to ensure datasets are high-quality, diverse, and aligned with model training needs

Who We’re Looking For

You like ownership and independence.
You believe clarity comes from action: you prototype, test, and iterate without waiting for perfect plans.
You stay calm and effective in startup chaos; shifting priorities and building from zero don’t faze you.
You have a bias for speed: you believe it’s better to deliver something valuable now than a perfect version much later.
You see feedback and failure as part of growth, and you’re here to level up.
You bring humility, hunger, and hustle, and you lift others up as you go.

Requirements

Proven experience preparing datasets for machine learning or fine-tuning large models
Strong skills in data cleaning, preprocessing, and transformation for both text and image data
Hands-on experience with data labeling workflows and quality assurance for labeled data
Familiarity with building and maintaining moderation datasets (safety, compliance, and filtering)
Proficiency in scripting (Python, SQL) and working with large-scale data pipelines

What You’ll Get

Flat structure & real ownership
Full involvement in product direction and consensus decision-making
Flexibility in work arrangement
High-impact role with visibility across product, data, and engineering
Top-of-market compensation and performance-based bonuses
Global exposure to product development
Lots of perks: housing rental subsidies, a quality company cafeteria, and overtime meals
Health, dental & vision insurance
Global travel insurance (for you & your dependents)
Unlimited, flexible time off

Our Team & Culture

We’re a talent-dense, high-performance team focused on high-quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.

About BJAK

BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.

If you’re excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.
