Data Platform Engineer

Yuhao Dai

Building scalable data infrastructure and AI systems. Based in Tokyo, Japan.

About

Data Platform Engineer with 8+ years building cloud-native, scalable data infrastructure. I've owned platform-wide architecture at global pharmaceutical and FMCG enterprises in Japan, specializing in lakehouse modernization, CI/CD pipelines, and data governance.

Currently exploring the intersection of LLM engineering and enterprise data platforms, building agentic workflows that automate day-to-day operations.

English TOEFL 115 · TOEIC 950
Japanese JLPT N1
Chinese Native

Experience

2025 — Now

Associate Director, Technical Consulting

IP-Tech SaaS Company (Tokyo)

Managing 11 technical consultants. Building GenAI-powered tools for enterprise data migration. Implementing agentic workflows to improve processes.

2023 — 2024

Senior Data Engineer — Platform Owner

Global Pharmaceutical Company (Japan)

Owned the Common Data Platform for Region Japan. Led a 5-member platform team. Rebuilt the normalization and mart layers. Implemented a Dagster, dbt, and Snowflake stack.

2022 — 2023

Senior Data Engineer

Major FMCG Company (Japan)

Built data migration pipelines from legacy systems to ADLS. Designed logical data warehouse and lakehouse PoC with Delta tables. Automated ETL with PySpark, Synapse, and Airflow.

2021 — 2022

Data Analyst & Infrastructure Engineer

Leading Crypto Exchange (Japan)

Migrated the data warehouse to Snowflake solo. Established the Data Analysis office. Built ML models for user clustering and AML anomaly detection.

Contract Projects

2026 — Now

LLM Engineer

AI Transformation Consultancy (Shibuya)

Partnering with business leadership to drive the AI shift of core operations. Designing AI-centric architecture (boundaries between autonomous execution and HITL), full-stack LLM backends, multi-agent systems, and reusable prompt-chain libraries. Practicing spec-driven AI-DLC development across design, coding, review, and operations.

LLM · RAG · LangChain · LangGraph · Multi-Agent · Python

2025 — 2026

BI Lead

Global Luxury Goods Group (Japan)

Leading the full BI platform migration from Power BI to Looker for luxury brand analytics. Designed LookML data models aligned with HQ standards, migrated 50+ dashboards, and built DevOps automation that reduced deployment time by 70%.

Looker · LookML · BigQuery · dbt · Monte Carlo

2025

Backend Engineer

AI Startup (Japan)

Built RAG chatbot systems processing 10K+ documents using LangChain. Developed semantic chunking tools for unstructured Excel data. Created internal LLM tooling including Slack bot, knowledge base UI, and automated deploy pipelines.

Python · LangChain · FastAPI · Azure · Terraform

2024 — 2025

Backend / Engineering Manager

Voice AI Startup (Japan)

Architected a voice AI platform handling 1,000+ daily calls. Built Terraform IaC across 5 Azure projects. Designed phone AI agent prompts and the voice synthesis backend. Led the engineering team while establishing CI/CD with GitHub Actions.

Python · TypeScript · FastAPI · Next.js · Azure · Terraform

2024

Analytics Engineer

Retail Tech SaaS (Japan)

Implemented a Data Vault 2.0 architecture for a retail analytics platform. Designed and maintained scalable data models serving 100+ business users. Optimized query performance, reducing report generation time by 60%.

dbt · Snowflake · Data Vault 2.0 · Python

Skills

Data Platform

  • Snowflake
  • dbt
  • Dagster
  • Airflow
  • Azure Data Factory

Cloud

  • Azure
  • GCP
  • AWS
  • Docker
  • Kubernetes

Languages

  • Python
  • SQL
  • Scala
  • Java
  • Spark

AI / ML

  • LLM Agents
  • RAG
  • PyTorch
  • TensorFlow
  • scikit-learn

Certifications

AWS Solutions Architect Associate
GCP Professional Cloud Architect
GCP Professional Data Engineer
GCP Professional ML Engineer
Azure AI Engineer Associate

Education

BA, English Literature

Doshisha University, Kyoto

2017 — 2021