ML Engineer (Model Deployment & AI Infrastructure)
This role is based on-site in Tokyo, Japan and will be working directly with Fred Almeida.
About MiAI Law
MiAI Law is building the future of AI-powered legal research, delivering deep NLP models and AI-driven search technology dedicated to bridging the technology gap for lawyers by delivering advanced legal research tools and custom AI solutions. The company’s mission is to empower lawyers to provide efficient, affordable, and high-quality legal services while upholding data privacy and security standards. MiAI Law offers bespoke solutions tailored to individual client needs and scalable options for diverse requirements to enhance efficiency and productivity in the legal sector. We are looking for an ML Engineer with expertise in model deployment, infrastructure scaling, and AI optimization to bridge the gap between research and production.The Role
As an ML Engineer at MiAI Law, you will:- Deploy and optimize LLMs and NLP models, ensuring real-time legal AI applications.
- Build scalable AI pipelines for efficient model training, deployment, and monitoring.
- Optimize inference efficiency by implementing ONNX, TensorRT, or Hugging Face Model Serving.
- Integrate AI models with backend services to support search, ranking, and legal document processing.
- Ensure reliability and model drift detection using automated monitoring and retraining pipelines.
Key Responsibilities
Model Deployment & Optimization- Develop efficient deployment pipelines for LLMs and NLP models.
- Optimize model performance using quantization, pruning, and knowledge distillation.
- Reduce inference latency for legal AI applications.
- Deploy and manage models in cloud environments (AWS, GCP, Azure).
- Automate deployment with Docker, Kubernetes, and CI/CD pipelines.
- Implement multi-region failover strategies for legal AI services.
- Implement logging, monitoring, and failover handling for AI models.
- Track model performance using MLflow, Prometheus, or TensorBoard.
- Automate model retraining pipelines to adapt to new legal data.
Requirements: Who You Are
Technical Skills:- 3+ years of experience in ML Engineering, AI Infrastructure, or MLOps.
- Strong experience with Python, TensorFlow, PyTorch, FastAPI, and Hugging Face.
- Proficiency in Docker, Kubernetes, CI/CD pipelines for AI deployment.
- Hands-on experience with model optimization techniques (quantization, distillation, pruning).
- Knowledge of cloud-based AI scaling on AWS, GCP, or Azure.
- Strong problem-solving skills with a scalability-first mindset.
- Ability to collaborate with AI researchers, software engineers, and legal professionals.
- Experience in cross-functional AI teams, working closely with backend and product teams.
- Passionate about building AI-driven legal research tools.
Why Join MiAI Law?
- Work at the forefront of AI + Legal Tech.
- Be part of an innovative team in Tokyo, Japan.
- Solve real-world problems using scalable AI systems.
- Build models that shape the future of legal AI.
- Be part of an exciting, innovative Legaltech AI start-up that values collaboration and initiative.
- Opportunity to work closely with senior leadership and contribute to the success of MiAI and key projects.
HOW TO APPLY
Please submit your application using the form below, and be sure to include:- Your resume
- A link to your GitHub profile
- And a brief outline of how your skills and experience align with this role
Our Culture
At MiAI Law, our culture is the foundation of our success. We’re a high-performance, innovation-driven startup where every team member feels valued and empowered.
Join Us
Are you passionate about legal tech, excited by challenges, and ready to make a real impact?
If so, MiAI Law could be the place for you to thrive.