AI Architect/Insurance/12 months (Extendable/Convert to Perm)

Argyll Scott ·www.argyllscott.com

Location Hong Kong, Hong Kong
Salary HKD 90,000 - 120,000 / year
Type Full time
Level Mid
Source Shazamme
Technology
Apply direct

Job Description: Technical AI Architect

Role Overview

We are seeking a Technical AI Architect to lead the design, scaling, and governance of our Enterprise Agentic RAG platform. You will move beyond basic semantic search to architect production-grade, end-to-end multi-agent products and high-performance retrieval systems.

This role demands deep technical mastery in Agentic RAG and LangGraph, strict attention to cost/token optimization, and the ability to ship resilient, production-grade products that enforce robust enterprise guardrails and security compliance.

Key Responsibilities

  • Production-Grade Agentic Architecture: Design and build end-to-end Agentic RAG products utilizing state-driven, multi-agent systems and cyclic workflows via LangGraph. Move from sequential pipelines to iterative, self-correcting reasoning loops (e.g., query decomposition, self-reflection, and dynamic context validation).

  • Enterprise-Scale Retrieval Systems: Architect high-precision, layout-aware semantic chunking pipelines. Implement enterprise hybrid search (combining dense vectors, sparse BM25 keyword matching, and Reciprocal Rank Fusion) backed by two-stage cross-encoder reranking layers.

  • Cost & Token Optimization: Drive LLM unit economics at scale. Implement advanced strategies for token optimization, context-window compression, semantic caching, and dynamic cost-based model routing (e.g., routing lookups to lightweight models and deep reasoning to frontier models).

  • AI Governance, Security & Guardrails: Deploy production-ready enterprise safety nets. Enforce secure tool execution environments, Source Access Control Lists (ACLs), data privacy/PII redacting, and automated LLM-as-a-judge evaluation frameworks (e.g., Ragas, TruLens) tracking Faithfulness, context precision, and latency SLAs.

  • Technical Leadership & DevOps: Lead, mentor, and establish best practices for a dedicated team of AI/ML engineers. Oversee containerization (Docker, Kubernetes) and inference server optimization (e.g., vLLM, PagedAttention) to achieve low-latency SLAs.

Technical Stack & Requirements

  • Orchestration & Agents: Expert-level mastery of LangGraph (critical), LangChain, or LlamaIndex for state tracking and tool use.

  • Data & Vector Infrastructure: Deep experience with enterprise vector databases (Pinecone, Milvus, Qdrant, pgvector) and robust extraction pipelines for complex enterprise documents (PDFs, financial tables).

  • Models & Deployment: Hands-on experience with commercial APIs (OpenAI, Anthropic) and deploying, fine-tuning, or quantization of open-source models (Llama, Mistral) via production engines like vLLM.

  • Core Engineering: Strong Python foundation, asynchronous programming, microservices (FastAPI), and observability infrastructure (LangSmith, Weights & Biases).

  • Experience: 10 years of software/data experience, minimum of 3+ years in AI enterprise architecture with a proven track record of shipping end-to-end, production-ready enterprise GenAI products.

Argyll Scott Asia is acting as an Employment Agency in relation to this vacancy.

Frequently asked questions

Who is hiring for the AI Architect/Insurance/12 months (Extendable/Convert to Perm) role?
Argyll Scott is hiring for the AI Architect/Insurance/12 months (Extendable/Convert to Perm) position, a Shazamme client. Apply directly on the employer's career site.
Where is the AI Architect/Insurance/12 months (Extendable/Convert to Perm) job located?
The AI Architect/Insurance/12 months (Extendable/Convert to Perm) role with Argyll Scott is based in Hong Kong, HK.
What does the AI Architect/Insurance/12 months (Extendable/Convert to Perm) role pay?
Argyll Scott lists the AI Architect/Insurance/12 months (Extendable/Convert to Perm) role at HKD 90,000–120,000 per year.
Is the AI Architect/Insurance/12 months (Extendable/Convert to Perm) role full-time or contract?
This is a full time position at Argyll Scott.
What experience level is the AI Architect/Insurance/12 months (Extendable/Convert to Perm) role?
The AI Architect/Insurance/12 months (Extendable/Convert to Perm) position is aimed at mid-level candidates.
How do I apply for the AI Architect/Insurance/12 months (Extendable/Convert to Perm) role at Argyll Scott?
Apply directly on Argyll Scott's career page via the Apply button on this listing. ZammeJobs links straight through to the employer's ATS — no third-party form, no resume database.
Apply direct