You will design and build production-grade agent systems, including planning, tool execution, and reasoning pipelines. You will also own the reliability, observability, and evaluation frameworks that keep agent performance high and drive continuous model improvement.