Design and maintain an enterprise-grade Agentic AI Developer Platform to standardize the creation of AI agents across the organization. Manage the platform lifecycle, including deployment automation, monitoring, and the integration of orchestration tools and cloud services.
Join Our Data Products and Machine Learning Development Remote Startup!
We are looking for an Agentic AI DevOps Senior to join a strategic initiative focused on building, deploying, and operating an enterprise-grade Agentic AI Platform. This platform will enable developers across the organization to self-serve infrastructure, AI capabilities, prompts, and reusable agentic components through standardized, governed, and scalable development paths.
You will work closely with the Technical Lead, contributing hands-on expertise in both DevOps and Agentic AI technologies while helping drive the platform's long-term sustainability and adoption.
What We Do
- Leveraging our expertise, we build modern Machine Learning systems for demand planning and budget forecasting.
- Developing scalable data infrastructures, we enhance high-level decision-making, tailored to each client.
- Offering comprehensive Data Engineering and custom AI solutions, we optimize cloud-based systems.
- Using Generative AI, we help e-commerce platforms and retailers create higher-quality ads, faster.
- Building deep learning models, we enhance visual recognition and automation for various industries, improving product categorization, quality control, and information retrieval.
- Developing recommendation models, we personalize user experiences in e-commerce, streaming, and digital platforms, driving engagement and conversions.
Our Partnerships
- Amazon Web Services
- Astronomer
- Databricks
Our Values
- We are Data Nerds
- We are Open Team Players
- We Take Ownership
- We Have a Positive Mindset
Curious about what we're up to? Check out our case studies and dive into our blog post to learn more about our culture and the exciting projects we're working on!
Responsibilities
- Design, develop, deploy, and maintain an enterprise-wide Agentic AI Developer Platform that enables and standardizes the development of AI agents across the organization.
- Implement and operate platform components, integrations, frameworks, and supporting infrastructure required to deliver scalable and reliable agentic AI solutions.
- Collaborate with architects and technical stakeholders to translate platform designs into production-ready implementations, ensuring alignment with security, governance, and engineering standards.
- Manage the lifecycle of the platform, including deployment automation, monitoring, troubleshooting, performance optimization, and operational support.
- Evaluate, integrate, and customize existing AI platforms, orchestration tools, frameworks, and cloud-native services to build a cohesive developer experience.
- Create abstractions, reusable components, and development standards that accelerate agent development while maintaining consistency across teams.
- Provide technical guidance and mentorship to junior and mid-level engineers, promoting engineering best practices and knowledge sharing.
- Work autonomously on complex technical initiatives, taking ownership of implementation, delivery, and operational excellence with minimal supervision.
- Partner with cross-functional teams to understand business requirements and continuously improve platform capabilities, scalability, and developer productivity.
- Contribute to platform documentation, operational procedures, and governance practices to ensure long-term maintainability and adoption across the organization.
Required Skills
DevOps & Platform Engineering
- 4+ years of experience in DevOps, Platform Engineering, SRE, Backend Engineering, or similar infrastructure-focused roles.
- Strong hands-on experience with Kubernetes, including cluster administration, networking, deployments, and operational support.
- Experience with Terraform and Infrastructure as Code (IaC) practices.
- Experience designing, implementing, and maintaining CI/CD pipelines and deployment automation frameworks.
- Solid understanding of Linux administration, containerization technologies, container registries, IAM, and RBAC concepts.
- Knowledge of monitoring, observability, logging, and troubleshooting within Kubernetes environments.
- Understanding of cloud-native architectures, platform operations, and production infrastructure best practices.
Agentic AI & LLM Operations
- 1-2+ years of experience developing or maintaining solutions based on LLMs and AI agents.
- Experience with frameworks such as LangChain, LangGraph, CrewAI, or AutoGen.
- Knowledge of prompt engineering, evaluation, and monitoring of GenAI applications.
- Familiarity with observability tools such as LangSmith, LangFuse, or similar platforms.
- Experience integrating AI agents with APIs, databases, and external tools.
- Strong programming skills in Python.
- Experience building, testing, and deploying AI-powered applications in production environments.
- Understanding of RAG (Retrieval-Augmented Generation) architectures and vector databases is a plus.
- Experience working with cloud platforms (AWS, GCP, or Azure) is desirable.
Perks
- Remote-first culture: work from anywhere!
- In-Company English Lessons.
- Wellhub or sports club stipend to stay active
- AWS, DBT, Google Cloud, Azure & Databricks certifications fully covered
- Food credits via Pedidos Ya, because great work deserves great food.
- Birthday off + an extra vacation week (Mutt Week!)
- Referral bonuses: help us grow the team & get rewarded!
- Annual Mutters' Trip: an unforgettable getaway with the team!