CopeCart

SRE / DevOps Engineer (human)

Posted a month ago

Germany, United Kingdom

⭐ 5-10 years experience


AI Summary

The role focuses on improving deployment speed and reliability of an AWS-based platform while reducing manual operational bottlenecks. It involves implementing agentic engineering workflows to assist engineers in diagnosing and fixing production issues safely.

Who we are

CopeCart helps entrepreneurs sell digital products professionally. With our headquarters in Berlin and our Product Development & Support Center in Lviv, Ukraine, we provide an all-in-one solution for coaches, course creators, and digital entrepreneurs.

With our diverse, international team, CopeCart is already active in numerous countries and offers vendors a true full-service platform: from product uploads and sales to payment processing, accounting, and comprehensive analytics - whether for e-books, coaching programs, online seminars, or event tickets.

We take care of the paperwork so our vendors can focus on what truly matters: growing their business.

And this is where you come in. As an SRE / DevOps Engineer, you will make the platform behind this vision faster and safer to deliver, helping CopeCart continue to grow.

About the role

We are looking for an SRE / DevOps engineer who can help us make delivery faster, safer, and less dependent on manual operations work.
For the first three to six months, the focus will be reducing friction around deploying our new system, improving the reliability of our current AWS-based platform, and introducing agentic engineering practices that help engineers diagnose, fix, and safely ship changes without getting blocked on operations.
This is a hands-on production role. You will work with real infrastructure, real deployment pipelines, real incident patterns, and real engineering teams. You should be comfortable operating production-grade systems with the right checks and balances: VPN access, least-privilege permissions, runbooks, repo instructions, service maps, deployment procedures, audit trails, and rollback paths.
You will also help us use AI agents in a practical way: not as magic automation, but as supervised engineering assistants that can inspect code, read operational context, suggest fixes, generate runbooks, improve CI/CD, and help engineers debug production issues safely.

Your tasks

In the first three to six months, you will:

Improve the deployment experience for our new system.
Reduce operational bottlenecks that slow down engineering and feature delivery.
Strengthen our AWS production setup, currently based on ECS and containers.
Improve our GitHub Actions CI/CD workflows.
Work with Terraform / OpenTofu to make infrastructure safer, clearer, and easier to change.
Improve production debugging across AWS, containers, networking, Linux, and application-level issues.
Improve our observability across the three pillars: metrics, logs, and traces.
Create or improve runbooks, repo instructions, service maps, deployment guides, and operational documentation.
Introduce agentic engineering workflows that help engineers diagnose issues, propose fixes, and validate changes before they reach production.
Design safe guardrails for agent-assisted work: permissions, approval gates, auditability, sandboxing, rollback procedures, and human review.
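To make the last task more concrete, here is a minimal TypeScript sketch of one possible guardrail: an approval gate that lets agents inspect freely, rejects state-changing proposals that lack a rollback plan, holds high-risk changes for human review, and records every decision in an audit trail. All type, field, and function names are hypothetical and chosen for illustration; this is not a description of CopeCart's actual tooling.

```typescript
// Illustrative only: every type, field, and function name here is hypothetical.
type Risk = "read_only" | "low" | "high";
type Decision = "allowed" | "needs_approval" | "rejected";

interface ProposedAction {
  id: string;
  description: string;
  risk: Risk;
  rollbackPlan?: string; // expected for anything that changes state
}

interface AuditEntry {
  actionId: string;
  decision: Decision;
  reason: string;
  approvedBy?: string;
}

// In-memory audit trail; a real setup would persist this somewhere durable.
const auditLog: AuditEntry[] = [];

function evaluateAction(action: ProposedAction, approvedBy?: string): AuditEntry {
  let decision: Decision;
  let reason: string;

  if (action.risk === "read_only") {
    // Inspection never changes state, so it passes without review.
    decision = "allowed";
    reason = "inspection only, no state change";
  } else if (!action.rollbackPlan) {
    // No rollback path means the change is not reversible: reject outright.
    decision = "rejected";
    reason = "state-changing action without a rollback plan";
  } else if (action.risk === "high" && !approvedBy) {
    // High-risk changes wait for a named human reviewer.
    decision = "needs_approval";
    reason = "high-risk change requires human review";
  } else {
    decision = "allowed";
    reason = approvedBy ? `approved by ${approvedBy}` : "low risk with rollback plan";
  }

  const entry: AuditEntry = {
    actionId: action.id,
    decision,
    reason,
    ...(approvedBy ? { approvedBy } : {}),
  };
  auditLog.push(entry); // every decision is recorded, including rejections
  return entry;
}
```

In this sketch, the gate only decides and records. Executing an allowed action, and defining what counts as high risk, would sit in separate, reviewable steps.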

Current and future platform context

Today, we run on AWS and ECS. Strong AWS experience is required.

You should be comfortable with:

AWS production environments
ECS and containerized services
GitHub Actions
Terraform and/or OpenTofu
CI/CD pipelines
Linux and networking
Production debugging
Secrets, VPNs, access control, and operational security
Observability across metrics, logs, and traces

We are also considering a move toward Kubernetes, so Kubernetes experience is highly valuable. You do not need to migrate us on day one, but you should be able to reason clearly about when Kubernetes helps, when it adds complexity, and what a safe migration path would look like.

Your profile

We are looking for someone who has moved beyond simply “using ChatGPT.”
You should be able to use AI agents as part of a serious engineering workflow. That means you can:

Break a large operational problem into smaller agent-assisted tasks.
Give agents the right repo context, runbooks, constraints, and success criteria.
Use agents to inspect code, infrastructure, CI/CD, logs, and service behavior.
Verify agent output instead of trusting it blindly.
Turn useful agent output into production-grade changes.
Design workflows where agents can suggest fixes, but humans approve risky actions.
Know when not to use an agent.

Examples of useful agentic workflows include:

An agent helps diagnose a failed deployment by inspecting CI logs, ECS task events, recent commits, and application logs.
An agent proposes a Terraform or GitHub Actions change, but only as a pull request with tests and rollback notes.
An agent generates or updates a runbook after an incident, using evidence from logs, traces, metrics, and deploy history.
An agent creates a first-pass service map from repositories, infrastructure definitions, and runtime configuration.
An agent helps reproduce a production issue in a safe sandbox before any production change is made.

Required experience

Strong hands-on experience in SRE, DevOps, platform engineering, infrastructure engineering, or production operations.
Production AWS experience.
Experience with ECS and containerized services.
Experience with GitHub Actions.
Experience with Terraform and/or OpenTofu.
Experience with CI/CD, Linux, networking, and production debugging.
Strong observability skills across metrics, logs, and traces.
Ability to write production-quality code or scripts in TypeScript and Bash.
Ability to read and modify infrastructure, CI/CD, and application code.
Good judgment around production risk, automation, permissions, and rollback.

Nice to have

Kubernetes experience.
Ruby experience.
Experience building internal developer platforms or self-service infrastructure.
Experience with coding agents, AI-assisted engineering workflows, repo-level agent instructions, evals, or agent guardrails.
Experience improving incident response, deploy safety, or on-call quality.

What good looks like

After three to six months, we expect that:

Deployments are less fragile and less dependent on manual intervention.
Engineers are blocked less often by operational uncertainty.
The production environment is easier to understand through service maps, runbooks, repo instructions, and observability.
CI/CD gives faster and more trustworthy feedback.
Agentic workflows are being used in practical, safe ways to help with debugging, deployment support, and operational improvements.
We have clearer boundaries around what agents can inspect, suggest, and change.
Production changes remain controlled, reviewable, auditable, and reversible.

What we do not want

This role is probably not right for someone who:

Only wants to operate infrastructure manually.
Treats AI agents as magic instead of unreliable junior collaborators with tool access.
Optimizes for impressive demos over production safety.
Cannot explain how they would constrain, observe, test, and roll back automated workflows.
Thinks agentic engineering means removing humans from production decisions.
Wants to bypass operational controls in the name of speed.

What you can expect at CopeCart

Remote & flexibility | Work from anywhere. Flexible working hours based on trust — no rigid schedules, but real ownership and autonomy.

Room for impact | In your day-to-day work, you will face a variety of exciting challenges that allow for a high level of ownership. Got a creative idea or a suggestion for improvement? Feel free to share it. We are always open to innovation, new ideas, and thinking beyond the obvious.

Benefits | Attractive corporate benefits and a company pension plan are included.

Company fitness | The health and well-being of our employees matter to us. Through our partner EGYM Wellpass, you get access to more than 6,300 fitness and yoga studios, swimming pools, as well as CrossFit and bouldering gyms across Germany and Austria.

Team events | Remote does not mean isolated. Several times a year, we meet in different locations for real exchange, real team building, and real connection.

Hints for your application

Show us who you are. What do you want to learn with us? Where can you help us move forward? What drives you, and what keeps you going? Share this with us in a short personal message. We read every application individually and take the time to respond to you personally.

CopeCart is an equal opportunity employer. All qualified applicants will be considered without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, or disability.
