We run both ECS and EKS in production. This is which one we use for what, and the actual decision criteria rather than the marketing comparison.

AWS ECS vs EKS: A Practical Comparison

We have services running on both ECS and EKS. The two platforms can both do the basic job: run containerized services with rolling deploys, autoscaling, and load balancing. They differ in operational shape, ecosystem, and where the rough edges show up. This is the comparison from someone who's run both for a few years.

What ECS gets right

ECS is the AWS-native container platform. Things it does well:

Tight AWS integration. ECS task roles are IAM roles attached to tasks. CloudWatch Logs come for free. Service discovery via Cloud Map is built in. ALB integration is one config block.

Lower operational burden. No control plane to manage. No upgrades to schedule. ECS just exists; you deploy services to it.

Fargate. Run tasks without managing nodes. You pay per-task CPU/memory. We use this for bursty workloads where node management would be overhead.

Predictable cost. Per-task pricing on Fargate, EC2 costs on EC2 mode. No "control plane fee" beyond what's bundled with Fargate. Easy to model and explain to finance.

What ECS gets wrong

The flip side:

The ecosystem is small. Outside AWS, ECS-specific tools barely exist. There's no "ECS operator" community, no equivalent of Helm charts, no rich set of CRDs.

Configuration is verbose. Task definitions are JSON with lots of repeated boilerplate. We use Terraform modules to abstract this; the underlying definitions are still verbose.
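To give a feel for the boilerplate, here is a minimal sketch of the kind of wrapper we mean, written in Python rather than Terraform for brevity. The function name `fargate_task_definition`, the service name, the image URL, and the log group naming are all hypothetical; the dict keys follow the ECS task definition format. Nothing here calls AWS, it only builds the JSON.

```python
import json


def fargate_task_definition(name, image, port, cpu=256, memory=512,
                            region="us-east-1", task_role_arn=None,
                            execution_role_arn=None):
    """Build a minimal Fargate task definition as a plain dict (hypothetical helper)."""
    definition = {
        "family": name,
        "networkMode": "awsvpc",                  # Fargate tasks use awsvpc networking
        "requiresCompatibilities": ["FARGATE"],
        "cpu": str(cpu),                          # task-level CPU units (1024 = 1 vCPU)
        "memory": str(memory),                    # task-level memory in MiB
        "containerDefinitions": [
            {
                "name": name,
                "image": image,
                "essential": True,
                "portMappings": [{"containerPort": port, "protocol": "tcp"}],
                "logConfiguration": {
                    "logDriver": "awslogs",
                    "options": {
                        "awslogs-group": f"/ecs/{name}",   # assumed naming scheme
                        "awslogs-region": region,
                        "awslogs-stream-prefix": name,
                    },
                },
            }
        ],
    }
    # Roles are optional here only to keep the sketch short; a real Fargate
    # task using awslogs or a private registry needs an execution role.
    if task_role_arn:
        definition["taskRoleArn"] = task_role_arn
    if execution_role_arn:
        definition["executionRoleArn"] = execution_role_arn
    return definition


task_def = fargate_task_definition("billing-api", "example.com/billing-api:1.4.2", 8080)
print(json.dumps(task_def, indent=2))
```

Three arguments in, roughly thirty lines of JSON out, and this is before health checks, secrets, or sidecars. That ratio is the verbosity we're abstracting.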

Service-to-service communication is awkward. Service discovery via Cloud Map works but requires DNS-based resolution. There's no native concept of a service mesh or mTLS without bolt-on tools.

Limited scheduler flexibility. ECS's scheduler is OK but not as sophisticated as Kubernetes. The closest equivalent to pod affinity/anti-affinity is task placement strategies and constraints (binpack, spread, distinctInstance), which are coarser.
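A rough side-by-side of how each platform expresses "don't stack replicas on one host", written as Python dicts so both fit in one snippet. The app label `billing-api` is a made-up example; the field names follow the ECS service API and the Kubernetes pod spec as we understand them.

```python
# ECS (EC2 launch type): ordered strategies plus a small set of constraints.
# Fargate does not take placement strategies at all.
ecs_placement = {
    "placementStrategy": [
        {"type": "spread", "field": "attribute:ecs.availability-zone"},
        {"type": "binpack", "field": "memory"},
    ],
    "placementConstraints": [{"type": "distinctInstance"}],
}

# Kubernetes: anti-affinity is keyed on pod labels and a topology domain,
# so the rule can refer to other workloads, not just this one.
k8s_affinity = {
    "affinity": {
        "podAntiAffinity": {
            "requiredDuringSchedulingIgnoredDuringExecution": [
                {
                    "labelSelector": {"matchLabels": {"app": "billing-api"}},
                    "topologyKey": "kubernetes.io/hostname",
                }
            ]
        }
    }
}
```

The ECS version can say "one task of this service per instance". The K8s version can also say "never next to pods with label X" or "prefer the same zone as the cache", which is where the extra flexibility shows up.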

Vendor lock-in. ECS task definitions don't translate to anything else. If you wanted to move off AWS, you'd be rewriting deployment configs.

What EKS gets right

EKS is AWS's managed Kubernetes:

The ecosystem is huge. Helm charts, operators, CRDs for everything. New tools target Kubernetes first; ECS support is an afterthought or absent.

Portable. Workloads on EKS run on GKE / AKS / on-prem K8s with minor modifications. Real exit ramp.

Mature scheduling. Pod affinity, taints, tolerations, topology constraints. Can express complex placement requirements.

Rich cross-cluster tooling. Service mesh, GitOps, and observability all have mature tools that work across K8s clusters.

The K8s API itself. Once you know it, it's a powerful abstraction across many clouds and on-prem.

What EKS gets wrong

The cost:

Operational complexity. Even with a managed control plane, you operate worker nodes, networking (CNI tuning, load balancers, ingress), storage classes, RBAC, namespaces, etc. There's a lot to know.

Upgrade cadence. A new K8s minor version ships roughly every 4 months, and AWS ends standard support for each version after ~14 months. We spend ~1 quarter per year on EKS / addon upgrades.

Resource overhead. Kubelet, kube-proxy, CNI agents, monitoring agents all run on every node. ~20-30% of node resources are platform overhead. ECS has less.

Cost is harder to attribute. With multi-tenant pods on shared nodes, "what does this service cost" requires kubecost or similar tooling. ECS Fargate's per-task pricing is simpler.
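To show why attribution needs tooling, here is a deliberately simplified sketch of splitting one shared node's cost across pods by their resource requests. This is not how kubecost computes anything; the 50/50 CPU-to-memory weighting, the pod names, and the $0.40/hr node price are assumptions for illustration. It also assumes at least one pod with non-zero requests.

```python
def attribute_node_cost(node_hourly_cost, pod_requests):
    """Split one node's hourly cost across pods by requested resources.

    pod_requests maps pod name -> (cpu_cores, memory_gib).
    Each pod's share is the average of its CPU share and its memory share,
    so idle capacity is spread across pods in proportion to their requests.
    """
    total_cpu = sum(cpu for cpu, _ in pod_requests.values())
    total_mem = sum(mem for _, mem in pod_requests.values())
    costs = {}
    for pod, (cpu, mem) in pod_requests.items():
        share = 0.5 * (cpu / total_cpu) + 0.5 * (mem / total_mem)
        costs[pod] = round(node_hourly_cost * share, 4)
    return costs


# Hypothetical pods sharing one node that costs $0.40/hr.
requests = {
    "checkout": (1.0, 2.0),
    "search": (2.0, 2.0),
    "notifier": (1.0, 4.0),
}
print(attribute_node_cost(0.40, requests))
```

Even this toy version has to pick a weighting and decide who pays for idle capacity. On Fargate the bill already arrives per task, so none of those decisions exist.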

Failure modes are deeper. When something is wrong with K8s, the fault could be in the API server, etcd, kubelet, scheduler, controllers, CNI, your CRD, your operator, or your pod. Many places to look.

What we run where

Our actual split:

On ECS Fargate:

Cron jobs and one-off tasks (no node management, scale to zero)
Internal tools that don't need K8s features
A few legacy services we haven't migrated
Workloads with strict tenant isolation requirements (Fargate gives microVM isolation per task)

On EKS:

Customer-facing apps (the bulk of our compute)
Workloads using K8s-native operators (ArgoCD, cert-manager, sealed-secrets, etc.)
Anything that benefits from rich scheduling
Stateful workloads that need PVs, StatefulSets, operators

The split is roughly: EKS for the "platform" workloads where we want consistency across many services and rich tooling; ECS Fargate for the workloads where simplicity wins.

Specific decision points

When a new service comes up, we walk this decision tree:

Does it need K8s-specific features? (Operators, CRDs, complex scheduling, service mesh) → EKS.
Is it a periodic / cron-triggered task? → ECS Fargate. Per-task billing is cheaper than holding K8s capacity.
Is it a one-off or experimental? → ECS Fargate. Less plumbing to set up.
Is it a customer-facing service that fits the typical web-service pattern? → EKS. Goes in our existing platform.
Does the team running it know K8s? → If no, ECS Fargate is friendlier for non-K8s teams.

The decision tree handles ~95% of cases. The remaining 5% are judgment calls.
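The tree above can be sketched as a small function. One assumption on our part: the questions are evaluated top to bottom and the first match wins. The `Service` fields and the `choose_platform` name are illustrative, not something we actually run.

```python
from dataclasses import dataclass


@dataclass
class Service:
    needs_k8s_features: bool = False      # operators, CRDs, complex scheduling, mesh
    is_periodic: bool = False             # cron-triggered or scheduled
    is_experimental: bool = False         # one-off or prototype
    is_customer_facing_web: bool = False  # typical web-service pattern
    team_knows_k8s: bool = True


def choose_platform(svc: Service) -> str:
    """Walk the questions in order; the first one that applies decides."""
    if svc.needs_k8s_features:
        return "EKS"
    if svc.is_periodic or svc.is_experimental:
        return "ECS Fargate"
    if svc.is_customer_facing_web:
        return "EKS"
    if not svc.team_knows_k8s:
        return "ECS Fargate"
    return "judgment call"


print(choose_platform(Service(is_periodic=True)))
print(choose_platform(Service(is_customer_facing_web=True)))
print(choose_platform(Service()))
```

The last branch is the honest part: a service that trips none of the questions lands in the judgment-call bucket rather than getting a forced answer.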

Cost comparison: not as simple as it looks

A common claim: "ECS Fargate is more expensive than EKS." It's complicated.

For a single service running 24/7 at known capacity:

EKS on EC2 spot: cheapest. ~$0.06/hr for the equivalent of 1 vCPU/2GB.
EKS on EC2 on-demand: ~$0.12/hr.
ECS Fargate: ~$0.20/hr.
ECS Fargate Spot: ~$0.06/hr (when capacity is available).
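Hourly numbers are hard to feel, so here is the same list turned into monthly figures. The rates are the approximate ones quoted above, not current AWS list prices, and the 730 hours/month figure is the usual 8760 / 12 average.

```python
HOURS_PER_MONTH = 730  # average hours in a month (8760 / 12)

# Approximate hourly rates from the list above, for ~1 vCPU / 2 GB.
hourly_rates = {
    "EKS on EC2 spot": 0.06,
    "EKS on EC2 on-demand": 0.12,
    "ECS Fargate": 0.20,
    "ECS Fargate Spot": 0.06,
}


def monthly_cost(hourly_rate, hours=HOURS_PER_MONTH):
    """Cost of running continuously for the given number of hours."""
    return round(hourly_rate * hours, 2)


for option, rate in hourly_rates.items():
    print(f"{option:22s} ${monthly_cost(rate):>7.2f}/month")
```

For one always-on service that is roughly $44 versus $88 versus $146 a month. Real money across a fleet, noise for a single internal tool.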

For bursty / scale-to-zero workloads:

EKS: you're paying for capacity even when nothing's running.
ECS Fargate: you pay only when tasks run.
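The break-even point is simple arithmetic. A minimal sketch, assuming the EKS capacity is held only for this workload (no sharing with other pods) and using the approximate on-demand rates from earlier in this section:

```python
def break_even_duty_cycle(always_on_rate, per_use_rate):
    """Fraction of time a task must run before always-on capacity is cheaper."""
    return always_on_rate / per_use_rate


def cheaper_option(duty_cycle, always_on_rate=0.12, per_use_rate=0.20):
    """Compare effective hourly cost at a given duty cycle (0.0 to 1.0)."""
    fargate_effective = per_use_rate * duty_cycle  # billed only while running
    return "ECS Fargate" if fargate_effective < always_on_rate else "EKS"


print(break_even_duty_cycle(0.12, 0.20))  # EKS on-demand vs Fargate
print(cheaper_option(0.10))               # task runs 10% of the time
print(cheaper_option(0.90))               # task runs 90% of the time
```

With these rates, Fargate wins as long as the task runs less than about 60% of the time. A nightly job that runs for an hour is nowhere near that line.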

For a fleet of 50+ services with varying load:

EKS amortizes node costs across many services. Per-service cost is lower.
ECS Fargate's per-task cost adds up.

Our calculation: at our scale (~600 pods on EKS, ~80 ECS tasks), EKS is ~30% cheaper than Fargate would be for the workloads we run on EKS, while ECS Fargate is ~5x cheaper than EKS would be for the bursty/intermittent workloads. That's why we use both.

Migration considerations

Going from ECS to EKS:

Task definitions don't translate; rewrite as Deployments/Services.
Cloud Map discovery → K8s Services.
IAM task roles → IRSA (IAM Roles for Service Accounts).
ALB integration → AWS Load Balancer Controller (different model, similar end result).
Logs → already in CloudWatch, just need different filtering.

Effort: ~1-2 days per service for the conversion, longer if the service has unusual patterns.
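Most of that time goes into the parts that don't map mechanically. The mechanical part looks roughly like the sketch below, which maps the first container of a task definition to a Deployment manifest. The helper name `container_to_deployment`, the sample task definition, and the choice to copy task-level CPU/memory into resource requests are our assumptions; IAM, discovery, and load balancing are left out on purpose.

```python
def container_to_deployment(task_def, replicas=2):
    """Map the first container of an ECS task definition to a Deployment dict."""
    container = task_def["containerDefinitions"][0]
    name = container["name"]
    cpu_units = int(task_def.get("cpu", 256))      # ECS: 1024 units = 1 vCPU
    memory_mib = int(task_def.get("memory", 512))  # ECS: MiB
    labels = {"app": name}
    k8s_container = {
        "name": name,
        "image": container["image"],
        "ports": [
            {"containerPort": p["containerPort"]}
            for p in container.get("portMappings", [])
        ],
        "env": [
            {"name": e["name"], "value": e["value"]}
            for e in container.get("environment", [])
        ],
        "resources": {
            "requests": {
                "cpu": f"{cpu_units * 1000 // 1024}m",  # CPU units -> millicores
                "memory": f"{memory_mib}Mi",
            }
        },
    }
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name, "labels": labels},
        "spec": {
            "replicas": replicas,
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {"containers": [k8s_container]},
            },
        },
    }


# Hypothetical task definition, trimmed to the fields the sketch reads.
sample_task_def = {
    "family": "billing-api",
    "cpu": "256",
    "memory": "512",
    "containerDefinitions": [
        {
            "name": "billing-api",
            "image": "example.com/billing-api:1.4.2",
            "portMappings": [{"containerPort": 8080, "protocol": "tcp"}],
            "environment": [{"name": "LOG_LEVEL", "value": "info"}],
        }
    ],
}

deployment = container_to_deployment(sample_task_def)
print(deployment["spec"]["template"]["spec"]["containers"][0]["resources"])
```

Image, ports, env, and sizing carry over almost one to one. The days go into the rest of the list: service accounts and IRSA, ingress, and whatever the service assumed about Cloud Map names.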

Going from EKS to ECS:

Deployments/Services → Task definitions and ECS services.
Loss of K8s-specific features (CRDs, operators) — usually need to find AWS-native replacements.
IAM mapping is generally simpler.

We've done a few migrations in each direction. ECS → EKS is more common (services outgrowing ECS's simplicity). EKS → ECS is rarer but happens for specific workloads (often Fargate for bursty isolated tasks).

What about Kubernetes on ECS?

Some teams experiment with running self-managed K8s on top of ECS-managed EC2 instances. This is unusual, and we don't do it.

The point of ECS is "you don't need K8s." The point of EKS is "you do need K8s." Trying to combine them muddles the choice.

What I'd tell a team choosing

If you have one service or a small handful, use ECS. The lower setup cost wins. EKS is overkill until you have a fleet.

If you have 20+ services or anticipate K8s ecosystem dependencies, use EKS. Operator/CRD/service-mesh tooling exists on K8s; on ECS you'd be rebuilding from scratch.

If your team doesn't know K8s, ECS Fargate is friendlier. Less to learn, less to break.

If portability matters, use EKS. ECS task definitions don't go anywhere else.

Use both if they fit. Fargate for bursty/cron workloads; EKS for the platform. Don't force one to do everything.

The ECS vs EKS choice is mostly a fit question, not a quality question. Both work. Match the platform to the operational shape of your team and the workload patterns you have. The teams that struggle are the ones that picked one for ideological reasons (we love AWS / we love K8s) without checking the actual fit.


About Kiril Urbonas