Harness Engineering Complete Guide — From CI/CD to Cloud Cost Management

Table of Contents

As engineering teams grow, software delivery pipelines become increasingly complex. Plugin hell in Jenkins, manually managed environment variables, inconsistent deployment procedures across teams, and no clear visibility into where cloud costs are leaking. Harness was built to solve these problems structurally.

Harness is not just another CI/CD tool. It is a unified software delivery platform that provides CI, CD, GitOps, Feature Flags, Cloud Cost Management, and Security Testing Orchestration under a single roof. This guide covers Harness from its core architecture through real-world operational tips, written from a developer’s perspective.

1. What Is Harness — Redefining CI/CD
#

The Limits of Traditional CI/CD Tools
#

Jenkins has long been the de facto standard for CI/CD. But in modern microservice and multi-cloud environments, it shows its age.

Plugin management burden: Hundreds of plugins that conflict with each other or go unmaintained
Script-heavy pipelines: Groovy-based Jenkinsfiles are difficult to maintain at scale
Poor visibility: Deployment status, failure root causes, and rollback history are hard to surface in one place
No cost control: No way to know which pipelines consume how much cloud infrastructure

GitHub Actions and GitLab CI are good choices, but they still fall short in large enterprise environments with multi-cloud, multi-team, and governance requirements.

The Harness Approach
#

Harness was founded in 2017 by Jyoti Bansal, the creator of AppDynamics. The core philosophy is: “Minimize the time developers spend on deployments, and let them focus on creating business value.”

Harness modules at a glance:

Module	Description
CI (Continuous Integration)	Build, test, and artifact management
CD (Continuous Delivery)	Deployment strategies, environment management
GitOps	Declarative infrastructure management via Git
Feature Flags	Feature toggles and A/B testing
Cloud Cost Management	Cloud spend visibility and optimization
Security Testing Orchestration	Integrated security vulnerability scanning
Chaos Engineering	Fault injection and resilience testing

2. Harness Architecture — Control Plane and Delegate
#

Understanding the Harness architecture is the starting point for effective operations.

Control Plane (SaaS)
#

The brain of Harness, the Control Plane is the SaaS layer hosted by Harness itself. Pipeline definitions, execution orchestration, dashboards, and log collection all happen here.

Web UI accessible at app.harness.io
REST API and GraphQL API
Pipeline YAML storage and versioning
Execution history and artifact registry integrations

Delegate — Your Agent Inside Customer Infrastructure
#

The Delegate is a lightweight agent deployed inside the customer’s own infrastructure (on-premises, AWS, GCP, Azure, or a Kubernetes cluster). It receives instructions from the Control Plane and carries out the actual work.

What the Delegate does:

Executes build jobs (Docker builds, tests, etc.)
Deploys to Kubernetes clusters
Accesses secrets (Vault, AWS Secrets Manager, etc.)
Communicates with services behind firewalls

Why does this architecture matter? Harness never directly accesses customer infrastructure. The Delegate makes outbound connections to the Control Plane, meaning no inbound ports need to be opened. This is a primary reason Harness is adopted in security-sensitive industries like finance and healthcare.

Self-Managed Enterprise Edition (SMEE)
#

For environments where SaaS is not an option, Harness provides SMEE, an on-premises installable version. It deploys to a Kubernetes cluster via Helm chart and supports air-gapped environments.

1
2
3
4
5
6


# Install Harness SMEE with Helm
helm repo add harness https://harness.github.io/helm-charts
helm install harness harness/harness \
  --namespace harness \
  --create-namespace \
  -f values.yaml

3. Six Key Benefits of Harness
#

Teams that adopt Harness consistently report the following benefits:

Faster deployment velocity: Pipeline automation and parallel execution shorten release cycles
Increased developer productivity: Automate repetitive infrastructure tasks and minimize context-switching related to deployments
Cloud cost optimization: Real-time visibility into wasted resources and automatic recommendations
Release flexibility: Minimize risk with canary, blue/green, and rolling deployment strategies
Security and compliance: Enforced OPA policies, RBAC, and audit trails
Unified DevOps platform: CI, CD, cost management, and security in a single platform

4. CI (Continuous Integration) — Pipeline Configuration
#

YAML-Based Pipeline Definition
#

Every Harness CI pipeline is defined in YAML. Create a .harness/ directory in your Git repository and store pipeline files there — changes are automatically reflected.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39


pipeline:
  name: Build and Test
  identifier: build_and_test
  projectIdentifier: my_project
  orgIdentifier: default
  stages:
    - stage:
        name: Build
        identifier: Build
        type: CI
        spec:
          cloneCodebase: true
          infrastructure:
            type: KubernetesDirect
            spec:
              connectorRef: my_k8s_cluster
              namespace: harness-builds
          execution:
            steps:
              - step:
                  type: Run
                  name: Run Tests
                  identifier: Run_Tests
                  spec:
                    connectorRef: dockerhub
                    image: golang:1.21
                    command: |
                      go test ./... -v -coverprofile=coverage.out
                      go tool cover -html=coverage.out -o coverage.html
              - step:
                  type: BuildAndPushDockerRegistry
                  name: Build and Push Image
                  identifier: Build_Push
                  spec:
                    connectorRef: dockerhub
                    repo: myorg/myapp
                    tags:
                      - <+pipeline.sequenceId>
                      - latest

Build Infrastructure Options
#

Harness CI supports a range of build infrastructure types:

Infrastructure Type	Description	Best For
Kubernetes	Runs on your own K8s cluster	Cost optimization, custom environments
Harness Cloud	Managed cloud runners by Harness	Quick start, no maintenance overhead
Docker	Runs via local Docker	Development and test environments
VM	Runs on virtual machines	Special hardware requirements

Test Intelligence
#

One of the most powerful features in Harness CI is Test Intelligence. It uses ML to identify and run only the tests related to the code that changed.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10


- step:
    type: RunTests
    name: Run Tests with TI
    identifier: Run_Tests_TI
    spec:
      language: Java
      buildTool: Maven
      args: test
      runOnlySelectedTests: true  # Enable Test Intelligence
      testAnnotations: "@Test,@org.junit.Test"

Real-world impact: running 300 relevant tests out of a 10,000-test suite — 70% reduction in test time.

Caching Strategy
#

Caching is key to build speed. Harness supports both layer caching and dependency caching.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19


- step:
    type: RestoreCacheGCS
    name: Restore Cache
    identifier: Restore_Cache
    spec:
      connectorRef: gcs_connector
      bucket: my-build-cache
      key: maven-{{ checksum "pom.xml" }}

- step:
    type: SaveCacheGCS
    name: Save Cache
    identifier: Save_Cache
    spec:
      connectorRef: gcs_connector
      bucket: my-build-cache
      key: maven-{{ checksum "pom.xml" }}
      sourcePaths:
        - /root/.m2/repository

5. CD (Continuous Delivery) — Deployment Strategies
#

Overview of Deployment Strategies
#

Harness CD ships with modern deployment patterns built in. Teams can use proven strategies through configuration alone, without writing custom deployment scripts.

Canary Deployment
#

Route a portion of traffic (e.g. 10%) to the new version, monitor error rates and latency, then gradually expand the rollout.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26


- step:
    type: K8sCanaryDeploy
    name: Canary Deploy
    identifier: canary_deploy
    spec:
      instanceSelection:
        type: Percentage
        spec:
          percentage: 10
      skipDryRun: false
- step:
    type: Verify
    name: Verify Canary
    identifier: verify_canary
    spec:
      type: Continuous
      spec:
        monitoredServiceRef: my_service
        healthSources:
          - prometheus
        duration: 15m
        sensitivity: MEDIUM
- step:
    type: K8sCanaryDelete
    name: Delete Canary
    identifier: delete_canary

Blue/Green Deployment
#

Maintain two identical environments. Validate the new version, then switch traffic. Enables instant rollback if issues arise.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12


- step:
    type: K8sBlueGreenDeploy
    name: Blue Green Deploy
    identifier: bg_deploy
    spec:
      skipDryRun: false
- step:
    type: K8sBGSwapServices
    name: Swap Services
    identifier: swap_services
    spec:
      skipDryRun: false

Rolling Deployment
#

Replaces existing instances sequentially. Simplest approach, but full redeployment is needed for rollback.

Environment Management
#

Harness separates Environments from Infrastructure Definitions, allowing independent management of each.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11


environment:
  name: Production
  identifier: production
  type: Production
  variables:
    - name: REPLICA_COUNT
      type: String
      value: "10"
    - name: MEMORY_LIMIT
      type: String
      value: "2Gi"

Different configuration values, secrets, and approval policies can be applied per environment, enabling safe isolation between development, staging, and production.

Automatic Rollback
#

The core safety net of Harness CD is automatic rollback. Integrate monitoring tools (Prometheus, Datadog, New Relic, etc.) and Harness will automatically roll back to the previous version when anomalous metrics are detected post-deployment.

1
2
3
4
5
6


rollbackSteps:
  - step:
      type: K8sRollingRollback
      name: Rollback
      identifier: rollback
      spec: {}

6. GitOps Workflow — Git as the Source of Truth
#

Harness GitOps workflow — from Git commit to pipeline trigger to build and deploy

What Is GitOps?
#

GitOps is the practice of managing infrastructure and application configuration declaratively in a Git repository. Git becomes the Single Source of Truth, and all changes flow through Pull Requests.

Harness GitOps is built on Argo CD, bringing powerful Kubernetes GitOps capabilities into the Harness platform for unified management.

The GitOps Workflow
#

Developer pushes application code or Kubernetes manifests to Git
Harness CI runs build/test and produces a new container image
Harness automatically updates the image tag in the Git config repository (opens a PR)
On PR merge, the Harness GitOps Agent syncs the cluster state to match Git
Any drift between cluster state and Git state is detected and auto-corrected

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21


# GitOps Application definition
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  name: my-app
  namespace: harness-gitops
spec:
  project: default
  source:
    repoURL: https://github.com/myorg/k8s-configs
    targetRevision: main
    path: apps/my-app/production
  destination:
    server: https://kubernetes.default.svc
    namespace: production
  syncPolicy:
    automated:
      prune: true
      selfHeal: true
    syncOptions:
      - CreateNamespace=true

PR Pipeline Trigger
#

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16


triggers:
  - trigger:
      name: PR Trigger
      identifier: pr_trigger
      enabled: true
      source:
        type: Webhook
        spec:
          type: Github
          spec:
            event: PullRequest
            actions:
              - Open
              - Reopen
              - Edit
            autoAbortPreviousExecutions: true

7. Feature Flags — Safe Feature Releases
#

What Are Feature Flags?
#

A Feature Flag is a technique that decouples code deployment from feature release. Code is already deployed to production, but a feature is only activated for specific user groups.

Also known as “deploying dark,” the benefits include:

Business teams control the timing of feature releases
A/B testing against specific user segments
Instantly disable a feature without a code rollback
Minimize risk through gradual rollout

Using the Harness Feature Flags SDK
#

JavaScript/Node.js:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21


import { initialize, variation } from '@harnessio/ff-javascript-client-sdk';

// Initialize SDK
const client = initialize('YOUR_SDK_KEY', {
  identifier: 'user123',
  attributes: {
    email: 'user@example.com',
    plan: 'premium',
    country: 'US'
  }
});

// Evaluate flag
await client.waitForInitialization();
const showNewDashboard = variation('new_dashboard_ui', false);

if (showNewDashboard) {
  renderNewDashboard();
} else {
  renderLegacyDashboard();
}

Java/Spring Boot:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16


@Service
public class FeatureService {

    @Autowired
    private CfClient cfClient;

    public boolean isNewCheckoutEnabled(String userId) {
        Target target = Target.builder()
            .identifier(userId)
            .name(userId)
            .attribute("plan", getUserPlan(userId))
            .build();

        return cfClient.boolVariation("new_checkout_flow", target, false);
    }
}

Targeting Rules
#

The strength of Harness Feature Flags lies in its flexible targeting rules.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26


# Feature flag configuration example
feature_flag:
  identifier: new_checkout_flow
  name: New Checkout Flow
  kind: boolean
  defaultEnabled: false

  targeting_rules:
    # Enable 100% for beta users
    - clauses:
        - attribute: group
          op: equal
          values: ["beta_users"]
      serve: true

    # Gradual rollout: 20% of premium plan users
    - clauses:
        - attribute: plan
          op: equal
          values: ["premium"]
      serve:
        distribution:
          - variation: true
            weight: 20000  # 20%
          - variation: false
            weight: 80000  # 80%

A/B Testing Integration
#

Connect feature flags with analytics tools to measure A/B test outcomes.

1
2
3
4
5
6
7


// Integration with Amplitude/Mixpanel
const variant = variation('checkout_button_color', 'blue');

analytics.track('Checkout Initiated', {
  ff_checkout_button: variant,
  userId: currentUser.id
});

8. Cloud Cost Management — Taking Control of Cloud Spend
#

The Reality of Cloud Costs
#

A common pain point for teams on AWS, GCP, or Azure: “Why is this month’s cloud bill so high?” Without per-service, per-team, and per-environment cost breakdowns, waste is impossible to prevent.

Harness Cloud Cost Management (CCM) addresses this structurally.

Cost Visibility
#

CCM enables cost analysis across multiple dimensions:

By service: Which microservice is spending how much?
By environment: Comparing dev / staging / production costs
By team: Cost allocation based on team labels
Time trends: Detecting anomalous spend spikes

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14


# CCM Perspective configuration
perspective:
  name: Backend Services
  rules:
    - viewConditions:
        - viewField:
            fieldId: label.k8s-app
            fieldName: k8s-app
            identifier: LABEL
          operator: IN
          values:
            - api-server
            - auth-service
            - payment-service

AutoStopping and Recommendations
#

AutoStopping: Automatically stops non-production resources when there is no traffic. Teams consistently report ~70% cost savings on dev and QA environments.

1
2
3
4
5
6
7
8


autostopping_rule:
  name: Dev Cluster Auto-Stop
  cloud_provider: AWS
  resources:
    - type: ECS
      cluster: dev-cluster
  idle_time_minutes: 15
  warmup_time_seconds: 30

Spot/Reserved Instance Recommendations: Analyzes usage patterns and recommends migrating from on-demand to Spot or Reserved instances.

Budget Alerts
#

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12


budget:
  name: Production Monthly Budget
  amount: 50000  # $50,000
  period: MONTHLY
  alerts:
    - threshold: 80  # Alert at 80% of budget
      channels:
        - slack: "#cloud-costs"
        - email: "devops@company.com"
    - threshold: 100  # Alert on budget breach
      channels:
        - pagerduty: "cloud-cost-oncall"

9. Security and Governance
#

OPA (Open Policy Agent) Policies
#

Harness can enforce OPA policies before and after pipeline execution, letting teams manage organizational deployment rules as code.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13


# Rego policy: Production deployments require approval
package pipeline

deny[msg] {
  input.pipeline.stages[_].stage.spec.execution.steps[_].step.type == "K8sRollingDeploy"
  input.pipeline.stages[_].stage.spec.environment.identifier == "production"
  not has_approval_step
  msg := "Production deployments require an approval step"
}

has_approval_step {
  input.pipeline.stages[_].stage.spec.execution.steps[_].step.type == "HarnessApproval"
}

RBAC (Role-Based Access Control)
#

Harness provides fine-grained permission management:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10


role:
  name: Developer
  identifier: developer
  permissions:
    - core_pipeline_view
    - core_pipeline_execute
    - core_environment_view
  # No direct production deployments
  resourceGroups:
    - non_production_resources

Secret Management
#

Harness integrates with external secret stores in addition to its own secret manager:

HashiCorp Vault: Dynamic secrets, automatic rotation
AWS Secrets Manager: Native AWS secrets
GCP Secret Manager: GCP environment support
Azure Key Vault: Azure environment support

1
2
3
4
5
6
7


# Referencing secrets in a pipeline
- step:
    type: Run
    spec:
      envVariables:
        DB_PASSWORD: <+secrets.getValue("production_db_password")>
        API_KEY: <+secrets.getValue("external_api_key")>

Audit Trail
#

All actions — pipeline executions, environment changes, secret access — are recorded in the audit log. This is essential for meeting compliance requirements such as SOC 2, ISO 27001, and HIPAA.

10. Operational Tips — Delegate Management and Pipeline Optimization
#

Delegate Operations Best Practices
#

1. High-Availability Delegate Setup

A single Delegate is a single point of failure. Run at least two Delegates and use Delegate selectors to route specific jobs to specific Delegate groups.

1
2
3
4
5
6


# Using Delegate Selectors
- step:
    type: Run
    spec:
      delegateSelectors:
        - prod-delegate  # Target a specific Delegate group

2. Delegate Resource Configuration

1
2
3
4
5
6
7
8


# K8s Delegate resource settings
resources:
  limits:
    cpu: "1"
    memory: 4Gi
  requests:
    cpu: 500m
    memory: 2Gi

3. Delegate Version Pinning

Harness supports automatic upgrades, but for production environments, it is recommended to pin to a specific version.

1
2


DELEGATE_IMAGE: harness/delegate:23.10.81202
AUTO_UPGRADE: "false"

Pipeline Optimization
#

Parallel stage execution:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14


stages:
  - parallel:
      - stage:
          name: Unit Tests
          # ...
      - stage:
          name: Lint
          # ...
      - stage:
          name: Security Scan
          # ...
  - stage:
      name: Build Docker Image
      # Runs after all parallel stages complete

Conditional execution:

1
2
3
4
5


- step:
    name: Deploy to Prod
    when:
      condition: <+trigger.targetBranch> == "main"
      stageStatus: Success

Pipeline templates:

Turn recurring pipeline patterns into templates for reuse. Dozens of microservice pipelines can be managed through a single template.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21


template:
  name: Standard Build Template
  identifier: standard_build
  versionLabel: v1
  type: Stage
  spec:
    type: CI
    spec:
      cloneCodebase: true
      execution:
        steps:
          - step:
              type: Run
              name: Build
              spec:
                command: make build
          - step:
              type: Run
              name: Test
              spec:
                command: make test

Troubleshooting Guide
#

Delegate connectivity issues:

1
2
3
4
5
6
7
8
9


# Check Delegate logs
kubectl logs -n harness-delegate deployment/harness-delegate -f

# Check Delegate pod status
kubectl get pods -n harness-delegate

# Test network connectivity
kubectl exec -n harness-delegate deploy/harness-delegate -- \
  curl -s https://app.harness.io/api/health

Debugging pipeline failures:

Select the failed step on the pipeline execution page
Click “View Logs” for detailed log output
Use “Re-run from Failed Stage” to resume from the point of failure

Common errors and solutions:

Error	Cause	Solution
`Delegate not available`	Delegate offline or overloaded	Check pod status, scale out
`Secret not found`	Wrong secret reference or insufficient permissions	Verify secret name and scope
`Image pull failed`	Container registry authentication issue	Update Connector credentials
`Timeout exceeded`	Build/deploy exceeding time limit	Increase timeout value, consider parallelization

11. Comparison with Other CI/CD Tools — When to Choose Harness
#

Tool Comparison
#

Criteria	Harness	GitHub Actions	Jenkins	ArgoCD
Configuration	YAML + UI	YAML	Groovy	YAML
Learning curve	Moderate	Low	High	Low
Enterprise features	Very strong	Moderate	Plugin-dependent	CD-focused
Cost management	Built-in	None	None	None
Feature flags	Built-in	None	None	None
Governance	OPA built-in	Limited	Plugin required	None
Pricing	Paid (free tier available)	Usage-based	Open source	Open source

When Harness Is the Right Fit
#

Large enterprises: Dozens to hundreds of microservices, multiple teams, multi-cloud environments
Compliance-critical industries: Finance, healthcare, government — sectors with strict audit and governance requirements
Cost optimization priority: Organizations that want to track and optimize cloud spend per team and per service
Platform consolidation: Teams that want CI, CD, feature flags, and cost management in one tool
Security requirements: On-premises or private cloud deployments where no inbound connections are acceptable (Delegate model)

When Harness May Not Be the Best Fit
#

Small teams or simple deployments: GitHub Actions or GitLab CI may be sufficient
Open-source requirements: Some Harness features are behind a paid tier
Maximum customization: Teams that want to control nearly everything through code, like Jenkins, may find the abstraction layer limiting

Conclusion — Modern DevOps with Harness
#

Harness is not simply a Jenkins replacement. It proposes a paradigm shift: managing the entire software delivery lifecycle through a unified platform.

It starts with CI/CD pipeline automation, extends to GitOps for infrastructure-as-code, uses Feature Flags for safe feature releases, and optimizes cloud spend with Cloud Cost Management — all within a single platform.

If you are evaluating Harness, start with the free tier. After a 14-day trial, the Developer plan provides 500 Harness Cloud build credits, 25,000 Feature Flags MAU, and access to core CI/CD — all at no cost.

The real question is not which tool to pick. It is “how much cognitive overhead can we remove from developers around deployment?” Harness answers that question at the platform level.

References: Harness Developer Hub, Harness University

1. What Is Harness — Redefining CI/CD #

The Limits of Traditional CI/CD Tools #

The Harness Approach #

2. Harness Architecture — Control Plane and Delegate #

Control Plane (SaaS) #

Delegate — Your Agent Inside Customer Infrastructure #

Self-Managed Enterprise Edition (SMEE) #

3. Six Key Benefits of Harness #

4. CI (Continuous Integration) — Pipeline Configuration #

YAML-Based Pipeline Definition #

Build Infrastructure Options #

Test Intelligence #

Caching Strategy #

5. CD (Continuous Delivery) — Deployment Strategies #

Overview of Deployment Strategies #

Canary Deployment #

Blue/Green Deployment #

Rolling Deployment #

Environment Management #

Automatic Rollback #

6. GitOps Workflow — Git as the Source of Truth #

What Is GitOps? #

The GitOps Workflow #

PR Pipeline Trigger #

7. Feature Flags — Safe Feature Releases #

What Are Feature Flags? #

Using the Harness Feature Flags SDK #

Targeting Rules #

A/B Testing Integration #

8. Cloud Cost Management — Taking Control of Cloud Spend #

The Reality of Cloud Costs #

Cost Visibility #

AutoStopping and Recommendations #

Budget Alerts #

9. Security and Governance #

OPA (Open Policy Agent) Policies #

RBAC (Role-Based Access Control) #

Secret Management #

Audit Trail #

10. Operational Tips — Delegate Management and Pipeline Optimization #

Delegate Operations Best Practices #

Pipeline Optimization #

Troubleshooting Guide #

11. Comparison with Other CI/CD Tools — When to Choose Harness #

Tool Comparison #

When Harness Is the Right Fit #

When Harness May Not Be the Best Fit #

Conclusion — Modern DevOps with Harness #