Service Mesh¶
NovaEdge includes a sidecar-free service mesh for east-west (pod-to-pod) traffic. It intercepts ClusterIP traffic transparently using nftables (preferred) or iptables NAT REDIRECT rules, authenticates services with SPIFFE-based mTLS, and enforces authorization policies -- all without injecting sidecar containers.
Overview¶
Traditional service meshes inject a sidecar proxy into every pod, adding latency, memory overhead, and operational complexity. NovaEdge takes a different approach: the node agent (DaemonSet) intercepts service traffic at the kernel level using NAT REDIRECT rules and tunnels it over mTLS HTTP/2 connections between nodes.
Key properties:
- No sidecars -- traffic interception happens at the node level via nftables/iptables NAT REDIRECT
- Opt-in per service -- annotate services with novaedge.io/mesh: "enabled" to enroll them
- SPIFFE identities -- each agent gets a workload certificate with a SPIFFE URI SAN
- mTLS everywhere -- node-to-node tunnel traffic is encrypted with TLS 1.3
- Authorization policies -- control which services can communicate using ALLOW/DENY rules
- Automatic certificate rotation -- certificates are renewed at 80% of their 24-hour lifetime
Architecture¶
flowchart TB
subgraph Node1["Node 1 (NovaEdge Agent)"]
Pod1["Pod A<br/>(client)"] -->|"ClusterIP:port"| IPT1["nftables/iptables<br/>NAT REDIRECT"]
IPT1 -->|"redirect"| TL1["Transparent Listener<br/>:15001"]
TL1 --> PD1["Protocol Detect"]
PD1 --> ST1["Service Table<br/>Lookup"]
ST1 --> TP1["Tunnel Pool<br/>(HTTP/2 mTLS client)"]
end
TP1 -->|"mTLS HTTP/2<br/>CONNECT :15002"| TS2
subgraph Node2["Node 2 (NovaEdge Agent)"]
TS2["Tunnel Server<br/>:15002"] --> AZ2["Authorizer<br/>(ALLOW/DENY)"]
AZ2 --> Pod2["Pod B<br/>(backend)"]
end
subgraph Controller["NovaEdge Controller"]
CA["Mesh CA<br/>(ECDSA P-384)"]
SB["Config Snapshot<br/>Builder"]
end
CA -.->|"Sign CSR<br/>(gRPC)"| TL1
SB -.->|"Push services +<br/>authz policies"| Node1
SB -.->|"Push services +<br/>authz policies"| Node2
style Pod1 fill:#e1f5ff
style Pod2 fill:#e1f5ff
style CA fill:#fff4e6
style AZ2 fill:#f3e5f5
style TS2 fill:#e8f5e9
style TL1 fill:#e8f5e9
Components¶
| Component | File | Port | Purpose |
|---|---|---|---|
| TPROXY Manager | internal/agent/mesh/tproxy.go | -- | Manages nftables/iptables REDIRECT rules via the RuleBackend interface |
| Transparent Listener | internal/agent/mesh/listener.go | 15001 | Accepts redirected connections |
| Protocol Detector | internal/agent/mesh/detect.go | -- | Peeks at first bytes to identify HTTP/1, HTTP/2, TLS, or opaque TCP |
| Service Table | internal/agent/mesh/manager.go | -- | Maps ClusterIP:port to backend endpoints with round-robin LB |
| Tunnel Server | internal/agent/mesh/tunnel.go | 15002 | HTTP/2 CONNECT server for incoming mTLS tunnels |
| Tunnel Pool | internal/agent/mesh/tunnel.go | -- | Persistent HTTP/2 client pool for outbound tunnels |
| TLS Provider | internal/agent/mesh/tls.go | -- | Manages TLS certificates with mutex-protected rotation |
| Certificate Requester | internal/agent/mesh/cert.go | -- | Generates CSR, requests cert from controller, auto-renews |
| Authorizer | internal/agent/mesh/authz.go | -- | Evaluates ALLOW/DENY policies per service |
| Mesh CA | internal/controller/meshca/ca.go | -- | Controller-side CA that signs workload certificates |
Enabling the Service Mesh¶
Helm Configuration¶
The mesh CA must be enabled in the controller for mTLS certificate issuance. Set the following values in your Helm chart or ArgoCD application:
controller:
meshCA:
enabled: true # Enable the embedded mesh certificate authority
keyAlgorithm: ecdsa-p256 # Key algorithm for workload certs (ecdsa-p256, ecdsa-p384, rsa-2048)
certLifetime: 24h # Lifetime of issued workload certificates
caValidity: 87600h # Validity period of the root CA certificate (~10 years)
agent:
mesh:
enabled: true # Enable mesh traffic interception on agents
When meshCA.enabled is false (default), the controller does not initialize the certificate authority and agents cannot obtain SPIFFE workload certificates. Mesh traffic interception still works but without mTLS — connections are forwarded as plain TCP.
Annotate services¶
Add the novaedge.io/mesh annotation to any Kubernetes Service you want to enroll:
apiVersion: v1
kind: Service
metadata:
name: my-backend
annotations:
novaedge.io/mesh: "enabled"
spec:
selector:
app: my-backend
ports:
- port: 8080
targetPort: 8080
When the NovaEdge controller detects this annotation, it includes the service in the InternalService list pushed to agents via ConfigSnapshot. The agent then creates NAT REDIRECT rules to intercept traffic to the service's ClusterIP.
Disable mesh for a service¶
Remove the annotation or set it to any value other than "enabled":
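For example (the value "disabled" here is arbitrary; only the exact value "enabled" enrolls a service):

apiVersion: v1
kind: Service
metadata:
  name: my-backend
  annotations:
    novaedge.io/mesh: "disabled"  # anything other than "enabled" opts the service out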
The agent will remove the corresponding REDIRECT rules on the next config reconciliation.
How Traffic Interception Works¶
NovaEdge uses two interception backends, auto-detected in priority order:
- nftables (preferred) — NAT REDIRECT rules applied atomically via netlink.
- iptables (fallback) — exec-based NAT REDIRECT rules. Used when nftables is not available.
eBPF Mesh Redirect via NovaNet
When NovaNet is installed, it can provide eBPF SK_LOOKUP-based mesh redirect that bypasses the nftables/iptables rule chain entirely. This is handled transparently by NovaNet -- NovaEdge itself no longer loads or manages eBPF programs. See eBPF Acceleration (NovaNet) for details.
Both backends use DNAT to 127.0.0.1:<listener-port> instead of plain REDIRECT for universal CNI compatibility (see Why REDIRECT instead of TPROXY below). Because the DNAT destination is a loopback address arriving on a non-loopback interface, the agent sets the net.ipv4.conf.all.route_localnet=1 sysctl at startup. Without this sysctl the kernel treats 127.0.0.0/8 as a martian address on non-loopback interfaces and silently drops the packets.
Conntrack records the original ClusterIP destination, which the listener retrieves via SO_ORIGINAL_DST.
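As an illustration, here is a minimal Go sketch of that SO_ORIGINAL_DST lookup. This is a sketch, not the actual listener.go code; the sockopt number 80 comes from linux/netfilter_ipv4.h, and reading it through GetsockoptIPv6Mreq is a common trick because the returned sockaddr_in fits in that buffer:

package mesh

import (
	"encoding/binary"
	"fmt"
	"net"

	"golang.org/x/sys/unix"
)

// SO_ORIGINAL_DST from <linux/netfilter_ipv4.h>; x/sys/unix does not export it.
const soOriginalDst = 80

// originalDst recovers the pre-NAT ClusterIP:port of a redirected connection.
func originalDst(conn *net.TCPConn) (string, error) {
	raw, err := conn.SyscallConn()
	if err != nil {
		return "", err
	}
	var dst string
	var sockErr error
	ctlErr := raw.Control(func(fd uintptr) {
		// The kernel writes a sockaddr_in (16 bytes); IPv6Mreq is simply a
		// conveniently sized buffer for it.
		mreq, e := unix.GetsockoptIPv6Mreq(int(fd), unix.SOL_IP, soOriginalDst)
		if e != nil {
			sockErr = e
			return
		}
		port := binary.BigEndian.Uint16(mreq.Multiaddr[2:4]) // sin_port
		ip := net.IPv4(mreq.Multiaddr[4], mreq.Multiaddr[5], mreq.Multiaddr[6], mreq.Multiaddr[7])
		dst = fmt.Sprintf("%s:%d", ip, port)
	})
	if ctlErr != nil {
		return "", ctlErr
	}
	return dst, sockErr
}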
Packet flow¶
sequenceDiagram
participant App as Pod A (client)
participant IPT as nftables/iptables
participant TL as Transparent Listener (:15001)
participant ST as Service Table
participant Backend as Pod B (backend)
App->>IPT: TCP SYN to 10.43.0.50:8080 (ClusterIP)
Note over IPT: PREROUTING -> mesh_redirect chain<br/>Match: -d 10.43.0.50 --dport 8080<br/>Action: REDIRECT to :15001<br/>(before kube-proxy DNAT)
IPT->>TL: Connection redirected to localhost:15001
TL->>TL: Extract original destination via SO_ORIGINAL_DST
TL->>TL: DetectProtocol (peek first 16 bytes)
TL->>ST: Lookup("10.43.0.50", 8080)
ST-->>TL: Endpoint{address: "10.42.3.15", port: 8080}
TL->>Backend: TCP connect to 10.42.3.15:8080
Note over TL,Backend: Bidirectional proxy (io.Copy)
Rules created¶
The TPROXY manager creates NAT REDIRECT rules. The agent auto-selects the backend at startup and logs "Selected TPROXY backend" backend=nftables (or iptables).
nftables (preferred)¶
Rules are applied atomically in a single netlink batch, so there is no transient window in which rules are missing or partially applied:
# Table and chain (created once at startup)
nft add table ip novaedge_mesh
nft add chain ip novaedge_mesh mesh_redirect \
'{ type nat hook prerouting priority dstnat - 1; }'
# Per-service REDIRECT rules (one per ClusterIP:port, replaced atomically)
nft add rule ip novaedge_mesh mesh_redirect \
ip protocol tcp ip daddr 10.43.0.50 tcp dport 8080 \
redirect to :15001
The chain priority dstnat - 1 (-101) ensures our REDIRECT fires before kube-proxy's DNAT rules at priority -100, preserving the original ClusterIP in conntrack.
iptables (fallback)¶
Used when the kernel does not support nftables or the nft subsystem is unavailable:
# 1. Custom chain in the nat table
iptables -t nat -N NOVAEDGE_MESH
# 2. Insert at top of PREROUTING (before kube-proxy's KUBE-SERVICES)
iptables -t nat -I PREROUTING 1 -j NOVAEDGE_MESH
# 3. Per-service REDIRECT rules (one per ClusterIP:port)
iptables -t nat -A NOVAEDGE_MESH \
-p tcp -d 10.43.0.50 --dport 8080 \
-j REDIRECT --to-ports 15001
Rules are reconciled on every config update. On shutdown, all rules are cleaned up.
Why REDIRECT instead of TPROXY¶
REDIRECT uses standard NAT conntrack to record the original destination before rewriting the port. The transparent listener retrieves the original ClusterIP:port via getsockopt(SO_ORIGINAL_DST). This approach is compatible with all CNI plugins (Flannel, Calico, Cilium, etc.) and bridge-based network topologies, unlike TPROXY which requires specific kernel socket lookup behavior that varies across network configurations. REDIRECT also eliminates the need for policy routing (ip rule/ip route), fwmark management, and conntrack bypass (notrack) rules -- resulting in a simpler, more portable implementation.
How the mTLS Tunnel Works¶
When a connection needs to reach a pod on a different node, the agent establishes an HTTP/2 CONNECT tunnel through the peer agent's tunnel server. All tunnel traffic is encrypted with mTLS using SPIFFE certificates.
sequenceDiagram
participant Client as Source Agent (Node 1)
participant Server as Dest Agent (Node 2)
participant Backend as Backend Pod
Note over Client,Server: TLS 1.3 handshake<br/>ALPN: h2<br/>Both sides present SPIFFE certs
Client->>Server: HTTP/2 CONNECT 10.42.3.15:8080<br/>X-NovaEdge-Source-ID: spiffe://cluster.local/agent/node-1<br/>X-NovaEdge-Dest-Service: my-backend.default
Server->>Server: Verify client cert (mTLS)
Server->>Server: Extract SPIFFE ID from peer cert
Server->>Server: Authorize(source, dest, method, path)
alt Authorized
Server->>Backend: TCP connect to 10.42.3.15:8080
Server-->>Client: 200 OK
Note over Client,Backend: Bidirectional data over HTTP/2 stream
else Denied
Server-->>Client: 403 Forbidden
end
Tunnel configuration¶
| Parameter | Value | Description |
|---|---|---|
| Port | 15002 | Tunnel server listen port |
| TLS version | TLS 1.3 minimum | Enforced via MinVersion: tls.VersionTLS13 |
| Client auth | RequireAndVerifyClientCert | Both sides must present valid certificates |
| ALPN | h2 | HTTP/2 protocol negotiation |
| Connect timeout | 5 seconds | Timeout for dialing backend pods |
Connection pooling¶
The TunnelPool maintains persistent HTTP/2 connections to peer agents, keyed by node address. Multiple tunnel streams are multiplexed over a single TLS connection, reducing handshake overhead for subsequent requests to the same node.
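As a sketch of what a single outbound stream looks like with golang.org/x/net/http2 (illustrative names such as peerAddr and backend; not the actual tunnel.go code):

package mesh

import (
	"crypto/tls"
	"fmt"
	"io"
	"net/http"
	"net/url"

	"golang.org/x/net/http2"
)

// openTunnel dials a peer agent and opens one CONNECT stream to a backend.
func openTunnel(peerAddr, backend string, tlsCfg *tls.Config) (io.ReadCloser, io.WriteCloser, error) {
	tlsCfg.NextProtos = []string{"h2"} // ALPN: h2
	conn, err := tls.Dial("tcp", peerAddr+":15002", tlsCfg)
	if err != nil {
		return nil, nil, err
	}
	// The real pool caches this ClientConn per node; every additional stream
	// is just another CONNECT request multiplexed over the same TLS session.
	cc, err := (&http2.Transport{}).NewClientConn(conn)
	if err != nil {
		return nil, nil, err
	}
	pr, pw := io.Pipe() // pw: bytes we send toward the backend
	req := &http.Request{
		Method: http.MethodConnect,
		URL:    &url.URL{Host: backend},
		Host:   backend,
		Header: make(http.Header),
		Body:   pr,
	}
	req.Header.Set("X-NovaEdge-Dest-Service", "my-backend.default")
	resp, err := cc.RoundTrip(req)
	if err != nil {
		return nil, nil, err
	}
	if resp.StatusCode != http.StatusOK {
		return nil, nil, fmt.Errorf("CONNECT returned status %d", resp.StatusCode)
	}
	return resp.Body, pw, nil // resp.Body: bytes arriving from the backend
}

Because the ClientConn is cached per peer node, only the first stream to a node pays the TLS handshake cost.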
Certificate Lifecycle¶
NovaEdge uses SPIFFE-compatible workload certificates for mesh identity. The certificate lifecycle is fully automatic.
sequenceDiagram
participant Agent as Node Agent
participant CR as Cert Requester
participant Controller as Controller (Mesh CA)
Note over Agent: Agent starts mesh manager
CR->>CR: Generate ECDSA P-256 key pair
CR->>CR: Build CSR with SPIFFE URI SAN<br/>spiffe://cluster.local/agent/<node-name>
CR->>Controller: gRPC: RequestMeshCertificate(CSR, nodeName)
Controller->>Controller: Verify CSR signature
Controller->>Controller: Sign with CA key (ECDSA P-384)<br/>Validity: 24 hours<br/>ExtKeyUsage: ClientAuth + ServerAuth
Controller-->>CR: Certificate + CA bundle + SPIFFE ID + Expiry
CR->>Agent: UpdateTLSCertificate(cert, key, ca, spiffeID)
Agent->>Agent: TLSProvider updates cert under write lock
Note over Agent: Tunnel server and pool use<br/>dynamic TLS callbacks (read lock)
Note over CR: Wait for 80% of lifetime (19.2h)
CR->>CR: Generate new key pair + CSR
CR->>Controller: gRPC: RequestMeshCertificate(CSR, nodeName)
Controller-->>CR: New certificate
CR->>Agent: UpdateTLSCertificate(...)
Note over Agent: Zero-downtime rotation<br/>(mutex-protected swap)
Certificate properties¶
| Property | Value |
|---|---|
| Key algorithm | ECDSA P-256 (workload), ECDSA P-384 (CA) |
| SPIFFE URI SAN | spiffe://<trust-domain>/agent/<node-name> |
| Default trust domain | cluster.local |
| Workload cert validity | 24 hours |
| Root CA validity | ~10 years |
| Renewal threshold | 80% of lifetime (19.2 hours for 24h certs) |
| Minimum renewal interval | 30 seconds (prevents tight loops) |
| CSR request timeout | 30 seconds |
| Retry delay on failure | 5 seconds |
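The renewal timing from the table is simple to express. A minimal sketch, assuming the agent holds the parsed x509.Certificate (names hypothetical):

package mesh

import (
	"crypto/x509"
	"time"
)

// scheduleRenewal sketches the documented timing: renew at 80% of the
// certificate lifetime, but never sooner than the 30-second floor.
func scheduleRenewal(cert *x509.Certificate, renew func()) *time.Timer {
	lifetime := cert.NotAfter.Sub(cert.NotBefore)    // 24h by default
	renewAt := cert.NotBefore.Add(lifetime * 8 / 10) // NotBefore + 19.2h for 24h certs
	wait := time.Until(renewAt)
	if wait < 30*time.Second {
		wait = 30 * time.Second // minimum renewal interval, prevents tight loops
	}
	return time.AfterFunc(wait, renew)
}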
Mesh CA¶
The controller runs an embedded Mesh CA (internal/controller/meshca/) that signs workload certificates:
- Root CA key: ECDSA P-384, stored in the Kubernetes Secret novaedge-mesh-ca in the nova-system namespace
- On first startup, the CA generates a new root key and persists it to the Secret
- On subsequent startups, it loads the existing key from the Secret
- Issued certificates include SPIFFE URI SANs and both ClientAuth and ServerAuth extended key usages
TLS rotation¶
The TLSProvider uses dynamic TLS callbacks (GetCertificate, GetClientCertificate, GetConfigForClient) so that certificate rotation is transparent to active connections. New connections automatically use the latest certificate without restarting the tunnel server or pool.
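A minimal sketch of the pattern, with Provider standing in for the TLSProvider in tls.go (illustrative, not the actual implementation):

package mesh

import (
	"crypto/tls"
	"crypto/x509"
	"errors"
	"sync"
)

// Provider swaps the certificate under a write lock; the TLS callbacks take
// only a read lock per handshake, so rotation never blocks the data path.
type Provider struct {
	mu   sync.RWMutex
	cert *tls.Certificate
}

func (p *Provider) Update(c *tls.Certificate) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.cert = c
}

func (p *Provider) current() (*tls.Certificate, error) {
	p.mu.RLock()
	defer p.mu.RUnlock()
	if p.cert == nil {
		return nil, errors.New("no mesh TLS certificate loaded")
	}
	return p.cert, nil
}

// ServerConfig resolves the certificate per handshake, so new connections
// pick up a rotated certificate without restarting the listener.
func (p *Provider) ServerConfig(clientCAs *x509.CertPool) *tls.Config {
	return &tls.Config{
		MinVersion: tls.VersionTLS13,
		ClientAuth: tls.RequireAndVerifyClientCert,
		ClientCAs:  clientCAs,
		NextProtos: []string{"h2"},
		GetCertificate: func(*tls.ClientHelloInfo) (*tls.Certificate, error) {
			return p.current()
		},
	}
}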
Authorization Policies¶
The mesh authorizer enforces service-level access control. Policies are pushed by the controller as part of the ConfigSnapshot.
Policy evaluation order¶
- DENY policies are evaluated first. If any DENY rule matches, the request is denied immediately.
- ALLOW policies are evaluated next. If any ALLOW rule matches, the request is allowed.
- If ALLOW policies exist but none match, the request is denied (default-deny when explicit ALLOW rules are present).
- If only DENY policies exist and none match, the request is allowed.
- If no policies exist for the destination service, the request is allowed (default-allow).
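The order above condenses to a small function. The following sketch uses illustrative Policy and Request types rather than the actual protobuf ones:

package mesh

// Policy and Request are illustrative stand-ins for the protobuf types.
type Policy struct {
	Action  string // "ALLOW" or "DENY"
	Matches func(Request) bool
}

type Request struct{ SourceID, Method, Path string }

// authorize applies the five documented rules in order.
func authorize(policies []Policy, req Request) bool {
	for _, p := range policies { // 1. any matching DENY wins immediately
		if p.Action == "DENY" && p.Matches(req) {
			return false
		}
	}
	hasAllow := false
	for _, p := range policies { // 2. then any matching ALLOW permits
		if p.Action == "ALLOW" {
			hasAllow = true
			if p.Matches(req) {
				return true
			}
		}
	}
	// 3. ALLOW policies exist but none matched: deny.
	// 4./5. only DENY policies (none matched) or no policies at all: allow.
	return !hasAllow
}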
Policy structure¶
Policies are defined per target service and include source (from) and destination (to) constraints:
MeshAuthorizationPolicy:
name: string
action: "ALLOW" | "DENY"
target_service: string # e.g., "my-backend"
target_namespace: string # e.g., "default"
rules:
- from: # Source constraints (empty = match all)
- namespaces: [...]
serviceAccounts: [...]
spiffeIds: [...] # Glob patterns
to: # Destination constraints (empty = match all)
- methods: [...] # HTTP methods (case-insensitive)
paths: [...] # Glob patterns
Source matching¶
| Field | Match type | Example |
|---|---|---|
| namespaces | Exact | ["production", "staging"] |
| serviceAccounts | Exact | ["frontend-sa"] |
| spiffeIds | Glob | ["spiffe://cluster.local/ns/*/sa/frontend-*"] |
Destination matching¶
| Field | Match type | Example |
|---|---|---|
| methods | Case-insensitive exact | ["GET", "POST"] |
| paths | Glob | ["/api/*", "/health"] |
For opaque TCP connections (non-HTTP), destination rules with methods or paths set will not match. Use source-only rules for L4 authorization.
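For example, a source-only policy for a raw TCP service (service and account names illustrative):

action: ALLOW
target_service: my-database   # illustrative name
target_namespace: default
rules:
- from:
  - namespaces: ["default"]
    serviceAccounts: ["backend-sa"]
  # no `to` block: matches any method/path, including opaque TCP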
Example: allow only frontend to access backend¶
# Pushed via ConfigSnapshot (protobuf MeshAuthorizationPolicy)
action: ALLOW
target_service: my-backend
target_namespace: default
rules:
- from:
- namespaces: ["default"]
serviceAccounts: ["frontend-sa"]
to:
- methods: ["GET", "POST"]
paths: ["/api/*"]
Example: deny a specific namespace¶
action: DENY
target_service: my-backend
target_namespace: default
rules:
- from:
- namespaces: ["untrusted"]
Troubleshooting¶
Check if mesh is active on a node¶
# nftables backend: list the novaedge_mesh table
nft list table ip novaedge_mesh
# Expected output:
# table ip novaedge_mesh {
# chain mesh_redirect {
# type nat hook prerouting priority dstnat - 1; policy accept;
# ip daddr 10.43.0.50 tcp dport 8080 redirect to :15001
# }
# }
# iptables fallback: check the NOVAEDGE_MESH chain
iptables -t nat -L NOVAEDGE_MESH -n -v
# Expected output shows per-service REDIRECT rules:
# Chain NOVAEDGE_MESH (1 references)
# pkts bytes target prot opt in out source destination
# 142 8520 REDIRECT tcp -- * * 0.0.0.0/0 10.43.0.50 tcp dpt:8080 redir ports 15001
Check the transparent listener¶
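# The transparent listener accepts redirected connections on :15001
ss -tlnp | grep 15001
# Expected: a LISTEN socket on port 15001 owned by the agent process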
Check the tunnel server¶
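# The tunnel server accepts mTLS tunnels from peer agents on :15002
ss -tlnp | grep 15002
# From another node, confirm the port is reachable (illustrative check):
nc -zv <node-ip> 15002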
Check certificate status¶
# Check agent logs for certificate lifecycle events
kubectl logs -n nova-system -l app.kubernetes.io/name=novaedge-agent | grep "mesh.*cert"
# Expected log lines:
# "Mesh certificate obtained, scheduling renewal" expiry=... lifetime=24h0m0s renew_in=19h12m0s
# "Mesh certificate applied" spiffe_id=spiffe://cluster.local/agent/node-1
# Verify the CA secret exists
kubectl get secret novaedge-mesh-ca -n nova-system
Check mesh service count¶
# Look for mesh config application in agent logs
kubectl logs -n nova-system -l app.kubernetes.io/name=novaedge-agent | grep "Mesh config applied"
# Expected: "Mesh config applied" services=5 intercept_rules=8 routing_entries=8 authz_policies=3
Connection not being intercepted¶
If traffic to a mesh-enrolled service is not being intercepted:
- Verify the service has the annotation: kubectl get svc <name> -o jsonpath='{.metadata.annotations.novaedge\.io/mesh}'
- Check that the corresponding rule exists: nft list table ip novaedge_mesh 2>/dev/null | grep <clusterIP> or iptables -t nat -L NOVAEDGE_MESH -n | grep <clusterIP>
- Verify the agent received the service in its config: check agent logs for the intercept_rules count
- Confirm the transparent listener is accepting connections: ss -tlnp | grep 15001
Tunnel connection failures¶
# Check for tunnel errors in agent logs
kubectl logs -n nova-system -l app.kubernetes.io/name=novaedge-agent | grep -i tunnel
# Common issues:
# - "no mesh TLS certificate loaded" -> cert requester has not obtained a cert yet
# - "CONNECT ... returned status 403" -> authorization policy is denying the connection
# - "Failed to dial backend" -> backend pod is unreachable from the destination node
Authorization denied unexpectedly¶
# Check authorizer debug logs (set log level to debug)
kubectl logs -n nova-system -l app.kubernetes.io/name=novaedge-agent | grep "mesh authorization"
# Expected for denials:
# "mesh authorization denied by DENY policy" policy=... source=... dest=...
# "mesh authorization denied: no ALLOW policy matched" source=... dest=...
Protocol Detection¶
The transparent listener peeks at the first 16 bytes of each intercepted connection to detect the application protocol:
| Protocol | Detection method | Handling |
|---|---|---|
| HTTP/1.x | Starts with GET, POST, PUT, etc. | L4 proxy (L7 routing planned) |
| HTTP/2 | Starts with PRI * HTTP/2 (connection preface) | L4 proxy |
| TLS | Starts with 0x16 0x03 (ClientHello) | L4 proxy |
| Opaque TCP | None of the above | L4 proxy (passthrough) |
All protocols are currently proxied as L4 TCP. HTTP-aware routing (L7 mesh) is planned for a future release.
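A minimal sketch of this classification (illustrative, not the detect.go source):

package mesh

import "bytes"

var http1Methods = [][]byte{
	[]byte("GET "), []byte("POST "), []byte("PUT "), []byte("DELETE "),
	[]byte("HEAD "), []byte("OPTIONS "), []byte("PATCH "),
}

// detect classifies a connection from its first peeked bytes.
func detect(prefix []byte) string {
	switch {
	// The HTTP/2 preface ("PRI * HTTP/2.0\r\n...") must be tested before
	// HTTP/1, since it also begins with ASCII letters.
	case bytes.HasPrefix(prefix, []byte("PRI * HTTP/2")):
		return "http2"
	case len(prefix) >= 2 && prefix[0] == 0x16 && prefix[1] == 0x03:
		return "tls" // TLS record header: handshake (0x16), version 3.x
	default:
		for _, m := range http1Methods {
			if bytes.HasPrefix(prefix, m) {
				return "http1"
			}
		}
		return "tcp" // opaque passthrough
	}
}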