Latency Optimization for Multi-Step Agent Pipelines

Hard
Agents

Multi-step agent pipelines are often slow because steps run sequentially even when their dependencies would allow concurrency. This question focuses on maximizing parallelism and caching to reduce end-to-end latency.

Task

Implement LatencyOptimizer that:

  1. Analyzes a DAG of pipeline steps to identify parallelism opportunities.
  2. Computes the critical path (bottleneck sequence).
  3. Executes steps in parallel groups with dependency ordering.
  4. Caches cacheable step results with TTL.
  5. Supports speculative execution for probable inputs.
  6. Reports latency gains vs sequential baseline.
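
Requirements 1 and 3 hinge on grouping steps into topological "levels": every step in a level depends only on steps from earlier levels, so a whole level can run concurrently. A minimal sketch of that grouping (the helper name `parallel_groups` and the plain-dict input are illustrative, not part of the required API):

```python
def parallel_groups(deps_by_id):
    # deps_by_id: step_id -> list of dependency step_ids (must form a DAG).
    indegree = {s: len(d) for s, d in deps_by_id.items()}
    dependents = {s: [] for s in deps_by_id}
    for step, deps in deps_by_id.items():
        for d in deps:
            dependents[d].append(step)

    # Kahn's algorithm, but flushed level-by-level: each "ready" wave
    # becomes one parallel group.
    ready = [s for s, n in indegree.items() if n == 0]
    groups = []
    while ready:
        groups.append(sorted(ready))
        next_ready = []
        for step in ready:
            for dependent in dependents[step]:
                indegree[dependent] -= 1
                if indegree[dependent] == 0:
                    next_ready.append(dependent)
        ready = next_ready
    return groups
```

For the DAG in Example 1 this yields `[['fetch', 'search'], ['analyze']]`: both roots in one group, then the join step.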

Non-Functional Requirements

  • Achieve at least 50% latency reduction on non-critical-path steps.
  • A cache hit should add less than 1 ms of overhead.
  • Speculative results must be invalidated if actual input differs.
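
The invalidation requirement means speculative results are single-use guesses: a precomputed result may be returned only when the actual input matches the input it was speculated for, and stale guesses should be discarded. One possible shape (the `SpeculationStore` class is a hypothetical helper, not part of the required interface):

```python
class SpeculationStore:
    """Holds results pre-computed for probable inputs of one step."""

    def __init__(self):
        self._results = {}

    def store(self, inputs, result):
        # Freeze the input dict into a hashable, order-independent key.
        self._results[tuple(sorted(inputs.items()))] = result

    def take(self, actual_inputs):
        # Return the speculative result only on an exact input match;
        # either way, drop all remaining guesses -- they are now stale.
        key = tuple(sorted(actual_inputs.items()))
        result = self._results.pop(key, None)
        self._results.clear()
        return result
```

`take` returning `None` signals a speculation miss, and the caller falls back to executing the step normally.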

Constraints

  • Step dependencies form a DAG (no cycles).
  • Cache key: (step_id, sorted_args_hash).
  • Critical path: sequence with maximum total estimated latency.
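
The `(step_id, sorted_args_hash)` cache key must be deterministic regardless of argument ordering. A minimal sketch, assuming JSON-serializable args (the helper name `cache_key` is illustrative):

```python
import hashlib
import json

def cache_key(step_id, args):
    # sort_keys makes equivalent dicts serialize identically, so
    # {'a': 1, 'b': 2} and {'b': 2, 'a': 1} produce the same key.
    payload = json.dumps(args, sort_keys=True, default=str)
    args_hash = hashlib.sha256(payload.encode()).hexdigest()
    return f"{step_id}:{args_hash}"
```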

Examples

Example 1:
Input:
steps = [
    PipelineStep('search', search_fn, [], estimated_latency_ms=300),
    PipelineStep('fetch', fetch_fn, [], estimated_latency_ms=200),
    PipelineStep('analyze', analyze_fn, ['search', 'fetch'], estimated_latency_ms=400),
]
optimizer.analyze_dag(steps)
Output: ExecutionPlan(parallel_groups=[[search,fetch],[analyze]], estimated_total_ms=700, critical_path=['search','analyze'])
Explanation: Search and fetch run in parallel (300ms), then analyze runs (400ms). Critical path: search→analyze=700ms.
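
The critical path in the example is the dependency chain with the maximum summed estimated latency. It can be found with a memoized longest-path walk over the DAG; a sketch under the same `(step_id, deps, latency_ms)` shape as the example (the helper name `critical_path` is illustrative):

```python
def critical_path(steps):
    # steps: list of (step_id, dep_ids, latency_ms) tuples forming a DAG.
    by_id = {s[0]: s for s in steps}
    finish = {}  # earliest finish time assuming unlimited parallelism
    pred = {}    # predecessor on the longest chain ending at each step

    def finish_time(sid):
        if sid in finish:
            return finish[sid]
        _, deps, latency = by_id[sid]
        slowest = max(deps, key=finish_time, default=None)
        start = finish_time(slowest) if slowest else 0.0
        pred[sid] = slowest
        finish[sid] = start + latency
        return finish[sid]

    for step_id in by_id:
        finish_time(step_id)

    # Walk back from the latest-finishing step to recover the path.
    end = max(finish, key=finish.get)
    path, cur = [], end
    while cur:
        path.append(cur)
        cur = pred[cur]
    return finish[end], path[::-1]
```

On the example this returns `(700.0, ['search', 'analyze'])`, matching the expected plan.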

Starter Code

import asyncio
from typing import List, Dict, Any, Optional, Callable, Tuple
from dataclasses import dataclass, field
import time
from functools import lru_cache

@dataclass
class PipelineStep:
    step_id: str
    fn: Callable
    dependencies: List[str] = field(default_factory=list)
    is_llm_call: bool = False
    estimated_latency_ms: float = 100.0
    cacheable: bool = False
    cache_ttl: float = 300.0  # seconds

@dataclass
class ExecutionPlan:
    parallel_groups: List[List[PipelineStep]]
    estimated_total_ms: float
    critical_path: List[str]

class LatencyOptimizer:
    def __init__(self):
        self._cache: Dict[str, Tuple[Any, float]] = {}
        self.execution_history: List[Dict] = []

    def analyze_dag(self, steps: List[PipelineStep]) -> ExecutionPlan:
        # TODO: Build execution plan with parallel groups
        # Identify critical path (longest dependency chain)
        pass

    async def execute_optimized(self, steps: List[PipelineStep], inputs: Dict) -> Dict:
        # TODO: Execute in parallel groups, pass results forward
        pass

    def _get_cached(self, step_id: str, args: Dict) -> Optional[Any]:
        pass

    def _set_cached(self, step_id: str, args: Dict, result: Any, ttl: float) -> None:
        pass

    def speculative_execute(self, step: PipelineStep, probable_inputs: List[Dict]) -> Dict:
        # TODO: Pre-execute step for most likely inputs
        pass

    def report_optimization_gains(self) -> Dict:
        # Compare sequential vs optimized execution times
        pass
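
For `execute_optimized`, one common pattern is to await each parallel group with `asyncio.gather`, feeding accumulated results forward to later groups. A minimal sketch, decoupled from the starter classes (the helper name `run_groups` and the `fns[step](results)` calling convention are assumptions for illustration):

```python
import asyncio

async def run_groups(groups, fns):
    # groups: list of lists of step ids, in dependency order.
    # fns: step_id -> async callable taking the results dict so far.
    results = {}
    for group in groups:
        # All steps in a group are independent, so they run concurrently.
        outputs = await asyncio.gather(*(fns[step](results) for step in group))
        for step, out in zip(group, outputs):
            results[step] = out
    return results
```

Each group's wall-clock cost is its slowest member, which is how the example's `search`/`fetch` pair costs 300 ms rather than 500 ms.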