The AI Interview - Master AI/ML Interviews

Task: Calculate Statistical Power for A/B Testing

Statistical power is a crucial concept in experiment design. It represents the probability of correctly rejecting a null hypothesis when there is a true effect (i.e., detecting an effect when it exists). Power analysis helps researchers determine the appropriate sample size needed for their experiments.

In this task, you will implement a function to calculate the statistical power for a two-sample z-test, commonly used in A/B testing scenarios.

Your Task:

Implement the function calculate_power(effect_size, sample_size_per_group, alpha, two_tailed) that:

Takes the following parameters:
- effect_size: Cohen's d (standardized effect size), the difference between group means divided by the pooled standard deviation
- sample_size_per_group: Number of observations in each group
- alpha: Significance level (default: 0.05)
- two_tailed: Boolean indicating if the test is two-tailed (default: True)
Returns the statistical power as a float rounded to 4 decimal places

Notes:

You may use the math library for mathematical functions
The standard normal CDF can be computed using the error function: CDF(x) = 0.5 * (1 + erf(x / sqrt(2)))
For the inverse CDF (quantile function), you may use a numerical approximation

Examples

Example 1:

Input: effect_size=0.5, sample_size_per_group=64, alpha=0.05, two_tailed=True

Output: 0.8074

Explanation: With Cohen's d=0.5 (medium effect), 64 participants per group, and alpha=0.05 (two-tailed), we calculate the non-centrality parameter as 0.5 * sqrt(64/2) = 2.83. The critical z-value for alpha=0.05 two-tailed is 1.96. Power = 1 - CDF(1.96 - 2.83) + CDF(-1.96 - 2.83) = 1 - CDF(-0.87) + CDF(-4.79) = 0.8074. This means there's about 80.7% chance of detecting the effect if it truly exists.

Starter Code

import math

def calculate_power(effect_size: float, sample_size_per_group: int, alpha: float = 0.05, two_tailed: bool = True) -> float:
    """
    Calculate statistical power for a two-sample z-test.
    
    Parameters:
    effect_size: Cohen's d (standardized effect size)
    sample_size_per_group: Number of observations per group
    alpha: Significance level (default 0.05)
    two_tailed: Whether the test is two-tailed (default True)
    
    Returns:
    Statistical power as a float rounded to 4 decimal places
    """
    # Your code here
    pass

Calculate Statistical Power for Experiment Design

Task: Calculate Statistical Power for A/B Testing

Your Task:

Notes:

Examples

Starter Code