Conditional Probability Distribution

Probability behavior of $$X$$ given $$Y=y$$

1. Theoretical Definition

Let the joint density function of $$(X, Y)$$ be $$f(x, y)$$, and the marginal density function of $$Y$$ be $$f_Y(y)$$.

Conditional Probability Density Function:

$$f_{X|Y}(x|y) = \frac{f(x, y)}{f_Y(y)}$$

Note: Defined only when $$f_Y(y) > 0$$

Physical Interpretation: When we fix $$Y = y_0$$, the conditional distribution describes the "cross-section" cut from the joint distribution surface along the line $$y=y_0$$, scaled so that the area under this section equals 1.

Intuitive Explanation: Conditional distribution is essentially "new probability after narrowing the scope". It tells you how the probability of an event changes under specific conditions.

Applications:
Recommendation Systems: Predict the probability of a user clicking or purchasing a product given their historical behavior and profile - the core engine of e-commerce and content platforms.
Risk Assessment: Calculate the probability of loan default given a borrower's income, debt, and credit history - the foundation of financial credit approval.
Medical Diagnosis: Infer the probability of a disease given patient symptoms, age, and test results - assisting physicians in differential diagnosis.
Predictive Maintenance: Predict the probability of equipment failure given historical data and current status - key to predictive maintenance.
Natural Language Processing: Predict the most likely next word or phrase given preceding context - the fundamental principle driving language models, machine translation, and chatbots.

2. Results Under Normal Distribution

If $$(X, Y) \sim N(\mu_x, \mu_y, \sigma_x^2, \sigma_y^2, \rho)$$, then given $$Y=y$$:

$$X|Y=y \sim N\left(\mu_x + \rho\frac{\sigma_x}{\sigma_y}(y-\mu_y), \sigma_x^2(1-\rho^2)\right)$$

Observation: Conditional mean changes linearly with $$y$$, while conditional variance decreases.

Blue: Conditional Distribution $f_{X|Y}(x|y)$ Gray dashed: Marginal Distribution $f_X(x)$ (reference)
Conditional Mean (Regression)

0.00

Conditional Std Dev

0.00