Through the Bayesian approach, it would be possible to construct an optimal classifier if both the prior probabilities and the class-conditional densities
were known perfectly. Typically, such information is rarely available, and the adopted approach is to build a classifier from a set of examples (the training set).
To model , a parametric approach is typically employed, and whenever possible, this distribution is aligned with that of a Gaussian or with spline functions.
The most commonly used estimation techniques are Maximum-Likelihood (ML) and Bayesian Estimation, which, although differing in their underlying logic, yield nearly identical results. The Gaussian distribution is typically an appropriate model for most pattern recognition problems.
Let us examine the quite common case in which the probabilities of the various classes follow a multivariate Gaussian distribution with mean
and covariance matrix
. The optimal Bayesian classifier is given by
Paolo medici