4. Statistics

4.1 Intro

We estimate probability models parameters θ\theta from data D\mathcal{D}. Most methods are optimizations of the form:

θ^=arg minθL(θ)\begin{equation} \hat{\theta}=\argmin_{\theta} \mathcal{L}(\theta) \end{equation}