-
Casella & Berger, "Statistical Inference"
An introductory grad-level textbook on major concepts.
-
Gelman, Carlin, Stern & Rubin, "Bayesian Data Analysis"
A practical and well-written guide to Bayesian analysis.
No deep philosophy with mindshaking Bayesian-vs-frequentist fights.
-
Haykin, "Neural Networks, a Comprehenisve Foundation"
A really comprehensive review of general machine-learning concepts,
neural networks, radial basis function and kernel methods,
principal component analysis, support vector machines,
boosting methods, decision trees, entropy and mutual information
approaches, and stochastic machines. The last 4 of 15 chapters
cover topics on neurodynamics; I do not find these particularly
useful at the moment. The previous 11 chapters, however, are very
relevant for HEP data analysis and recommended for consumption
by every physicist. This book does a really good
job of explaining the underlying math - one does not have to be
a professional mathematician to understand it.
-
Webb, "Statistical Pattern Recognition"
There is a substantial overlap between this book
and Haykin's "Neural Networks." Webb offers additional topics
in non-parametric density estimation and discusses
linear (and quadratic) discriminant analysis, projection pursuit,
multivariate adaptive regression splines, feature selection
and clustering algorithms. Appendix A includes various measures
of dissimilarity that I found particularly amusing in connection
with the recent spark of curiosity to multidimensional
goodness-of-fit measures.
This book covers even more relevant material than Haykin's;
the disadvantage is that the discussion is very succinct, and
it may be not so easy to digest certain topics without prior
background.
-
Hastie, Tibshirani & Friedman, "The Elements of Statistical Learning"
My favorite modern introduction to machine learning methods.
A brief yet very informative overview of most methods and issues.
-
Haupt & Haupt, "Practical Genetic Algorithms"
Genetic algorithms are not covered in any other machine
learning book mentioned here. This is a good overview of the
basics.
-
Efron & Tibshirani, "An Introduction to the Bootstrap"
Bootstrap (and jackknife as a specific subclass) is a method
for evaluating bias and variance of an estimator
by resampling available data. This method should be considered
for HEP analysis whenever it is hard or impossible to obtain
a new sample, for example, Monte Carlo is too expensive or
there is no more data available.
-
D'Agostino & Stephens, "Goodness-of-Fit Techniques"
An extensive (and somewhat exhaustive) review of GOF techniques,
mostly for univariate problems.