The first several chapters do an excellent job of providing some structure to the study of machine learning, covering math fundamentals (including information theory), general problems and principles (there is “no free lunch” in principle, but we fortunately live in a “smooth world,” so we need not work with all possible distributions) and outlining the general anatomy of a machine learning algorithm.