Each section begins by covering the relevant hardware architecture in detail, which is something I've seen other books skimp out on, despite the fact that you cannot program effectively in assembly language if you don't understand the architecture you're programming for. The best part about this book is that it covers the newer Intel technologies (SSE and AVX) making it essential for HPC and general algorithmic optimization.