Uff, I've finally managed to get this one through - and this was quite a journey.
Why did I reach for it? Based on recommendations of some knowledgeable people who called it a best book to upgrade your practical skills on building modern AI systems.
What did I expect? I call it a "Karpathy" level experience ;D Andrej is able to explain complex concepts in a way that still requires a lot of effort, but it reduces significantly the "steepness" of the learning curve.
Did I get it? Not really, but it doesn't mean it's a bad book:
- it doesn't assume much when it comes to your starting level (with Gen AI, LLMs), so you're taught concepts like finetuning, RAG or recall
- it indeed does not focus on a single aspect of using LLMs (e.g., prompting) - there are decent chapters on finetuning (even - very decent), quality assurance, inference optimisation, etc.
- sometimes very unevenly it jumps between pretty high and very deep level - I don't think the author is bragging, it's just that she was no sure who the book is for: the folks who just start with building simple things with AI, or the ones who know all the basics bits and pieces and now need to dive really deep
- it's clear that the author is very knowledgeable, I won't question that - but I missed a bit of "practical decision-making" when it comes to going beyond the basic arch (which is covered); that may be feel I'm not getting much beyond what is already freely available online in abundance
Good book, or even a very good one (4.3-4.5), but it didn't rock my world. Yes, there were some sub-chapters definitely beyond my level of knowledge (which I didn't follow straight to the very end), but still - I think I've expected (after the enthusiastic reviews) slightly "more". Maybe this more is more "focus" or more "practicability". And no, unfortunately it wasn't "Karpathy level" of explaining ;) But I'll definitely check out what that author releases next - this one was a good start.