Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG

463 pages, Paperback

Published May 21, 2024

43 people are currently reading
434 people want to read

About the author

Louis-François Bouchard

3 books, 11 followers
Louis-François Bouchard is from Montréal, Canada, and is known as What's AI on YouTube.

He focuses on making AI accessible: explaining new research and applications in simple terms, demystifying the AI “black box,” and raising awareness of the risks of using it.

Louis-François recently dropped out of his Ph.D. at Mila/Polytechnique Montréal to focus on his love of education on YouTube and as a co-founder and CTO at Towards AI. He aims to build an industry-relevant skillset for working with AI and popularizing the field.

Ratings & Reviews

Community Reviews

5 stars: 17 (38%)
4 stars: 16 (36%)
3 stars: 10 (22%)
2 stars: 1 (2%)
1 star: 0 (0%)
Displaying 1 - 6 of 6 reviews
1 review
June 22, 2025
Overall, the book serves as a strong introduction to building LLM pipelines, offering valuable insights into foundational concepts. It provides clear explanations of topics such as quantization and evaluation metrics, which are particularly useful for readers seeking to understand model optimization and performance assessment.

However, the book falls short in its coverage of production-level deployment. While it effectively addresses research-oriented topics, it lacks sufficient depth on practical aspects such as model hosting, serving infrastructure, and deployment strategies, areas that are critical for taking LLM applications into real-world production environments.

In many respects, the book reads more like an applied, research-oriented text than a practical engineering guide. Despite this, I continue to reference it regularly, particularly for its discussion of generation speed, measured in tokens per minute, as a useful proxy for understanding model latency. I would have appreciated a deeper exploration of similarly actionable metrics and implementation details tailored for production use cases.
1 review
October 6, 2024
I highly recommend Building LLMs for Production by Louis-François Bouchard and Louie Peters for anyone looking to dive into the world of large language models. The book not only provides a solid theoretical foundation but also offers practical guidance on implementing, fine-tuning, and deploying LLMs effectively. With clear explanations, real-world examples, and coverage of essential frameworks like LangChain and LlamaIndex, it’s perfect for both beginners and experienced practitioners aiming to build reliable, production-grade AI systems. This is a must-read resource for understanding and leveraging the full potential of LLMs, whether you are starting a freelancing career or just want to dive into the subject.
Josua Naiborhu
77 reviews, 4 followers
May 17, 2025
A top-notch book covering how LLMs work from the ground up. I like how the author uses simple analogies to lay out every concept in this book. You will get a better understanding if you read this book alongside Build a Large Language Model (From Scratch) by Sebastian Raschka. Additionally, the last four chapters, which walk through hands-on approaches and the necessary libraries for fine-tuning, quantization, distillation, and inference, helped me better comprehend the theoretical concepts explained in this book.
Ferhat Culfaz
270 reviews, 18 followers
October 4, 2024
Great snippets of code for deep-diving into specific recent LLM concepts from a practical perspective. Covers a wide breadth. Not for reading cover to cover, line by line, but for people who already know the basics and already have some experience using LLMs. Good for keeping up with the latest developments. Well worth going to the website to run the code on Colab and see the references.