Status Updates from Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto
Showing 1-30 of 1,173
Pawan
is on page 460 of 552
Parameters such as α, n, γ, λ can be picked by trial and error (n ≈ 5 in most cases) instead of grid search, as they seem fairly independent. Methods can be specific to the problem (which may be prediction or control, among others), with tradeoffs between latency and memory. LLMs help with method selection, discretisation, and reduction of the state-action space to explore the feasibility of ideas.
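The "trials instead of grid search" idea above can be sketched as a coordinate-wise sweep: if the parameters are roughly independent, tuning each one while holding the others fixed needs far fewer runs than the full grid. Everything here is illustrative; `evaluate` is a hypothetical stand-in for "run the agent and return its average return", replaced by a dummy separable objective so the sketch is self-contained.

```python
# Sketch: coordinate-wise parameter trials, assuming rough independence.
# `evaluate` is a hypothetical placeholder for a real training/evaluation run.

def evaluate(alpha, n, gamma, lam):
    # Dummy separable objective with one interior optimum per parameter;
    # a real version would train the method and return mean return.
    return -((alpha - 0.1) ** 2 + (n - 5) ** 2 / 25
             + (gamma - 0.95) ** 2 + (lam - 0.9) ** 2)

candidates = {
    "alpha": [0.01, 0.05, 0.1, 0.5],
    "n":     [1, 3, 5, 10],        # n ~ 5 in most cases
    "gamma": [0.9, 0.95, 0.99],
    "lam":   [0.5, 0.8, 0.9, 1.0],
}

best = {"alpha": 0.05, "n": 3, "gamma": 0.9, "lam": 0.8}  # initial guesses
for name, values in candidates.items():       # one sweep per parameter,
    scores = []                               # others held at current best
    for v in values:
        trial = dict(best, **{name: v})
        scores.append((evaluate(**trial), v))
    best[name] = max(scores)[1]               # keep the best value found

print(best)  # 4 + 4 + 3 + 4 = 15 trials instead of 4 * 4 * 3 * 4 = 192
```

With four parameters this costs the sum of the candidate counts rather than their product, which is exactly the saving independence buys.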
Had to relearn backgammon.
— Apr 05, 2026 01:06PM
Pawan
is on page 357 of 552
This subject is rife with higher algebra (especially progressions and probability). The notation can cause mental havoc. Part III looks like interesting prose.
— Apr 04, 2026 10:58AM
Pawan
is on page 335 of 552
Tabular methods can be understood without programming by creating a scenario with a small state-action space, initialising the policy intuitively, and then iterating through the algorithms by hand, updating the variable values. This may not be possible for the more complex algorithms in the second part, especially off-policy methods, online search, and function approximation.
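The hand-iteration described above can be checked in a few lines: below is a minimal sketch of iterative policy evaluation on a hypothetical two-state MDP (states A and B, a fixed deterministic policy), small enough that every sweep can be verified with pencil and paper. The MDP itself is invented for illustration, not taken from the book.

```python
# Iterative policy evaluation on a tiny hypothetical 2-state MDP.
# Under the fixed policy: from A, reward 0 and move to B; from B, reward 1
# and stay in B. Each sweep applies V(s) <- r + gamma * V(s').

gamma = 0.9
transitions = {"A": ("B", 0.0), "B": ("B", 1.0)}  # s -> (next state, reward)

V = {"A": 0.0, "B": 0.0}                 # intuitive initialisation
for sweep in range(200):
    delta = 0.0
    for s, (s2, r) in transitions.items():
        v_new = r + gamma * V[s2]        # one tabular backup, doable by hand
        delta = max(delta, abs(v_new - V[s]))
        V[s] = v_new
    if delta < 1e-6:                     # stop once the sweep barely changes V
        break

print(V)  # converges toward the analytic fixed point V(B) = 10, V(A) = 9
```

Doing the first two or three sweeps on paper and comparing against the loop is exactly the kind of no-programming check the update describes; the fixed point V(B) = 1/(1 - γ) = 10 and V(A) = γ·V(B) = 9 confirms the arithmetic.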
It gets increasingly interesting. Need to revisit a few chapters.
— Mar 14, 2026 07:39PM
Pawan
is on page 255 of 552
Not as difficult to read now. Going to try model-free implementations in JAX, but first, brushing up on my kite equations using plotting libraries.
— Jan 11, 2026 08:57PM
Pawan
is on page 140 of 552
I had read a different edition of this book for coursework until a few chapters back; it was available online for free, found through none other than a Google search. My mind was too shallow back then to estimate values for bootstrapping. Time to dig in.
— Jan 05, 2026 09:16PM