📚 Auto-publish: Add/update 2 blog posts

Generated on: Sat Aug 16 21:07:37 UTC 2025 Source: md-personal repository
2025-08-16 21:07:37 +00:00
parent a7f1af6c7f
commit bcab4969f4
2 changed files with 2 additions and 2 deletions
--- a/content/posts/a-comprehensive-guide-to-breville-barista-pro-maintenance.md
+++ b/content/posts/a-comprehensive-guide-to-breville-barista-pro-maintenance.md
@@ -1,6 +1,6 @@
 ---
 title: "A Comprehensive Guide to Breville Barista Pro Maintenance"
-date: 2025-08-16T20:48:28
+date: 2025-08-16T21:07:33
 draft: false
 ---

--- a/content/posts/a-deep-dive-into-ppo-for-language-models.md
+++ b/content/posts/a-deep-dive-into-ppo-for-language-models.md
@@ -9,7 +9,7 @@ Large Language Models (LLMs) have demonstrated astonishing capabilities, but out

 You may have seen diagrams like the one below, which outlines the RLHF training process. It can look intimidating, with a web of interconnected models, losses, and data flows.

-![](/images/a-deep-dive-into-ppo-for-language-models/64bfdb4b-678e-4bfc-8b62-0c05c243f6a9.png)
+![[Pasted image 20250816140700.png]]

 This post will decode that diagram, piece by piece. We'll explore the "why" behind each component, moving from high-level concepts to the deep technical reasoning that makes this process work.