diff --git a/.image_mappings/a-deep-dive-into-ppo-for-language-models.txt b/.image_mappings/a-deep-dive-into-ppo-for-language-models.txt new file mode 100644 index 0000000..0f6d140 --- /dev/null +++ b/.image_mappings/a-deep-dive-into-ppo-for-language-models.txt @@ -0,0 +1 @@ +Pasted image 20250730232756.png|.png diff --git a/content/posts/a-deep-dive-into-ppo-for-language-models.md b/content/posts/a-deep-dive-into-ppo-for-language-models.md index 5081ca1..8b4d804 100644 --- a/content/posts/a-deep-dive-into-ppo-for-language-models.md +++ b/content/posts/a-deep-dive-into-ppo-for-language-models.md @@ -1,6 +1,6 @@ --- title: "A Deep Dive into PPO for Language Models" -date: 2025-08-03T01:47:10 +date: 2025-08-03T03:10:41 draft: false --- @@ -9,7 +9,7 @@ Large Language Models (LLMs) have demonstrated astonishing capabilities, but out You may have seen diagrams like the one below, which outlines the RLHF training process. It can look intimidating, with a web of interconnected models, losses, and data flows. -![[Pasted image 20250730232756.png]] +![](/images/a-deep-dive-into-ppo-for-language-models/.png) This post will decode that diagram, piece by piece. We'll explore the "why" behind each component, moving from high-level concepts to the deep technical reasoning that makes this process work. diff --git a/content/posts/t5-the-transformer-that-zigged-when-others-zagged-an-architectural-deep-dive.md b/content/posts/t5-the-transformer-that-zigged-when-others-zagged-an-architectural-deep-dive.md index d9ce001..fe2ae8f 100644 --- a/content/posts/t5-the-transformer-that-zigged-when-others-zagged-an-architectural-deep-dive.md +++ b/content/posts/t5-the-transformer-that-zigged-when-others-zagged-an-architectural-deep-dive.md @@ -1,6 +1,6 @@ --- title: "T5 - The Transformer That Zigged When Others Zagged - An Architectural Deep Dive" -date: 2025-08-03T01:47:10 +date: 2025-08-03T03:10:41 draft: false --- diff --git a/static/images/a-deep-dive-into-ppo-for-language-models/.png b/static/images/a-deep-dive-into-ppo-for-language-models/.png new file mode 100644 index 0000000..2d74573 Binary files /dev/null and b/static/images/a-deep-dive-into-ppo-for-language-models/.png differ