abed5c59ab
refactor(content): update rootCA.pem link to use local path and remove vpnclient.ovpn
...
Hugo Publish CI / build-and-deploy (push) Successful in 31s
feat(static): add rootCA.crt file for local certificate usage
2025-08-03 08:37:28 -07:00
Automated Publisher
eba296fed3
📚 Auto-publish: Add/update 1 blog posts
...
Hugo Publish CI / build-and-deploy (push) Successful in 3m5s
Generated on: Sun Aug 3 06:02:47 UTC 2025
Source: md-personal repository
2025-08-03 06:02:48 +00:00
Automated Publisher
fd19c595b6
📚 Auto-publish: Add/update 1 blog posts
...
Hugo Publish CI / build-and-deploy (push) Successful in 3m1s
Generated on: Sun Aug 3 04:20:20 UTC 2025
Source: md-personal repository
2025-08-03 04:20:20 +00:00
Automated Publisher
5253081653
📚 Auto-publish: Add/update 1 blog posts
...
Hugo Publish CI / build-and-deploy (push) Failing after 12m36s
Generated on: Sun Aug 3 03:49:57 UTC 2025
Source: md-personal repository
2025-08-03 03:49:59 +00:00
Automated Publisher
f90b459eda
📚 Auto-publish: Add/update 1 blog posts
...
Hugo Publish CI / build-and-deploy (push) Successful in 5m8s
Generated on: Sun Aug 3 03:41:10 UTC 2025
Source: md-personal repository
2025-08-03 03:41:10 +00:00
Automated Publisher
9c5d4a2102
📚 Auto-publish: Add/update 1 blog posts
...
Hugo Publish CI / build-and-deploy (push) Successful in 2m3s
Generated on: Sun Aug 3 03:29:23 UTC 2025
Source: md-personal repository
2025-08-03 03:29:23 +00:00
Automated Publisher
47dfa82e54
📚 Auto-publish: Add/update 3 blog posts
...
Hugo Publish CI / build-and-deploy (push) Has started running
Generated on: Sun Aug 3 03:28:39 UTC 2025
Source: md-personal repository
2025-08-03 03:28:39 +00:00
Automated Publisher
23b9adc43a
📚 Auto-publish: Add/update 5 blog posts
...
Hugo Publish CI / build-and-deploy (push) Successful in 34s
Generated on: Sun Aug 3 03:19:53 UTC 2025
Source: md-personal repository
2025-08-03 03:19:53 +00:00
38bbe8cbae
🗑️ (posts): remove unused image and its reference in markdown file
Hugo Publish CI / build-and-deploy (push) Successful in 42s
2025-08-02 19:24:41 -07:00
Automated Publisher
b6192ca3ca
📚 Auto-publish: Add/update 2 blog posts
...
Hugo Publish CI / build-and-deploy (push) Successful in 3m31s
Generated on: Sun Aug 3 01:47:39 UTC 2025
Source: md-personal repository
2025-08-03 01:47:39 +00:00
Automated Publisher
0b377b2189
📚 Auto-publish: Add/update 2 blog posts
...
Hugo Publish CI / build-and-deploy (push) Failing after 11m2s
Generated on: Sat Aug 2 18:07:06 PDT 2025
Source: md-personal repository
2025-08-02 18:07:06 -07:00
a3ccac4cd2
✨ (content): add new image file to posts directory
Hugo Publish CI / build-and-deploy (push) Successful in 16s
2025-08-02 15:49:50 -07:00
88cbb7efd5
✨ (posts): add deep dive into PPO for language models post
...
Hugo Publish CI / build-and-deploy (push) Successful in 14s
This commit introduces a new blog post detailing the Proximal Policy Optimization (PPO) algorithm as used in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs).
The post covers:
- The mapping of RL concepts to text generation.
- The roles of the Actor, Critic, and Reward Model.
- The use of Generalized Advantage Estimation (GAE) for stable credit assignment.
- The PPO clipped surrogate objective for safe policy updates.
- The importance of pretraining loss to prevent catastrophic forgetting.
- The full iterative training loop.
2025-08-02 15:46:24 -07:00
291f598d8c
Delete content/posts/credit_card.html
continuous-integration/drone Build is passing
2023-09-24 05:25:39 +00:00
Eric Liu
0794ee0bce
Add rootCA
2020-10-26 04:47:36 +00:00
Eric Liu
74b5002bff
Add credit card spending
2020-06-16 23:30:17 -07:00
Eric Liu
2f0990f161
initial commit
2019-02-05 05:18:26 +00:00