Latest Blogs
-
Distributed RL Training Step
I was learning how to do distributed RL training, saw Karpathy posting about it, and thought, why not write a complete blog post about what I learned? So here it is.
-
RLHF
Before starting, it’s advisable to first complete David Silver’s course on RL and read Lilian’s notes on RL, which explain/provide notes on David’s cour...
-
Mixture Of Experts
Image Source: https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-mixture-of-experts
-
Using a Genetic Algorithm for Weight Optimization
BackStory