Week 3 - Research Papers

These notes were developed using lectures/material/transcripts from the DeepLearning.AI & AWS - Generative AI with Large Language Models course

Reinforcement Learning from Human Feedback (RLHF)

Proximal Policy Optimization (PPO)

Scaling Human Feedback

Advanced Prompting Techniques

LLM-powered Application Architectures