Creating a LLM-as-a-Judge That Drives Business Results –A step-by-step guide with my learnings from 30+ AI implementations.https://hamel.dev/blog/posts/llm-judge/Finding GPT-4’s mistakes with GPT-4CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHFhttps://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/