davidak<p>Reinforcement Learning with Heuristic Imperatives (<a href="https://chaos.social/tags/RLHI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RLHI</span></a>) - Ep 01 - Synthesizing Scenarios</p><p><a href="https://www.youtube.com/watch?v=Q8lhWvKdQOc" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=Q8lhWvKdQO</span><span class="invisible">c</span></a></p><p><a href="https://chaos.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://chaos.social/tags/ControlProblem" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ControlProblem</span></a> <a href="https://chaos.social/tags/AIAlignment" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIAlignment</span></a> <a href="https://chaos.social/tags/HeuristicImperatives" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HeuristicImperatives</span></a> <a href="https://chaos.social/tags/LLM" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>LLM</span></a></p>