github
/posts /about
tagged: alignment

    Reward Models: Alignment for the GPU Poor

    2023-10-16

    :: #alignment, #reward-model