OpenAIのInstructGPT, DeepMindのSparrow, MetaのGalacticaにおける対話AIの信頼性/安全性向上のためのアプローチをまとめます Words have the power to both destroy and heal. When words are both true and kind, they can change our world. 言葉は人を傷つける事も癒す事も出来る。言葉から憎しみと偽りが消えた時、…

どこから見てもメンダコ

安全で信頼できる対話AIのためのアプローチ：InstructGPT, Sparrow, Galactica