[B! データ] Open-sourcing circuit-tracing tools

記事へのコメント2件

注目コメント
新着コメント

nhd7htr https://acortar.link/VHwsOR https://acortar.link/Ta8xaK https://acortar.link/n1xTWU https://acortar.link/aGapBx

2025/05/31 リンク

misshiki “最近の解釈可能性研究において、大規模言語モデルの思考をトレースする新しい手法を導入しました。本日、この手法をオープンソース化し、誰でも私たちの研究を発展させることができるようにします。”

2025/05/30 リンク

注目コメント算出アルゴリズムの一部にLINEヤフー株式会社の「建設的コメント順位付けモデルAPI」を使用しています

規約違反を報告

Open-sourcing circuit-tracing tools

In our recent interpretability research, we introduced a new method to trace the thoughts of a la... In our recent interpretability research, we introduced a new method to trace the thoughts of a large language model. Today, we’re open-sourcing the method so that anyone can build on our research. Our approach is to generate attribution graphs, which (partially) reveal the steps a model took internally to decide on a particular output. The open-source library we’re releasing supports the generatio