Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt

要約

命令追従モデルのゼロショットパフォーマンスを強化するには、トレーニングデータセットの総数またはモデルサイズをスケーリングすることにより、大量の計算が必要になります。
この研究では、プロンプトチューニングを通じて取得したソフトプロンプトを取得することで、ゼロショットタスクの一般化においてハードプロンプトを効率的に支援できる方法を検討します。
具体的には、プロンプトチューニングを通じて各プロンプトのソフトプロンプトエンベディングをトレーニングし、プロンプトエンベディングにマップされたトレーニングインスタンスのサンプルを保存し、推論中にクエリインスタンスに最も近いトレーニングインスタンスの対応するプロンプトエンベディングを取得します。
追加パラメーターを 0.007% 追加するだけで、ソフトプロンプトの取得は、11 個のデータセットのうち 10 個のデータセットでパフォーマンスを上回り、BIG ベンチベンチマークでの T0 の平均精度を 2.39% ポイント改善することで、目に見えないタスクでの T0 のパフォーマンスを向上させます。
また、同様の回答選択形式でトレーニングされたソースエンベディングを取得することは、同様のタスクタイプでトレーニングされたソースエンベディングよりも重要であるという興味深い発見を報告します。

要約(オリジナル)

Enhancing the zero-shot performance of instruction-following models requires heavy computation, either by scaling the total number of training datasets or the model size. In this work, we explore how retrieval of soft prompts obtained through prompt tuning can efficiently assist hard prompts in zero-shot task generalization. Specifically, we train soft prompt embeddings for each prompt through prompt tuning, store the samples of the training instances mapped with the prompt embeddings, and retrieve the corresponding prompt embedding of the training instance closest to the query instance during inference. While only adding 0.007% additional parameters, retrieval of soft prompt enhances the performance of T0 on unseen tasks by outperforming it on 10 out of 11 datasets as well as improving the mean accuracy of T0 on BIG-bench benchmark by 2.39% points. Also, we report an interesting finding that retrieving source embeddings trained on similar answer choice formats is more important than those on similar task types.

arxiv情報

著者	Seonghyeon Ye,Joel Jang,Doyoung Kim,Yongrae Jo,Minjoon Seo
発行日	2023-10-16 04:57:33+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー