Generate, Transform, Answer: Question Specific Tool Synthesis for Tabular Data

要約

表形式の質問応答 (TQA) は、大量の半構造化データを使用した自然言語の共同推論を必要とするため、ニューラルシステムにとって困難な設定を提示します。
処理前にフィルターなどのプログラムツールを使用してデータを変換する人間とは異なり、TQA の言語モデルはテーブルを直接処理するため、テーブルのサイズが大きくなると情報が失われます。
このホワイトペーパーでは、ToolWriter を使用してクエリ固有のプログラムを生成し、それらを変換テーブルに適用するタイミングを検出して、TQA モデルの機能に合わせることを提案します。
行フィルタリングツールを生成するために ToolWriter をフォーカスすると、WikiTableQuestions と WikiSQL の最新技術が向上し、長いテーブルで最大のパフォーマンスが得られます。
ヘッドルームを調査することにより、私たちの研究は、大量の構造化データを操作するためにニューラルコンポーネントと組み合わせたプログラマティックツールの幅広い可能性を強調しています。

要約(オリジナル)

Tabular question answering (TQA) presents a challenging setting for neural systems by requiring joint reasoning of natural language with large amounts of semi-structured data. Unlike humans who use programmatic tools like filters to transform data before processing, language models in TQA process tables directly, resulting in information loss as table size increases. In this paper we propose ToolWriter to generate query specific programs and detect when to apply them to transform tables and align them with the TQA model’s capabilities. Focusing ToolWriter to generate row-filtering tools improves the state-of-the-art for WikiTableQuestions and WikiSQL with the most performance gained on long tables. By investigating headroom, our work highlights the broader potential for programmatic tools combined with neural components to manipulate large amounts of structured data.

arxiv情報

著者	Carlos Gemmell,Jeffrey Dalton
発行日	2023-03-17 17:26:56+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Generate, Transform, Answer: Question Specific Tool Synthesis for Tabular Data

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー