Multilingual Nonce Dependency Treebanks: Understanding how LLMs represent and process syntactic structure

要約

多言語のユニバーサル依存関係 (UD) コーパスのノンスツリーバンクを作成するためのフレームワークである SPUD (Semantically Perturbed Universal dependency) を紹介します。
SPUD データは構文的な引数構造を満たし、構文的な注釈を提供し、言語固有のルールによって文法性を保証します。
アラビア語、英語、フランス語、ドイツ語、ロシア語で nonce データを作成し、SPUD ツリーバンクの 2 つの使用例を示します。
まず、自己回帰 (ALM) およびマスク言語モデル (MLM) のパープレキシティスコアによって測定される、単語の共起統計に対するノンスデータの影響を調査します。
ALM スコアは MLM スコアよりもノンスデータの影響を大きく受けることがわかりました。
次に、ノンスデータが構文依存関係プローブのパフォーマンスにどのような影響を与えるかを示します。
M\’uller-Eberstein らの発見を再現します。
(2022) ノンステストデータに基づいて、MLM と ALM の両方でパフォーマンスが低下することを示しています。
オリジナルのテストデータ。
ただし、パフォーマンスの大部分は維持されており、プローブが実際にセマンティクスから独立して構文を学習していることを示唆しています。

要約(オリジナル)

We introduce SPUD (Semantically Perturbed Universal Dependencies), a framework for creating nonce treebanks for the multilingual Universal Dependencies (UD) corpora. SPUD data satisfies syntactic argument structure, provides syntactic annotations, and ensures grammaticality via language-specific rules. We create nonce data in Arabic, English, French, German, and Russian, and demonstrate two use cases of SPUD treebanks. First, we investigate the effect of nonce data on word co-occurrence statistics, as measured by perplexity scores of autoregressive (ALM) and masked language models (MLM). We find that ALM scores are significantly more affected by nonce data than MLM scores. Second, we show how nonce data affects the performance of syntactic dependency probes. We replicate the findings of M\’uller-Eberstein et al. (2022) on nonce test data and show that the performance declines on both MLMs and ALMs wrt. original test data. However, a majority of the performance is kept, suggesting that the probe indeed learns syntax independently from semantics.

arxiv情報

著者	David Arps,Laura Kallmeyer,Younes Samih,Hassan Sajjad
発行日	2023-11-13 17:36:58+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Multilingual Nonce Dependency Treebanks: Understanding how LLMs represent and process syntactic structure

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー