「cs.CL」カテゴリーアーカイブ

A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders

投稿日: 2024年9月26日作成者: jarxiv

要約スパースオートエンコーダ (SAE) は、大規模言語モデル (LLM) … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Towards Enhancing Linked Data Retrieval in Conversational UIs using Large Language Models

投稿日: 2024年9月26日作成者: jarxiv

要約最近、さまざまなドメインで大規模言語モデル (LLM) が広く採用されてい … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

Can We Count on LLMs? The Fixed-Effect Fallacy and Claims of GPT-4 Capabilities

投稿日: 2024年9月26日作成者: jarxiv

要約このペーパーでは、LLM 機能の評価について検討します。いくつかの決定論 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection

投稿日: 2024年9月26日作成者: jarxiv

要約オープン語彙オブジェクト検出 (OVD) モデルは、その広範なトレーニング … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

OmniBench: Towards The Future of Universal Omni-Language Models

投稿日: 2024年9月26日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) の最近の進歩は、多様なモダリ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

A Controlled Study on Long Context Extension and Generalization in LLMs

投稿日: 2024年9月24日作成者: jarxiv

要約広範なテキストの理解とコンテキスト内の学習には、完全なドキュメントのコンテ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Autoregressive + Chain of Thought = Recurrent: Recurrence’s Role in Language Models’ Computability and a Revisit of Recurrent Transformer

投稿日: 2024年9月24日作成者: jarxiv

要約 Transformer アーキテクチャは、さまざまな言語モデリングタスク … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Contextual Breach: Assessing the Robustness of Transformer-based QA Models

投稿日: 2024年9月23日作成者: jarxiv

要約コンテキスト質問応答モデルは、現実世界のシナリオで一般的に観察される、入力 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

ExtractGPT: Exploring the Potential of Large Language Models for Product Attribute Value Extraction

投稿日: 2024年9月23日作成者: jarxiv

要約電子商取引プラットフォームでは、ファセット製品検索や属性ベースの製品比較な … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Gender Representation and Bias in Indian Civil Service Mock Interviews

投稿日: 2024年9月23日作成者: jarxiv

要約この論文は 3 つの重要な貢献をしています。まず、インドの公務員候補者の … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders

Towards Enhancing Linked Data Retrieval in Conversational UIs using Large Language Models

Can We Count on LLMs? The Fixed-Effect Fallacy and Claims of GPT-4 Capabilities

HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection

OmniBench: Towards The Future of Universal Omni-Language Models

A Controlled Study on Long Context Extension and Generalization in LLMs

Autoregressive + Chain of Thought = Recurrent: Recurrence’s Role in Language Models’ Computability and a Revisit of Recurrent Transformer

Contextual Breach: Assessing the Robustness of Transformer-based QA Models

ExtractGPT: Exploring the Potential of Large Language Models for Product Attribute Value Extraction

Gender Representation and Bias in Indian Civil Service Mock Interviews

最近の投稿

最近のコメント

アーカイブ

カテゴリー