"cs.CL" Category Archive

Disability Representations: Finding Biases in Automatic Image Generation

Posted: June 24, 2024 | Author: jarxiv

Abstract: Recent advances in image generation technology have enabled widespread access to AI-generated images …

Categories: cs.CL, cs.CV

AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Posted: June 24, 2024 | Author: jarxiv

Abstract: AI personal assistants deployed via robots or wearables …

Categories: cs.AI, cs.CL, cs.CV

Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis

Posted: June 24, 2024 | Author: jarxiv

Abstract: Medical Visual Question Answering (Me …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, I.2.10

Investigating the impact of 2D gesture representation on co-speech gesture generation

Posted: June 24, 2024 | Author: jarxiv

Abstract: Co-speech gestures between humans and embodied conversational agents (ECAs) …

Categories: cs.AI, cs.CL, cs.CV

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Posted: June 24, 2024 | Author: jarxiv

Abstract: Large vision-language models (LVLMs), across a variety of multimodal tasks, …

Categories: cs.AI, cs.CL, cs.CV

Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning

Posted: June 24, 2024 | Author: jarxiv

Abstract: In few-shot learning, interleaved large multimodal models (LM …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving

Posted: June 24, 2024 | Author: jarxiv

Abstract: Formal verification (FV), amid the evolving large language models' (LLMs') current …

Categories: cs.AI, cs.CL, cs.LG, cs.MS

Infusing clinical knowledge into tokenisers for language models

Posted: June 21, 2024 | Author: jarxiv

Abstract: In this study, a novel knowledge-enhanced tokenisation mechanism for clinical text processing …

Categories: cs.AI, cs.CL

Robust Few-shot Transfer Learning for Knowledge Base Question Answering with Unanswerable Questions

Posted: June 21, 2024 | Author: jarxiv

Abstract: For real-world KBQA applications, (1) a robust model (answerable …

Categories: cs.AI, cs.CL

Identifying User Goals from UI Trajectories

Posted: June 21, 2024 | Author: jarxiv

Abstract: Autonomous agents that interact with graphical user interfaces (GUIs) …

Categories: cs.AI, cs.CL
