「cs.CL」カテゴリーアーカイブ

Erasing Conceptual Knowledge from Language Models

投稿日: 2024年10月4日作成者: jarxiv

要約言語モデルにおける概念消去は、従来、包括的な評価の枠組みを欠いていたため、 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Which questions should I answer? Salience Prediction of Inquisitive Questions

投稿日: 2024年10月4日作成者: jarxiv

要約探究的な質問（人が読書をする際にする、オープンエンドで好奇心主導の質問）は … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation

投稿日: 2024年10月4日作成者: jarxiv

要約大規模言語モデル（Large Language Models: LLM）は … 続きを読む →

カテゴリー: cs.CL, cs.CV, eess.IV | コメントを受け付けていません

NL-Eye: Abductive NLI for Images

投稿日: 2024年10月4日作成者: jarxiv

要約視覚言語モデル（VLM）ベースのボットは、床が濡れていることを検知したら、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

投稿日: 2024年10月4日作成者: jarxiv

要約 Qwen2-VLは、従来のQwen-VLをさらに進化させたモデルであり、従 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Measuring and Improving Persuasiveness of Generative Models

投稿日: 2024年10月4日作成者: jarxiv

要約 LLMは、人間が消費するコンテンツを生成するワークフロー（マーケティングな … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

LLaVA-Critic: Learning to Evaluate Multimodal Models

投稿日: 2024年10月4日作成者: jarxiv

要約 LLaVA-Criticを紹介する。LLaVA-Criticは、幅広いマル … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Video Instruction Tuning With Synthetic Data

投稿日: 2024年10月4日作成者: jarxiv

要約動画ラージ・マルチモーダルモデル（LMM）の開発は、ウェブから大量の高品質 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Autoregressive Pre-Training on Pixels and Texts

投稿日: 2024年10月4日作成者: jarxiv

要約視覚情報とテキスト情報の統合は、言語モデルの進歩において有望な方向性を示し … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

投稿日: 2024年10月4日作成者: jarxiv

要約未知の環境におけるオブジェクトナビゲーションは、実世界のアプリケーションに … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.RO | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Erasing Conceptual Knowledge from Language Models

Which questions should I answer? Salience Prediction of Inquisitive Questions

MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation

NL-Eye: Abductive NLI for Images

Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

Measuring and Improving Persuasiveness of Generative Models

LLaVA-Critic: Learning to Evaluate Multimodal Models

Video Instruction Tuning With Synthetic Data

Autoregressive Pre-Training on Pixels and Texts

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

最近の投稿

最近のコメント

アーカイブ

カテゴリー