「cs.CL」カテゴリーアーカイブ

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

投稿日: 2025年1月31日作成者: jarxiv

要約実際のオープンドメインの質問は、特にそれらに答えるには複数の情報源からの情 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

More Expressive Attention with Negative Weights

投稿日: 2025年1月31日作成者: jarxiv

要約 COG Attencesという名前の新しい注意メカニズムを提案します。これ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

R.I.P.: Better Models by Survival of the Fittest Prompts

投稿日: 2025年1月31日作成者: jarxiv

要約トレーニングデータ品質は、最終的なモデル品質の最も重要なドライバーの1つで … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

A Video-grounded Dialogue Dataset and Metric for Event-driven Activities

投稿日: 2025年1月31日作成者: jarxiv

要約このペーパーでは、タスク用に特別に設計されたセッションベースのコンテキスト … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

投稿日: 2025年1月31日作成者: jarxiv

要約専門家レベルの医療知識と高度な推論を評価するために、非常に挑戦的で包括的な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

DreamArtist++: Controllable One-Shot Text-to-Image Generation via Positive-Negative Adapter

投稿日: 2025年1月31日作成者: jarxiv

要約 Imagenや安定した拡散モデルなどの最先端のテキストからイメージからイメ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Return of the Encoder: Maximizing Parameter Efficiency for SLMs

投稿日: 2025年1月31日作成者: jarxiv

要約大規模なデコーダーのみの言語モデルの優位性は、シーケンス処理における基本的 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

投稿日: 2025年1月31日作成者: jarxiv

要約ビジョン言語モデル（VLM）は最近、ロボットアクションを生成するために活用 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Temporal Preference Optimization for Long-Form Video Understanding

投稿日: 2025年1月31日作成者: jarxiv

要約ビデオの大規模なマルチモーダルモデル（ビデオLMMS）の大幅な進歩にもかか … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models

投稿日: 2025年1月31日作成者: jarxiv

要約大規模なビジョン言語モデル（VLM）は、幅広いタスクで顕著なパフォーマンス … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.CV | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

More Expressive Attention with Negative Weights

R.I.P.: Better Models by Survival of the Fittest Prompts

A Video-grounded Dialogue Dataset and Metric for Event-driven Activities

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

DreamArtist++: Controllable One-Shot Text-to-Image Generation via Positive-Negative Adapter

Return of the Encoder: Maximizing Parameter Efficiency for SLMs

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Temporal Preference Optimization for Long-Form Video Understanding

Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー