"cs.AI" Category Archive

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

Posted on: March 19, 2025 | Author: jarxiv

Summary: Multiple spatial controls across various modalities, such as segmentation, depth, and edges … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, cs.RO | Comments off

State Space Model Meets Transformer: A New Paradigm for 3D Object Detection

Posted on: March 19, 2025 | Author: jarxiv

Summary: Iteratively refining object queries using a multi-layer transformer decoder … Continue reading →

Categories: cs.AI, cs.CV | Comments off

The Power of Context: How Multimodality Improves Image Super-Resolution

Posted on: March 19, 2025 | Author: jarxiv

Summary: Single-image super-resolution (SISR) recovers fine details and low-resolution … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments off

MusicInfuser: Making Video Diffusion Listen and Dance

Posted on: March 19, 2025 | Author: jarxiv

Summary: We introduce MusicInfuser, which, in sync with a specified music track, … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments off

LLM-Match: An Open-Sourced Patient Matching Model Based on Large Language Models and Retrieval-Augmented Generation

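Posted on: March 19, 2025 | Author: jarxiv

Summary: Patient matching means accurately identifying and matching medical records to trial eligibility criteria … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments off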

DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective

Posted on: March 19, 2025 | Author: jarxiv

Summary: Large language models (LLMs) are driven primarily by well-designed prompts … Continue reading →

Categories: cs.AI, cs.CL | Comments off

Valley: Video Assistant with Large Language model Enhanced abilitY

Posted on: March 18, 2025 | Author: jarxiv

Summary: Large language models (LLMs), with their remarkable conversational abilities, visual and … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments off

Leveraging Large Language Models for Collective Decision-Making

Posted on: March 18, 2025 | Author: jarxiv

Summary: Various … such as meeting scheduling, collaboration, and project planning … Continue reading →

Categories: cs.AI, cs.CL, cs.HC, cs.SI | Comments off

A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges

Posted on: March 18, 2025 | Author: jarxiv

Summary: Multimodal vision language models (VLMs), computer vision and natural … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | Comments off

MAP: Multi-user Personalization with Collaborative LLM-powered Agents

Posted on: March 18, 2025 | Author: jarxiv

Summary: In multi-user settings, large language models (LLMs) and LLM-driven … Continue reading →

Categories: cs.AI, cs.HC, cs.RO, I.2.1 | Comments off

