「cs.AI」カテゴリーアーカイブ

InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models

投稿日: 2025年1月22日作成者: jarxiv

要約生成モデルの能力の向上は、言語を超えたモダリティを活用するマルチモーダル仮 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification

投稿日: 2025年1月22日作成者: jarxiv

要約医療ワークフローにおける深層学習ベースのソリューションの導入を制限する主な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

With Great Backbones Comes Great Adversarial Transferability

投稿日: 2025年1月22日作成者: jarxiv

要約マシンビジョンの自己教師あり学習 (SSL) の進歩により、表現の堅牢性と … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV, cs.LG, cs.MA | コメントを受け付けていません

Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement

投稿日: 2025年1月22日作成者: jarxiv

要約感情は、ユーザーのコンテンツ消費とオンラインエンゲージメントとの関係を仲 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC | コメントを受け付けていません

RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning

投稿日: 2025年1月22日作成者: jarxiv

要約堅牢な自動運転システムを追求する中で、現実世界のデータセットでトレーニング … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

投稿日: 2025年1月22日作成者: jarxiv

要約このペーパーでは、スクリーンショットを入力としてのみ認識し、人間のような対 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.HC | コメントを受け付けていません

DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions

投稿日: 2025年1月22日作成者: jarxiv

要約スプラッティングベースの 3D 再構成手法は、3D ガウススプラッティ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR | コメントを受け付けていません

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

投稿日: 2025年1月22日作成者: jarxiv

要約 Depth Anything は、強力な一般化能力により、単眼の深度推定に … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

投稿日: 2025年1月22日作成者: jarxiv

要約ビデオ理解における基礎モデルを評価するための、専門家レベルの包括的な複数分 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Learning segmentation from point trajectories

投稿日: 2025年1月22日作成者: jarxiv

要約私たちは、他の形式の監視ではなく、動きに基づいてビデオ内のオブジェクトをセ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models

CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification

With Great Backbones Comes Great Adversarial Transferability

Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement

RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Learning segmentation from point trajectories

最近の投稿

最近のコメント

アーカイブ

カテゴリー