"cs.AI" Category Archive

Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions

Posted: February 13, 2025 | Author: jarxiv

Abstract: Non-native speakers with limited vocabulary were able to visualize them …

Categories: cs.AI, cs.CL, cs.CV, cs.IR, cs.MM

mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data

Posted: February 13, 2025 | Author: jarxiv

Abstract: Multimodal embedding models, from various modalities such as text and images, …

Categories: cs.AI, cs.CL, cs.CV

Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation

Posted: February 13, 2025 | Author: jarxiv

Abstract: Attention-based methods outperform traditional geometric deep learning (GDL) models, and spherical …

Categories: cs.AI, cs.CV

Human-Centric Foundation Models: Perception, Generation and Agentic Modeling

Posted: February 13, 2025 | Author: jarxiv

Abstract: Human understanding and generation, for modeling digital humans and humanoid embodiments, …

Categories: cs.AI, cs.CV, cs.LG, cs.MM

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Posted: February 13, 2025 | Author: jarxiv

Abstract: Multimodal large language models (MLLMs) show impressive performance in short video understanding …

Categories: cs.AI, cs.CV, cs.MM

Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion

Posted: February 13, 2025 | Author: jarxiv

Abstract: With the growing availability of longitudinal magnetic resonance imaging (MRI) datasets, …

Categories: cs.AI, cs.CV

A Novel Approach to for Multimodal Emotion Recognition : Multimodal semantic information fusion

Posted: February 13, 2025 | Author: jarxiv

Abstract: With advances in artificial intelligence and computer vision technology, multimodal emotion …

Categories: cs.AI, cs.CV

Randomness of Low-Layer Parameters Determines Confusing Samples in Terms of Interaction Representations of a DNN

Posted: February 13, 2025 | Author: jarxiv

Abstract: In this paper, encoded by a deep neural network (DNN) …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs

Posted: February 13, 2025 | Author: jarxiv

Abstract: As AIs rapidly advance and become more agentic, the risks they pose …

Categories: cs.AI, cs.CL, cs.CV, cs.CY, cs.LG

A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards

Posted: February 13, 2025 | Author: jarxiv

Abstract: Task specification for robotic manipulation in open-world environments is challenging, and human intent …

Categories: cs.AI, cs.CV, cs.RO
