「cs.CV」カテゴリーアーカイブ

Randomness of Low-Layer Parameters Determines Confusing Samples in Terms of Interaction Representations of a DNN

投稿日: 2025年2月13日作成者: jarxiv

要約この論文では、ディープニューラルネットワーク（DNN）によってエンコードさ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

投稿日: 2025年2月13日作成者: jarxiv

要約特にGPT-4Oに続く大規模な言語モデルの最近の進歩により、より多くのモダ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM, cs.SD, eess.AS, eess.IV | コメントを受け付けていません

Rapid Whole Brain Mesoscale In-vivo MR Imaging using Multi-scale Implicit Neural Representation

投稿日: 2025年2月13日作成者: jarxiv

要約目的：スキャン時間を削減しながら高信号対雑音比（SNR）を維持しながら、マ … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

PulseCheck457: A Diagnostic Benchmark for Comprehensive Spatial Reasoning of Large Multimodal Models

投稿日: 2025年2月13日作成者: jarxiv

要約大規模なマルチモーダルモデル（LMM）は、視覚的なシーンの解釈と推論におい … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

投稿日: 2025年2月13日作成者: jarxiv

要約この作業では、3Dが認識し、制御可能なテキストからビデオへの生成のための新 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs

投稿日: 2025年2月13日作成者: jarxiv

要約 AISが急速に前進し、よりエージェントになるにつれて、彼らが提起するリスク … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.CY, cs.LG | コメントを受け付けていません

SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation

投稿日: 2025年2月13日作成者: jarxiv

要約大規模なビジョン言語モデルの最近の進歩により、非常に表現力豊かで多様なベク … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards

投稿日: 2025年2月13日作成者: jarxiv

要約オープンワールド環境でのロボット操作のタスク仕様は挑戦的であり、人間の意図 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Poly-Autoregressive Prediction for Modeling Interactions

投稿日: 2025年2月13日作成者: jarxiv

要約マルチエージェント設定でエージェントの動作を予測するための簡単なフレームワ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

投稿日: 2025年2月13日作成者: jarxiv

要約 Next-Token Prediction（NTP）は、自己回帰（AR）ビ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Randomness of Low-Layer Parameters Determines Confusing Samples in Terms of Interaction Representations of a DNN

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Rapid Whole Brain Mesoscale In-vivo MR Imaging using Multi-scale Implicit Neural Representation

PulseCheck457: A Diagnostic Benchmark for Comprehensive Spatial Reasoning of Large Multimodal Models

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs

SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation

A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards

Poly-Autoregressive Prediction for Modeling Interactions

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

最近の投稿

最近のコメント

アーカイブ

カテゴリー