「cs.AI」カテゴリーアーカイブ

Visual Acoustic Fields

投稿日: 2025年4月1日作成者: jarxiv

要約オブジェクトはヒットすると異なる音を生成し、人間はその外観と材料特性に基づ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

投稿日: 2025年4月1日作成者: jarxiv

要約パラメーター生成は、ニューラルネットワーク開発の新しいパラダイムとして浮上 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

投稿日: 2025年4月1日作成者: jarxiv

要約 Chain of Thound（COT）の最近の進歩により、大規模な言語モ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

投稿日: 2025年4月1日作成者: jarxiv

要約現在のビデオ生成コミュニティ内の正確なユーザー意図解釈のボトルネックに対処 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

投稿日: 2025年4月1日作成者: jarxiv

要約 UNIOCCは、カメラ画像からの占有予測（つまり、歴史的情報に基づいて将来 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MA, cs.RO | コメントを受け付けていません

Evil twins are not that evil: Qualitative insights into machine-generated prompts

投稿日: 2025年4月1日作成者: jarxiv

要約言語モデル（LMS）は、予測可能な方法で、一見理解できないように見えるアル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

投稿日: 2025年4月1日作成者: jarxiv

要約アクションモデルは、自律エージェントが複雑なタスクを実行できるようにするた … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning

投稿日: 2025年4月1日作成者: jarxiv

要約エントロピー誘導シーケンス重み付け（EGSW）を導入します。これは、強化学 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Multimodal Object Detection using Depth and Image Data for Manufacturing Parts

投稿日: 2025年3月31日作成者: jarxiv

要約製造には、多様な種類の製造部品とコンポーネントの正確なピッキングと取り扱い … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback

投稿日: 2025年3月31日作成者: jarxiv

要約言語条件付きのロボット操作の最近の進歩により、ロボットが人間のコマンドから … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Visual Acoustic Fields

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

Evil twins are not that evil: Qualitative insights into machine-generated prompts

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning

Multimodal Object Detection using Depth and Image Data for Manufacturing Parts

Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback

最近の投稿

最近のコメント

アーカイブ

カテゴリー