月別アーカイブ: 2025年4月

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

投稿日: 2025年4月1日作成者: jarxiv

要約 Chain of Thound（COT）の最近の進歩により、大規模な言語モ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

投稿日: 2025年4月1日作成者: jarxiv

要約現在のビデオ生成コミュニティ内の正確なユーザー意図解釈のボトルネックに対処 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

投稿日: 2025年4月1日作成者: jarxiv

要約 UNIOCCは、カメラ画像からの占有予測（つまり、歴史的情報に基づいて将来 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MA, cs.RO | コメントを受け付けていません

Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views

投稿日: 2025年4月1日作成者: jarxiv

要約ニューラルレンダリングは、高品質の3D神経再構成と密な入力ビューと正確なポ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Consistent Subject Generation via Contrastive Instantiated Concepts

投稿日: 2025年4月1日作成者: jarxiv

要約テキストから画像への生成モデルは、多様で忠実なコンテンツを合成できますが、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection

投稿日: 2025年4月1日作成者: jarxiv

要約水中オブジェクトの検出は、海洋研究と産業安全検査にとって重要です。ただし … 続きを読む →

カテゴリー: cs.CV, cs.NE | コメントを受け付けていません

Easi3R: Estimating Disentangled Motion from DUSt3R Without Training

投稿日: 2025年4月1日作成者: jarxiv

要約 Dust3Rの最近の進歩により、静的なシーンの密なポイント雲とカメラパラメ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Evil twins are not that evil: Qualitative insights into machine-generated prompts

投稿日: 2025年4月1日作成者: jarxiv

要約言語モデル（LMS）は、予測可能な方法で、一見理解できないように見えるアル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues

投稿日: 2025年4月1日作成者: jarxiv

要約大規模な言語モデル（LLM）ベースのチャットボットは、クレジットの対話に効 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

投稿日: 2025年4月1日作成者: jarxiv

要約アクションモデルは、自律エージェントが複雑なタスクを実行できるようにするた … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

月別アーカイブ: 2025年4月

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views

Consistent Subject Generation via Contrastive Instantiated Concepts

SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection

Easi3R: Estimating Disentangled Motion from DUSt3R Without Training

Evil twins are not that evil: Qualitative insights into machine-generated prompts

EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー