「cs.AI」カテゴリーアーカイブ

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

投稿日: 2025年6月6日作成者: jarxiv

要約推論のための大規模な強化学習（RL）の最近の進歩にもかかわらず、高性能の推 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Exploring Diffusion Transformer Designs via Grafting

投稿日: 2025年6月6日作成者: jarxiv

要約モデルアーキテクチャの設計には、オペレーター（注意、畳み込みなど）や構成（ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Rectified Point Flow: Generic Point Cloud Pose Estimation

投稿日: 2025年6月6日作成者: jarxiv

要約ペアワイズポイントクラウド登録とマルチパート形状アセンブリを単一の条件付き … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning

投稿日: 2025年6月6日作成者: jarxiv

要約具体化されたAIおよびデジタルコンテンツの作成には、現実的な3D屋内シーン … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Refer to Anything with Vision-Language Prompts

投稿日: 2025年6月6日作成者: jarxiv

要約最近の画像セグメンテーションモデルは、画像を視覚エンティティの高品質のマス … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

macOSWorld: A Multilingual Interactive Benchmark for GUI Agents

投稿日: 2025年6月6日作成者: jarxiv

要約グラフィカルユーザーインターフェイス（GUI）エージェントは、コンピュータ … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

投稿日: 2025年6月6日作成者: jarxiv

要約既存の統一モデルは、ビジョン言語の理解とテキストからイメージの生成において … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

投稿日: 2025年6月6日作成者: jarxiv

要約自律運転は、モビリティ、交通安全、交通効率の重要な進歩を約束しますが、補強 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Biased by Design: Leveraging Inherent AI Biases to Enhance Critical Thinking of News Readers

投稿日: 2025年6月6日作成者: jarxiv

要約このペーパーでは、大規模な言語モデル（LLMS）を使用したプロパガンダ検出 … 続きを読む →

カテゴリー: cs.AI, cs.HC | コメントを受け付けていません

Grounded Vision-Language Interpreter for Integrated Task and Motion Planning

投稿日: 2025年6月5日作成者: jarxiv

要約ビジョン言語モデル（VLM）の最近の進歩により、言語誘導ロボットプランナー … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Exploring Diffusion Transformer Designs via Grafting

Rectified Point Flow: Generic Point Cloud Pose Estimation

Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning

Refer to Anything with Vision-Language Prompts

macOSWorld: A Multilingual Interactive Benchmark for GUI Agents

UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

Biased by Design: Leveraging Inherent AI Biases to Enhance Critical Thinking of News Readers

Grounded Vision-Language Interpreter for Integrated Task and Motion Planning

最近の投稿

最近のコメント

アーカイブ

カテゴリー