「cs.AI」カテゴリーアーカイブ

SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot

投稿日: 2024年12月9日作成者: jarxiv

要約外科的介入、特に神経内科における外科的介入は、外科チームに多大な認知的負担 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.RO | コメントを受け付けていません

Archaeoscape: Bringing Aerial Laser Scanning Archaeology to the Deep Learning Era

投稿日: 2024年12月9日作成者: jarxiv

要約航空機レーザースキャン (ALS) テクノロジーは、密集した植生の下に隠 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

CompCap: Improving Multimodal Large Language Models with Composite Captions

投稿日: 2024年12月9日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は合成画像をどの程度理解でき … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

From classical techniques to convolution-based models: A review of object detection algorithms

投稿日: 2024年12月9日作成者: jarxiv

要約オブジェクト検出は、コンピュータービジョンと画像理解における基本的なタス … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft

投稿日: 2024年12月9日作成者: jarxiv

要約コラボレーションは社会の基礎です。現実の世界では、人間のチームメイトは多 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MA | コメントを受け付けていません

Extrapolated Urban View Synthesis Benchmark

投稿日: 2024年12月9日作成者: jarxiv

要約フォトリアリスティックなシミュレーターは、ビジョン中心の自動運転車 (AV … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models

投稿日: 2024年12月9日作成者: jarxiv

要約 Text-to-Video モデルは、多様で魅力的なビデオコンテンツを生 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

投稿日: 2024年12月9日作成者: jarxiv

要約現実的な自動運転シミュレーターの開発には4D運転シミュレーションが不可欠で … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Artificial intelligence and the internal processes of creativity

投稿日: 2024年12月9日作成者: jarxiv

要約創造的な成果を生成できる人工知能 (AI) システムは、創造性に対する私た … 続きを読む →

カテゴリー: cs.AI, cs.CY, q-bio.NC | コメントを受け付けていません

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

投稿日: 2024年12月9日作成者: jarxiv

要約 3D 占有予測は周囲のシーンの包括的な説明を提供し、3D 認識にとって不可 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot

Archaeoscape: Bringing Aerial Laser Scanning Archaeology to the Deep Learning Era

CompCap: Improving Multimodal Large Language Models with Composite Captions

From classical techniques to convolution-based models: A review of object detection algorithms

TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft

Extrapolated Urban View Synthesis Benchmark

MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Artificial intelligence and the internal processes of creativity

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

最近の投稿

最近のコメント

アーカイブ

カテゴリー