「cs.CV」カテゴリーアーカイブ

Neuromorphic spatiotemporal optical flow: Enabling ultrafast visual perception beyond human capabilities

投稿日: 2025年1月31日作成者: jarxiv

要約生物学的視覚システムのメカニズムに触発された光学フローは、ロボット工学が複 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

LMFusion: Adapting Pretrained Language Models for Multimodal Generation

投稿日: 2025年1月31日作成者: jarxiv

要約 LMFusionを、マルチモーダル生成機能を備えた事前に守られたテキストの … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Surface Defect Identification using Bayesian Filtering on a 3D Mesh

投稿日: 2025年1月31日作成者: jarxiv

要約このペーパーでは、自動化された表面欠陥検出のためのCADベースのアプローチ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

A Video-grounded Dialogue Dataset and Metric for Event-driven Activities

投稿日: 2025年1月31日作成者: jarxiv

要約このペーパーでは、タスク用に特別に設計されたセッションベースのコンテキスト … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

CodeBrain: Impute Any Brain MRI via Instance-specific Scalar-quantized Codes

投稿日: 2025年1月31日作成者: jarxiv

要約 MRI代入は、1つ以上の利用可能なモダリティから欠落しているモダリティを合 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

The Batch Artifact Scanning Protocol: A new method using computed tomography (CT) to rapidly create three-dimensional models of objects from large collections en masse

投稿日: 2025年1月31日作成者: jarxiv

要約人類学では、3次元（3D）イメージングの使用は、広範囲の主要な人類学的問題 … 続きを読む →

カテゴリー: 68U05, 68W99, cs.CV, J.0 | コメントを受け付けていません

Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models

投稿日: 2025年1月31日作成者: jarxiv

要約ロボット手術ビデオにおける手術ツールキーポイントの自動追跡は、スキル評価、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

投稿日: 2025年1月31日作成者: jarxiv

要約専門家レベルの医療知識と高度な推論を評価するために、非常に挑戦的で包括的な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Cracks in concrete

投稿日: 2025年1月31日作成者: jarxiv

要約コンクリートの画像の亀裂を見つけて適切にセグメント化することは、困難な作業 … 続きを読む →

カテゴリー: 60D05, cs.CV, eess.IV, stat.AP | コメントを受け付けていません

Dual Thinking and Logical Processing — Are Multi-modal Large Language Models Closing the Gap with Human Vision ?

投稿日: 2025年1月31日作成者: jarxiv

要約デュアル思考フレームワークでは、高速で直感的な処理と遅い論理処理を考慮しま … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Neuromorphic spatiotemporal optical flow: Enabling ultrafast visual perception beyond human capabilities

LMFusion: Adapting Pretrained Language Models for Multimodal Generation

Surface Defect Identification using Bayesian Filtering on a 3D Mesh

A Video-grounded Dialogue Dataset and Metric for Event-driven Activities

CodeBrain: Impute Any Brain MRI via Instance-specific Scalar-quantized Codes

The Batch Artifact Scanning Protocol: A new method using computed tomography (CT) to rapidly create three-dimensional models of objects from large collections en masse

Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Cracks in concrete

Dual Thinking and Logical Processing — Are Multi-modal Large Language Models Closing the Gap with Human Vision ?

最近の投稿

最近のコメント

アーカイブ

カテゴリー