月別アーカイブ: 2024年7月

Network Inversion of Convolutional Neural Nets

投稿日: 2024年7月26日作成者: jarxiv

要約ニューラルネットワークは、さまざまなアプリケーションにわたる強力なツール … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

投稿日: 2024年7月26日作成者: jarxiv

要約シーングラフ生成 (SGG) タスクには、画像内のオブジェクトの検出と、 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions

投稿日: 2024年7月26日作成者: jarxiv

要約加速された MRI 再構成などの逆問題は不適切な設定であり、考えられるもっ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos

投稿日: 2024年7月26日作成者: jarxiv

要約インテリジェントな支援には、理解するだけでなく行動も含まれます。既存の自 … 続きを読む →

カテゴリー: cs.CV, cs.HC, cs.NE | コメントを受け付けていません

AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild

投稿日: 2024年7月26日作成者: jarxiv

要約最近、人間とコンピューターのさまざまな形式の対話を使用するための 3D 手 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models

投稿日: 2024年7月26日作成者: jarxiv

要約モバイルデバイスでキャプチャされた自然画像には、ノイズ、ぼやけ、低照度な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images

投稿日: 2024年7月26日作成者: jarxiv

要約継続学習 (CL) は、一方向のトレーニング方法を打破し、モデルが新しいデ … 続きを読む →

カテゴリー: cs.CV, cs.MM | コメントを受け付けていません

TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework

投稿日: 2024年7月26日作成者: jarxiv

要約セマンティックセグメンテーションとステレオマッチングは、それぞれ人間の … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

InternVideo2: Scaling Foundation Models for Multimodal Video Understanding

投稿日: 2024年7月26日作成者: jarxiv

要約ビデオ認識、ビデオテキストタスク、およびビデオ中心の対話において最先端 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

YOCO: You Only Calibrate Once for Accurate Extrinsic Parameter in LiDAR-Camera Systems

投稿日: 2024年7月26日作成者: jarxiv

要約カメラと LiDAR で構成されるマルチセンサーフュージョンシステムで … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

月別アーカイブ: 2024年7月

Network Inversion of Convolutional Neural Nets

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions

PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos

AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild

RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models

Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images

TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework

InternVideo2: Scaling Foundation Models for Multimodal Video Understanding

YOCO: You Only Calibrate Once for Accurate Extrinsic Parameter in LiDAR-Camera Systems

最近の投稿

最近のコメント

アーカイブ

カテゴリー