月別アーカイブ: 2025年2月

ContextFormer: Redefining Efficiency in Semantic Segmentation

投稿日: 2025年2月3日作成者: jarxiv

要約セマンティックセグメンテーションは、コンピュータービジョンにおける重要であ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

投稿日: 2025年2月3日作成者: jarxiv

要約マルチモーダルモデルの最近の進歩により、視覚的認識、推論能力、視覚言語の理 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Classifying Deepfakes Using Swin Transformers

投稿日: 2025年2月3日作成者: jarxiv

要約ディープフェイクテクノロジーの急増は、デジタルメディアの信頼性と信頼性に大 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge

投稿日: 2025年2月3日作成者: jarxiv

要約自律システムへの人間の直感的な相互作用の統合は限られています。従来の自然 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.NE, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

$α$-OCC: Uncertainty-Aware Camera-based 3D Semantic Occupancy Prediction

投稿日: 2025年2月3日作成者: jarxiv

要約自律的な車両の認識の領域では、計画やマッピングなどのタスクの3Dシーンを理 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Medical Semantic Segmentation with Diffusion Pretrain

投稿日: 2025年2月3日作成者: jarxiv

要約深い学習の最近の進歩により、学習堅牢な機能表現は、医療画像セグメンテーショ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Imagine with the Teacher: Complete Shape in a Multi-View Distillation Way

投稿日: 2025年2月3日作成者: jarxiv

要約ポイントクラウドの完了は、オクルージョン、センサーの制限、ノイズなどによっ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Referential communication in heterogeneous communities of pre-trained visual deep networks

投稿日: 2025年2月3日作成者: jarxiv

要約大規模な事前に訓練された画像処理ニューラルネットワークが自動運転車やロボッ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Application of Generative Adversarial Network (GAN) for Synthetic Training Data Creation to improve performance of ANN Classifier for extracting Built-Up pixels from Landsat Satellite Imagery

投稿日: 2025年2月3日作成者: jarxiv

要約低解像度のランドサット画像を使用したピクセルベースの分類タスクのニューラル … 続きを読む →

カテゴリー: cs.CV, cs.LG, I.4.6 | コメントを受け付けていません

Anatomy Might Be All You Need: Forecasting What to Do During Surgery

投稿日: 2025年2月3日作成者: jarxiv

要約外科的指導はさまざまな方法で提供できます。脳神経外科では、術前のMRIス … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年2月

ContextFormer: Redefining Efficiency in Semantic Segmentation

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

Classifying Deepfakes Using Swin Transformers

Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge

$α$-OCC: Uncertainty-Aware Camera-based 3D Semantic Occupancy Prediction

Medical Semantic Segmentation with Diffusion Pretrain

Imagine with the Teacher: Complete Shape in a Multi-View Distillation Way

Referential communication in heterogeneous communities of pre-trained visual deep networks

Application of Generative Adversarial Network (GAN) for Synthetic Training Data Creation to improve performance of ANN Classifier for extracting Built-Up pixels from Landsat Satellite Imagery

Anatomy Might Be All You Need: Forecasting What to Do During Surgery

最近の投稿

最近のコメント

アーカイブ

カテゴリー