投稿者「jarxiv」のアーカイブ

Soybean Disease Detection via Interpretable Hybrid CNN-GNN: Integrating MobileNetV2 and GraphSAGE with Cross-Modal Attention

投稿日: 2025年5月5日作成者: jarxiv

要約大豆の葉の病害検出は農業生産性にとって重要であるが、従来の方法では視覚的に … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks

投稿日: 2025年5月5日作成者: jarxiv

要約既存のRGB-Dセマンティックセグメンテーション手法の多くは、複雑なクロス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

投稿日: 2025年5月5日作成者: jarxiv

要約大規模言語モデル(LLM)は、より多くの推論を行うことで、強化された能力と … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

投稿日: 2025年5月5日作成者: jarxiv

要約脊髄のセグメンテーションから得られる形態計測指標は、脊髄に影響を及ぼす神経 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing

投稿日: 2025年5月5日作成者: jarxiv

要約本論文では、リモートセンシング画像から多角形の建物をマッピングするという課 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Multimodal Doctor-in-the-Loop: A Clinically-Guided Explainable Framework for Predicting Pathological Response in Non-Small Cell Lung Cancer

投稿日: 2025年5月5日作成者: jarxiv

要約本研究では、ネオアジュバント療法を受ける非小細胞肺がん患者の病理学的奏効を … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

An Automated Pipeline for Few-Shot Bird Call Classification: A Case Study with the Tooth-Billed Pigeon

投稿日: 2025年5月5日作成者: jarxiv

要約本論文では、BirdNETやPerchのような大規模な公開分類器にはない希 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.SD | コメントを受け付けていません

VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models

投稿日: 2025年5月5日作成者: jarxiv

要約ビデオ拡散モデルの急速な台頭により、非常にリアルで時間的にコヒーレントなビ … 続きを読む →

カテゴリー: cs.CR, cs.CV, cs.LG | コメントを受け付けていません

RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training

投稿日: 2025年5月5日作成者: jarxiv

要約未知の関節角度を持つ多関節ロボットの視覚に基づく姿勢推定は、協調ロボット工 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

GENMO: A GENeralist Model for Human MOtion

投稿日: 2025年5月5日作成者: jarxiv

要約ヒューマンモーションモデリングは伝統的に、モーションの生成と推定を、特化し … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG, cs.RO | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Soybean Disease Detection via Interpretable Hybrid CNN-GNN: Integrating MobileNetV2 and GraphSAGE with Cross-Modal Attention

Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing

Multimodal Doctor-in-the-Loop: A Clinically-Guided Explainable Framework for Predicting Pathological Response in Non-Small Cell Lung Cancer

An Automated Pipeline for Few-Shot Bird Call Classification: A Case Study with the Tooth-Billed Pigeon

VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models

RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training

GENMO: A GENeralist Model for Human MOtion

最近の投稿

最近のコメント

アーカイブ

カテゴリー