投稿者「jarxiv」のアーカイブ

Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition

投稿日: 2025年4月10日作成者: jarxiv

要約食品の画像に基づいた自動食事評価は依然として課題であり、正確な食品検出、セ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Towards Communication-Efficient Adversarial Federated Learning for Robust Edge Intelligence

投稿日: 2025年4月10日作成者: jarxiv

要約 Federated Learning（FL）は、生データを公開せずにエッジ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PathSegDiff: Pathology Segmentation using Diffusion model representations

投稿日: 2025年4月10日作成者: jarxiv

要約画像セグメンテーションは、正確な疾患診断、サブタイピング、結果、生存可能性 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology

投稿日: 2025年4月10日作成者: jarxiv

要約多くの生物医学的画像分析タスクでは、正確で効率的な細胞検出が重要です。予 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

投稿日: 2025年4月10日作成者: jarxiv

要約補強学習における最近の進歩により、マルチモーダルの大手言語モデル（MLLM … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation

投稿日: 2025年4月10日作成者: jarxiv

要約家具アセンブリやコンポーネントフィッティングなどの3Dアセンブリタスクは、 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation

投稿日: 2025年4月10日作成者: jarxiv

要約自己学習学習（SSL）により、地球観察のためのVision Foundat … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A Deep Single Image Rectification Approach for Pan-Tilt-Zoom Cameras

投稿日: 2025年4月10日作成者: jarxiv

要約広角レンズを備えたパンチルトズーム（PTZ）カメラは、監視に広く使用されて … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting

投稿日: 2025年4月10日作成者: jarxiv

要約植物の形態学的特性の自動抽出は、ハイスループットフィールド表現型（HTFP … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets

投稿日: 2025年4月10日作成者: jarxiv

要約 3D人間のデジタル化は、長い間、非常に追求されているが挑戦的な作業でした。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition

Towards Communication-Efficient Adversarial Federated Learning for Robust Edge Intelligence

PathSegDiff: Pathology Segmentation using Diffusion model representations

A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation

Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation

A Deep Single Image Rectification Approach for Pan-Tilt-Zoom Cameras

Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting

SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets

最近の投稿

最近のコメント

アーカイブ

カテゴリー