月別アーカイブ: 2024年8月

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

投稿日: 2024年8月4日作成者: jarxiv

要約大規模言語モデル（Large Language Model: LLM）ベー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation

投稿日: 2024年8月4日作成者: jarxiv

要約医用画像のセグメンテーションは、病変の自動認識と解析において重要な役割を果 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

How Effective are Self-Supervised Models for Contact Identification in Videos

投稿日: 2024年8月4日作成者: jarxiv

要約自己教師あり学習（Self-Supervised Learning：SSL … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets

投稿日: 2024年8月4日作成者: jarxiv

要約ロボットのデータセットは、科学的ベンチマークや、SLAM（Simultan … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Mitigating Multilingual Hallucination in Large Vision-Language Models

投稿日: 2024年8月4日作成者: jarxiv

要約大規模視覚言語モデル（Large Vision-Language Mode … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Alleviating Hallucination in Large Vision-Language Models with Active Retrieval Augmentation

投稿日: 2024年8月4日作成者: jarxiv

要約近年、大規模言語モデル（LLM）において、外部の知識資源から情報を検索する … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection

投稿日: 2024年8月4日作成者: jarxiv

要約近年、レーダーによる物体検出に基づくアプローチは、LiDARと比較して悪天 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control

投稿日: 2024年8月4日作成者: jarxiv

要約点群圧縮は、必要なストリーミングデータレートを大幅に削減するため、ボリュー … 続きを読む →

カテゴリー: cs.CV, cs.MM, eess.IV | コメントを受け付けていません

Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection

投稿日: 2024年8月4日作成者: jarxiv

要約教師なし3D物体検出は、LiDARポイントのようなラベル付けされていない生 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Are Bigger Encoders Always Better in Vision Large Models?

投稿日: 2024年8月4日作成者: jarxiv

要約近年、マルチモーダル大規模言語モデル（MLLM）は、実世界での応用において … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年8月

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation

How Effective are Self-Supervised Models for Contact Identification in Videos

High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets

Mitigating Multilingual Hallucination in Large Vision-Language Models

Alleviating Hallucination in Large Vision-Language Models with Active Retrieval Augmentation

MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection

Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control

Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection

Are Bigger Encoders Always Better in Vision Large Models?

最近の投稿

最近のコメント

アーカイブ

カテゴリー