月別アーカイブ: 2024年4月

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

投稿日: 2024年4月29日作成者: jarxiv

要約データ量が自己教師あり学習の有効性を高める時代では、データセマンティクス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Probing Conceptual Understanding of Large Visual-Language Models

投稿日: 2024年4月29日作成者: jarxiv

要約近年、大規模なビジュアル言語 (V+L) モデルがさまざまな下流タスクで大 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal

投稿日: 2024年4月29日作成者: jarxiv

要約機密データや個人データでトレーニングされた機械学習モデルは、その情報を誤っ … 続きを読む →

カテゴリー: cs.CR, cs.CV, cs.LG | コメントを受け付けていません

MAIRA-1: A specialised large multimodal model for radiology report generation

投稿日: 2024年4月29日作成者: jarxiv

要約胸部 X 線 (CXR) から放射線医学レポートを生成するタスクのための放 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

投稿日: 2024年4月29日作成者: jarxiv

要約新しい操作タスクを学習できる自律型ロボットシステムは、産業を製造からサー … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing

投稿日: 2024年4月29日作成者: jarxiv

要約ほとんどの分子図パーサーは、ラスター画像 (PNG など) から化学構造を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

投稿日: 2024年4月29日作成者: jarxiv

要約一般化可能な NeRF は、目に見えないシーンに対して新しいビューを合成す … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios

投稿日: 2024年4月29日作成者: jarxiv

要約 Medical Vision-Language Pretraining ( … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models

投稿日: 2024年4月29日作成者: jarxiv

要約 Large Vision-Language Model (LVLM) は、 … 続きを読む →

カテゴリー: cs.CV, cs.MM | コメントを受け付けていません

Overload: Latency Attacks on Object Detection for Edge Devices

投稿日: 2024年4月29日作成者: jarxiv

要約現在、インテリジェントサービスに対する需要が高まっているため、ディープ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年4月

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Probing Conceptual Understanding of Large Visual-Language Models

DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal

MAIRA-1: A specialised large multimodal model for radiology report generation

Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing

Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios

Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models

Overload: Latency Attacks on Object Detection for Edge Devices

最近の投稿

最近のコメント

アーカイブ

カテゴリー