月別アーカイブ: 2024年7月

FlexAttention for Efficient High-Resolution Vision-Language Models

投稿日: 2024年7月30日作成者: jarxiv

要約現在の高解像度ビジョン言語モデルは、画像を高解像度画像トークンとしてエンコ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Improving 2D Feature Representations by 3D-Aware Fine-Tuning

投稿日: 2024年7月30日作成者: jarxiv

要約現在のビジュアル基盤モデルは、非構造化 2D データのみでトレーニングされ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Matryoshka Multimodal Models

投稿日: 2024年7月30日作成者: jarxiv

要約 LLaVA などの大規模マルチモーダルモデル (LMM) は、視覚言語推 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

SAPG: Split and Aggregate Policy Gradients

投稿日: 2024年7月30日作成者: jarxiv

要約極端なサンプルの非効率にもかかわらず、ポリシーに基づく強化学習、別名ポリシ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing

投稿日: 2024年7月30日作成者: jarxiv

要約テキストベースの編集普及モデルは、ユーザーの入力指示があいまいな場合、パフ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training

投稿日: 2024年7月30日作成者: jarxiv

要約正確なリアルタイムの物体検出は、安全監視から品質管理に至るまで、数多くの産 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Benchmarking Dependence Measures to Prevent Shortcut Learning in Medical Imaging

投稿日: 2024年7月30日作成者: jarxiv

要約医療画像コホートは、取得デバイス、病院の場所、患者の背景などの要因によって … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network

投稿日: 2024年7月30日作成者: jarxiv

要約軌道予測は、交通参加者の将来の動きを予測することを目的としているため、自動 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation

投稿日: 2024年7月30日作成者: jarxiv

要約トレーニングレコメンダーシステム (RecSys) におけるグラフ畳み … 続きを読む →

カテゴリー: cs.IR, cs.LG | コメントを受け付けていません

A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation

投稿日: 2024年7月30日作成者: jarxiv

要約眼科の診察は、目の病気の診断、治療、予防にとって非常に重要です。しかし、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

月別アーカイブ: 2024年7月

FlexAttention for Efficient High-Resolution Vision-Language Models

Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Matryoshka Multimodal Models

SAPG: Split and Aggregate Policy Gradients

Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing

DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training

Benchmarking Dependence Measures to Prevent Shortcut Learning in Medical Imaging

Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network

Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation

A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation

最近の投稿

最近のコメント

アーカイブ

カテゴリー