"cs.AI" Category Archive

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Posted on June 18, 2024 by jarxiv

Summary: A standard method for aligning language models with human preferences, direct preference optimization ( … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments off

Embodied Instruction Following in Unknown Environments

Posted on June 18, 2024 by jarxiv

Summary: Enabling embodied agents to complete complex human instructions from natural language … Continue reading →

Categories: cs.AI, cs.RO | Comments off

WPO: Enhancing RLHF with Weighted Preference Optimization

Posted on June 18, 2024 by jarxiv

Summary: Reinforcement learning from human feedback (RLHF), large language model … Continue reading →

Categories: cs.AI, cs.CL, cs.LG | Comments off

Language Modeling with Editable External Knowledge

Posted on June 18, 2024 by jarxiv

Summary: When the world changes, so does the text that humans write about it. To reflect these changes … Continue reading →

Categories: cs.AI, cs.CL | Comments off

YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection

Posted on June 18, 2024 by jarxiv

Summary: The mainstream methods for image-based drone detection are general-purpose object detection such as YOLOv5 … Continue reading →

Categories: cs.AI, cs.CV | Comments off

Evaluating Task-based Effectiveness of MLLMs on Charts

Posted on June 18, 2024 by jarxiv

Summary: In this paper, GPT-4V, for low-level data analysis tasks on charts, … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments off

See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding

Posted on June 18, 2024 by jarxiv

Summary: Vision-language models (VLMs) respond to queries about images in many languages … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments off

Deep Learning methodology for the identification of wood species using high-resolution macroscopic images

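Posted on June 18, 2024 by jarxiv

Summary: To support a sustainable timber trade, significant advances in the field of wood species identification worldwide … Continue reading →

Categories: cs.AI, cs.CV, I.2.1 | Comments off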

Task Me Anything

Posted on June 18, 2024 by jarxiv

Summary: Benchmarks for large multimodal language models (MLMs), specific capabilities … Continue reading →

Categories: cs.AI, cs.CV | Comments off

A Brief Survey on Leveraging Large Scale Vision Models for Enhanced Robot Grasping

Posted on June 18, 2024 by jarxiv

Summary: Robotic grasping presents a difficult motor task in real-world scenarios, and … Continue reading →

Categories: cs.AI, cs.CV, cs.RO | Comments off

