月別アーカイブ: 2024年7月

Looking at Model Debiasing through the Lens of Anomaly Detection

投稿日: 2024年7月25日作成者: jarxiv

要約ディープニューラルネットワークがデータの偏りの影響を受けやすいことは広 … 続きを読む →

カテゴリー: cs.CV, cs.LG, I.4 | コメントを受け付けていません

$VILA^2$: VILA Augmented VILA

投稿日: 2024年7月25日作成者: jarxiv

要約ビジュアル言語モデル (VLM) は、大規模言語モデル (LLM) の成功 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CSCPR: Cross-Source-Context Indoor RGB-D Place Recognition

投稿日: 2024年7月25日作成者: jarxiv

要約グローバルな検索と再ランキングを単一のエンドツーエンドモデルに統合する、 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning

投稿日: 2024年7月25日作成者: jarxiv

要約強化学習 (RL) により、ソーシャルロボットは人間が設計したルールや介 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency

投稿日: 2024年7月25日作成者: jarxiv

要約我々は、マルチフレームおよびマルチビューの一貫した動的 3D コンテンツ生 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?

投稿日: 2024年7月25日作成者: jarxiv

要約大規模言語モデル (LLM) は、外部知識データベースからの情報の取得を活 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game

投稿日: 2024年7月25日作成者: jarxiv

要約戦略的社会演繹ゲームは、言語モデルの理解と推論スキルを評価するための貴重な … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data

投稿日: 2024年7月25日作成者: jarxiv

要約国際的な賞金競争、スケール調整された車両、シミュレーション環境が利用可能で … 続きを読む →

カテゴリー: cs.LG, cs.RO | コメントを受け付けていません

Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning

投稿日: 2024年7月25日作成者: jarxiv

要約テキストから音楽へのモデルを使用すると、ユーザーはテキストコマンドを使用 … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles

投稿日: 2024年7月25日作成者: jarxiv

要約異なるセンサーモダリティを融合することは、特にセンサーが非同期である場合 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

月別アーカイブ: 2024年7月

Looking at Model Debiasing through the Lens of Anomaly Detection

$VILA^2$: VILA Augmented VILA

CSCPR: Cross-Source-Context Indoor RGB-D Place Recognition

SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning

SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency

How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?

AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game

A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data

Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning

Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles

最近の投稿

最近のコメント

アーカイブ

カテゴリー