「cs.AI」カテゴリーアーカイブ

Quantifying and Enabling the Interpretability of CLIP-like Models

投稿日: 2024年9月11日作成者: jarxiv

要約 CLIP は最も人気のある基本モデルの 1 つであり、多くの視覚言語タスク … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis

投稿日: 2024年9月11日作成者: jarxiv

要約緑内障、黄斑変性症、糖尿病性網膜症などの眼疾患を早期に発見することは、視力 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

World-Grounded Human Motion Recovery via Gravity-View Coordinates

投稿日: 2024年9月11日作成者: jarxiv

要約単眼ビデオから世界を基準とした人間の動きを復元するための新しい方法を紹介し … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving

投稿日: 2024年9月11日作成者: jarxiv

要約自動運転 (AD) におけるエンドツーエンドのアーキテクチャは、人間と A … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

What Did My Car Say? Impact of Autonomous Vehicle Explanation Errors and Driving Context On Comfort, Reliance, Satisfaction, and Driving Confidence

投稿日: 2024年9月11日作成者: jarxiv

要約自動運転車 (AV) の決定についての説明は信頼を築く可能性がありますが、 … 続きを読む →

カテゴリー: cs.AI, cs.HC | コメントを受け付けていません

Question-Answering Dense Video Events

投稿日: 2024年9月11日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、単一イベントビデオの質 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models

投稿日: 2024年9月11日作成者: jarxiv

要約 LLM の開発により、質問応答のインテリジェンスと流暢さが大幅に向上し、検 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

投稿日: 2024年9月11日作成者: jarxiv

要約検索拡張生成 (RAG) は、検索ツールを活用して外部データベースにアクセ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

NeurLZ: On Enhancing Lossy Compression Performance based on Error-Controlled Neural Learning for Scientific Data

投稿日: 2024年9月11日作成者: jarxiv

要約大規模な科学シミュレーションでは、ストレージと I/O に重大な課題を引き … 続きを読む →

カテゴリー: cs.AI, cs.DC | コメントを受け付けていません

Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation

投稿日: 2024年9月10日作成者: jarxiv

要約ビジョンベースの操作ポリシーを新しい環境に一般化することは、依然として困難 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Quantifying and Enabling the Interpretability of CLIP-like Models

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis

World-Grounded Human Motion Recovery via Gravity-View Coordinates

Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving

What Did My Car Say? Impact of Autonomous Vehicle Explanation Errors and Driving Context On Comfort, Reliance, Satisfaction, and Driving Confidence

Question-Answering Dense Video Events

Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

NeurLZ: On Enhancing Lossy Compression Performance based on Error-Controlled Neural Learning for Scientific Data

Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation

最近の投稿

最近のコメント

アーカイブ

カテゴリー