"cs.AI" Category Archive

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Posted: January 31, 2025 | Author: jarxiv

Abstract: Vision-language models (VLMs) have recently been leveraged to generate robot actions …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO

Temporal Preference Optimization for Long-Form Video Understanding

Posted: January 31, 2025 | Author: jarxiv

Abstract: Despite significant advances in video large multimodal models (video-LMMs) …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO

R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest

Posted: January 31, 2025 | Author: jarxiv

Abstract: Artificial intelligence has made great progress in medical visual question answering (Med-VQA), but …

Categories: cs.AI, cs.CV

Vision-based autonomous structural damage detection using data-driven methods

Posted: January 31, 2025 | Author: jarxiv

Abstract: In this study, critical components of renewable energy infrastructure …

Categories: cs.AI, cs.CV, eess.IV

Inkspire: Supporting Design Exploration with Generative AI through Analogical Sketching

Posted: January 31, 2025 | Author: jarxiv

Abstract: With recent advances in the capabilities of text-to-image (T2I) AI models, product design …

Categories: cs.AI, cs.CV, cs.HC, cs.MM

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

Posted: January 31, 2025 | Author: jarxiv

Abstract: In real-world scenarios, models must adapt or generalize to unknown target distributions …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Diffusion Autoencoders are Scalable Image Tokenizers

Posted: January 31, 2025 | Author: jarxiv

Abstract: Tokenizing images into compact visual representations for efficient, high-quality image generation …

Categories: cs.AI, cs.CV, cs.LG

In-Context Meta LoRA Generation

Posted: January 31, 2025 | Author: jarxiv

Abstract: Low-rank adaptation (LoRA) has demonstrated remarkable capabilities for task-specific fine-tuning …

Categories: cs.AI, cs.CL, cs.CV

Slaves to the Law of Large Numbers: An Asymptotic Equipartition Property for Perplexity in Generative Language Models

Posted: January 31, 2025 | Author: jarxiv

Abstract: The perplexity of long text generated by a language model, and from open-source models …

Categories: cs.AI, cs.CL, cs.IT, math.IT

Computing the gradients with respect to all parameters of a quantum neural network using a single circuit

Posted: January 31, 2025 | Author: jarxiv

Abstract: Finding gradients is a crucial step in training machine learning models …

Categories: cs.AI, cs.LG, quant-ph
