「cs.AI」カテゴリーアーカイブ

Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

投稿日: 2024年10月31日作成者: jarxiv

要約近年、深層学習アプローチが降水ナウキャスティングに広く採用されています。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

投稿日: 2024年10月31日作成者: jarxiv

要約フューショット知識蒸留は、限られたデータと計算リソースを使用して、大規模な … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET

投稿日: 2024年10月31日作成者: jarxiv

要約認知症、特にアルツハイマー病 (AD) と前頭側頭型認知症 (FTD) の … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Aligning Audio-Visual Joint Representations with an Agentic Workflow

投稿日: 2024年10月31日作成者: jarxiv

要約ビジュアルコンテンツと付随するオーディオ信号は、オーディオビジュアル ( … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

Keypoint Abstraction using Large Models for Object-Relative Imitation Learning

投稿日: 2024年10月31日作成者: jarxiv

要約多様なタスクや環境にわたる新しいオブジェクト構成やインスタンスへの一般化は … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

EMMA: End-to-End Multimodal Model for Autonomous Driving

投稿日: 2024年10月31日作成者: jarxiv

要約自動運転のためのエンドツーエンドのマルチモーダルモデルであるEMMAを紹介 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models

投稿日: 2024年10月31日作成者: jarxiv

要約既存のベンチマークでは、ビデオ理解のための時間的コンテキストを活用する際に … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Multi-student Diffusion Distillation for Better One-step Generators

投稿日: 2024年10月31日作成者: jarxiv

要約拡散モデルは、長時間にわたる複数ステップの推論手順を犠牲にして、高品質のサ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

投稿日: 2024年10月31日作成者: jarxiv

要約人間には、一般的な世界の動きの遅い学習と、新しい経験からのエピソード記憶の … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Integration of Large Language Models and Federated Learning

投稿日: 2024年10月31日作成者: jarxiv

要約大規模言語モデル (LLM) のパラメータサイズが拡大し続ける中、高品質 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET

Aligning Audio-Visual Joint Representations with an Agentic Workflow

Keypoint Abstraction using Large Models for Object-Relative Imitation Learning

EMMA: End-to-End Multimodal Model for Autonomous Driving

TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models

Multi-student Diffusion Distillation for Better One-step Generators

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

Integration of Large Language Models and Federated Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー