「cs.LG」カテゴリーアーカイブ

Style-based Clustering of Visual Artworks and the Play of Neural Style-Representations

投稿日: 2025年2月4日作成者: jarxiv

要約スタイルに基づく芸術作品のクラスタリングは、芸術作品の推薦、スタイルに基づ … 続きを読む →

カテゴリー: cs.CV, cs.LG, I.4.8 | コメントを受け付けていません

Multimodal ELBO with Diffusion Decoders

投稿日: 2025年2月4日作成者: jarxiv

要約マルチモーダル変分オートエンコーダは、異なるモダリティを潜在表現にマッピン … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Disentanglement with Factor Quantized Variational Autoencoders

投稿日: 2025年2月4日作成者: jarxiv

要約分離表現学習は、データセットの基礎となる生成因子を、互いに独立した潜在表現 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

A comparison between humans and AI at recognizing objects in unusual poses

投稿日: 2025年2月4日作成者: jarxiv

要約ディープラーニングは、いくつかの物体認識ベンチマークにおいて、人間の視覚と … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Understanding Model Calibration — A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)

投稿日: 2025年2月4日作成者: jarxiv

要約モデルが信頼できるとみなされるためには、各決定における信頼度が真の結果を忠 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ME, stat.ML | コメントを受け付けていません

Contrast-Aware Calibration for Fine-Tuned CLIP: Leveraging Image-Text Alignment

投稿日: 2025年2月4日作成者: jarxiv

要約 CLIPのような視覚言語モデル(VLM)は、卓越した汎化能力を実証しており … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

投稿日: 2025年2月4日作成者: jarxiv

要約この研究では、音声とテキストを大規模言語モデル（LLM）への入力として統合 … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

What is causal about causal models and representations?

投稿日: 2025年2月4日作成者: jarxiv

要約因果ベイズネットワークは、介入分布に関する予測を行うので、「因果」モデルで … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.ST, stat.ML, stat.TH | コメントを受け付けていません

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

投稿日: 2025年2月4日作成者: jarxiv

要約実世界のシナリオにおいて、領域適応と汎化を達成することは、モデルが未知のタ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

s1: Simple test-time scaling

投稿日: 2025年2月4日作成者: jarxiv

要約テスト・タイム・スケーリングは、言語モデリングに対する有望な新しいアプロー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Style-based Clustering of Visual Artworks and the Play of Neural Style-Representations

Multimodal ELBO with Diffusion Decoders

Disentanglement with Factor Quantized Variational Autoencoders

A comparison between humans and AI at recognizing objects in unusual poses

Understanding Model Calibration — A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)

Contrast-Aware Calibration for Fine-Tuned CLIP: Leveraging Image-Text Alignment

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

What is causal about causal models and representations?

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

s1: Simple test-time scaling

最近の投稿

最近のコメント

アーカイブ

カテゴリー