「cs.LG」カテゴリーアーカイブ

Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers

投稿日: 2024年12月10日作成者: jarxiv

要約ビジョントランスフォーマー (ViT) は、完全教師ありから自己教師あり … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis

投稿日: 2024年12月10日作成者: jarxiv

要約大規模言語モデル (LLM) は、計画と推論を必要とするタスクで驚くべきパ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Can tweets predict article retractions? A comparison between human and LLM labelling

投稿日: 2024年12月10日作成者: jarxiv

要約問題のある研究論文を迅速に検出することは、科学研究の完全性を守るために非常 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DL, cs.LG | コメントを受け付けていません

Semantic Search and Recommendation Algorithm

投稿日: 2024年12月10日作成者: jarxiv

要約このペーパーでは、Word2Vec と Annoy Index を使用して … 続きを読む →

カテゴリー: cs.AI, cs.DB, cs.IR, cs.LG | コメントを受け付けていません

Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone

投稿日: 2024年12月10日作成者: jarxiv

要約意思決定ポリシーの学習における最近の進歩は、主に模倣学習を介した表現力豊か … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

LLM Pruning and Distillation in Practice: The Minitron Approach

投稿日: 2024年12月10日作成者: jarxiv

要約プルーニングと蒸留を使用して、Llama 3.1 8B モデルと Mist … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Croissant: A Metadata Format for ML-Ready Datasets

投稿日: 2024年12月10日作成者: jarxiv

要約データは機械学習 (ML) にとって重要なリソースですが、データの操作は依 … 続きを読む →

カテゴリー: cs.AI, cs.DB, cs.IR, cs.LG | コメントを受け付けていません

The Narrow Gate: Localized Image-Text Communication in Vision-Language Models

投稿日: 2024年12月10日作成者: jarxiv

要約マルチモーダルトレーニングの最近の進歩により、統一モデル内での画像の理解 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Break a Lag: Triple Exponential Moving Average for Enhanced Optimization

投稿日: 2024年12月10日作成者: jarxiv

要約深層学習モデルのパフォーマンスは、高度な最適化戦略に大きく依存します。既 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

DexDiffuser: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation

投稿日: 2024年12月10日作成者: jarxiv

要約高度なロボット工学には、接触を多く含むインタラクションによる器用な操作が不 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers

How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis

Can tweets predict article retractions? A comparison between human and LLM labelling

Semantic Search and Recommendation Algorithm

Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone

LLM Pruning and Distillation in Practice: The Minitron Approach

Croissant: A Metadata Format for ML-Ready Datasets

The Narrow Gate: Localized Image-Text Communication in Vision-Language Models

Break a Lag: Triple Exponential Moving Average for Enhanced Optimization

DexDiffuser: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation

最近の投稿

最近のコメント

アーカイブ

カテゴリー