月別アーカイブ: 2024年3月

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

投稿日: 2024年3月11日作成者: jarxiv

要約大規模言語モデル (LLM) が推論を批判し洗練する能力は、評価、フィード … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

投稿日: 2024年3月11日作成者: jarxiv

要約限られた計算コストで、事前トレーニングされた大規模言語モデル (LLM) … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Multi-View Causal Representation Learning with Partial Observability

投稿日: 2024年3月11日作成者: jarxiv

要約我々は、異なるデータモダリティなど、同時に観察されたビューから学習された表 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

投稿日: 2024年3月11日作成者: jarxiv

要約多変量時系列 (MTS) の予測は、長い間重要ではありますが、困難なタスク … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Algorithmic Identification of Essential Exogenous Nodes for Causal Sufficiency in Brain Networks

投稿日: 2024年3月11日作成者: jarxiv

要約脳の因果ネットワークなどの因果メカニズムの研究においては、因果関係が十分で … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference

投稿日: 2024年3月11日作成者: jarxiv

要約整数、固定小数点、または浮動小数点のデータ型を使用する従来のディープニュ … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.LG, cs.NE | コメントを受け付けていません

RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models models via Romanization

投稿日: 2024年3月11日作成者: jarxiv

要約この研究は、非ローマ字を使用して大規模言語モデル (LLM) を英語以外の … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Intriguing Properties of Input-dependent Randomized Smoothing

投稿日: 2024年3月11日作成者: jarxiv

要約ランダム化平滑化は、現在、保証された堅牢な分類器を取得するための最先端の方 … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

投稿日: 2024年3月11日作成者: jarxiv

要約思考連鎖プロンプト (CoT) は、言語モデル推論の説明可能性を向上させる … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

DeepSeek-VL: Towards Real-World Vision-Language Understanding

投稿日: 2024年3月11日作成者: jarxiv

要約ここでは、現実世界の視覚および言語理解アプリケーション向けに設計されたオー … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

月別アーカイブ: 2024年3月

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Multi-View Causal Representation Learning with Partial Observability

Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

Algorithmic Identification of Essential Exogenous Nodes for Causal Sufficiency in Brain Networks

Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference

RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models models via Romanization

Intriguing Properties of Input-dependent Randomized Smoothing

Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

DeepSeek-VL: Towards Real-World Vision-Language Understanding

最近の投稿

最近のコメント

アーカイブ

カテゴリー