「cs.CL」カテゴリーアーカイブ

ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code

投稿日: 2024年8月22日作成者: jarxiv

要約 GPT-4 のような大規模言語モデル (LLM) は、関数レベルのコード生 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Large Language Models in Mental Health Care: a Scoping Review

投稿日: 2024年8月22日作成者: jarxiv

要約メンタルヘルスケアにおける大規模言語モデル (LLM) の統合は、新興分野 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Efficient Detection of Toxic Prompts in Large Language Models

投稿日: 2024年8月22日作成者: jarxiv

要約 ChatGPT や Gemini などの大規模言語モデル (LLM) は、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.SE | コメントを受け付けていません

FocusLLM: Scaling LLM’s Context by Parallel Decoding

投稿日: 2024年8月22日作成者: jarxiv

要約 LLM に長いコンテキストからの有用な情報を利用できるようにすることは、多 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era

投稿日: 2024年8月22日作成者: jarxiv

要約大規模言語モデル (LLM) の急速な進歩に伴い、検索エンジンやレコメンダ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR | コメントを受け付けていません

LLM Pruning and Distillation in Practice: The Minitron Approach

投稿日: 2024年8月22日作成者: jarxiv

要約プルーニングと蒸留を使用して、Llama 3.1 8B モデルと Mist … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Great Memory, Shallow Reasoning: Limits of $k$NN-LMs

投稿日: 2024年8月22日作成者: jarxiv

要約検索と次の単語の予測を統合する $K$-最近傍言語モデル ($k$NN-L … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

KOSMOS-2.5: A Multimodal Literate Model

投稿日: 2024年8月22日作成者: jarxiv

要約テキスト中心の画像の自動読み取りは、汎用人工知能 (AGI) の実現に向け … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework

投稿日: 2024年8月22日作成者: jarxiv

要約現在のビデオ生成モデルは、短くてリアルなクリップの作成には優れていますが、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.SE, TsingHua University | コメントを受け付けていません

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

投稿日: 2024年8月22日作成者: jarxiv

要約大規模言語モデル (LLM) は、対話型チャットボット、ドキュメント分析、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code

Large Language Models in Mental Health Care: a Scoping Review

Efficient Detection of Toxic Prompts in Large Language Models

FocusLLM: Scaling LLM’s Context by Parallel Decoding

Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era

LLM Pruning and Distillation in Practice: The Minitron Approach

Great Memory, Shallow Reasoning: Limits of $k$NN-LMs

KOSMOS-2.5: A Multimodal Literate Model

DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

最近の投稿

最近のコメント

アーカイブ

カテゴリー