Monthly Archives: August 2024

Iterative Graph Alignment

Posted on August 30, 2024 by jarxiv

Summary: By compressing diverse narratives, LLMs go beyond memorization to capture generalizable causal relationships …

Categories: cs.AI, cs.CL, cs.LG, cs.MA

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Posted on August 30, 2024 by jarxiv

Summary: For large language models to specialize in downstream tasks, supervised fine-tuning (SFT …

Categories: cs.AI, cs.LG

Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever

Posted on August 30, 2024 by jarxiv

Summary: Multi-vector dense models such as ColBERT are highly effective for information retrieval …

Categories: 68T50, cs.AI, cs.CL, cs.IR, I.2.7

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Posted on August 30, 2024 by jarxiv

Summary: For the generation speed of large language model (LLM) inference, the key-value (KV) cache …

Categories: cs.AI, cs.CL, cs.LG

Quantifying Geospatial in the Common Crawl Corpus

Posted on August 30, 2024 by jarxiv

Summary: For large language models (LLMs), the Common Crawl (CC) cor …

Categories: cs.AI, cs.CL

A GREAT Architecture for Edge-Based Graph Problems Like TSP

Posted on August 30, 2024 by jarxiv

Summary: In recent years, to tackle combinatorial optimization problems such as routing problems, many …

Categories: cs.AI, cs.LG

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Posted on August 30, 2024 by jarxiv

Summary: Recent advances in language models have achieved significant progress. GPT-4o, a new …

Categories: cs.AI, cs.CL, cs.HC, cs.LG, cs.SD, eess.AS

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

Posted on August 30, 2024 by jarxiv

Summary: With federated learning, a new machine learning paradigm, …

Categories: cs.AI, cs.DC, cs.LG

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Posted on August 30, 2024 by jarxiv

Summary: Training on high-quality synthetic data from strong language models (LMs) …

Categories: cs.AI, cs.CL

Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge

Posted on August 30, 2024 by jarxiv

Summary: Violent extremism has increased significantly in the United States, and extremist ideology online …

Categories: cs.AI, cs.CL
