"cs.CL" Category Archive

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Posted: April 11, 2025 | Author: jarxiv

Abstract: Recently, DeepSeek R1 has shown that reinforcement learning (RL), with a simple and effective …

Categories: cs.CL, cs.CV

ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models

Posted: April 11, 2025 | Author: jarxiv

Abstract: Retrieval-augmented generation (RAG) has attracted attention in the recent past, and recent large language …

Categories: cs.AI, cs.CL, cs.IR

VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models

Posted: April 11, 2025 | Author: jarxiv

Abstract: VideoComp targets fine-grained temporal alignment in vision-language models …

Categories: cs.AI, cs.CL, cs.CV, cs.IR

CollEX — A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections

Posted: April 11, 2025 | Author: jarxiv

Abstract: This paper concerns enhancing the interactive exploration of extensive scientific collections …

Categories: cs.CL, cs.CV, cs.IR

A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions

Posted: April 11, 2025 | Author: jarxiv

Abstract: Synthesizing high-quality reasoning data for continual training of large language models …

Categories: cs.CL

On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data

Posted: April 11, 2025 | Author: jarxiv

Abstract: On temporal reasoning tasks over data not present during training, large …

Categories: cs.AI, cs.CL

CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections

Posted: April 11, 2025 | Author: jarxiv

Abstract: In the era of foundation models, CLIP aligns text and visual modalities into a common embedding …

Categories: cs.CL, cs.CV, cs.LG

Unveiling the Impact of Multimodal Features on Chinese Spelling Correction: From Analysis to Design

Posted: April 11, 2025 | Author: jarxiv

Abstract: The Chinese Spelling Correction (CSC) task focuses on detecting and correcting spelling errors in sentences …

Categories: cs.CL

SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT

Posted: April 11, 2025 | Author: jarxiv

Abstract: Data-driven unit discovery in self-supervised learning (SSL) of speech …

Categories: cs.CL, eess.AS

Synthetic Fluency: Hallucinations, Confabulations, and the Creation of Irish Words in LLM-Generated Translations

Posted: April 11, 2025 | Author: jarxiv

Abstract: This study examines hallucinations in large language model (LLM) translations into Irish …

Categories: cs.CL

