"cs.CL" Category Archive

G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

Posted on: December 20, 2024 | Author: jarxiv

Summary: Evaluation metrics for visual captioning are important but have not been sufficiently explored. B …

Categories: cs.AI, cs.CL, cs.CV

Movie2Story: A framework for understanding videos and telling stories in the form of novel text

Posted on: December 20, 2024 | Author: jarxiv

Summary: Multimodal video-to-text models, mainly simple … of video content …

Categories: cs.AI, cs.CL, cs.CV

Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization Layers

Posted on: December 20, 2024 | Author: jarxiv

Summary: Today, deep neural networks can handle a wide variety of complex tasks …

Categories: cs.CL, cs.CV, cs.LG

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Posted on: December 20, 2024 | Author: jarxiv

Summary: Text-to-video models, through optimization on high-quality text-video pairs, …

Categories: cs.CL, cs.CV, cs.MM

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Posted on: December 20, 2024 | Author: jarxiv

Summary: We … pretrained text-only … equipped with multimodal generation capabilities …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

Towards an optimised evaluation of teachers’ discourse: The case of engaging messages

Posted on: December 20, 2024 | Author: jarxiv

Summary: Evaluating teachers' skills is, for improving the quality of education and student outcomes, highly …

Categories: cs.CL

RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting

Posted on: December 20, 2024 | Author: jarxiv

Summary: LLMs are widely used because they perform well on a wide range of tasks …

Categories: cs.CL, cs.LG

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback

Posted on: December 19, 2024 | Author: jarxiv

Summary: Automatically synthesizing dense rewards from natural language descriptions is a promising … in reinforcement learning (RL) …

Categories: cs.AI, cs.CL, cs.LG, cs.RO

EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation

Posted on: December 19, 2024 | Author: jarxiv

Summary: Both the effectiveness and the efficiency of retrieval-augmented generation (RAG) in question answering (QA) …

Categories: cs.AI, cs.CL, cs.IR

RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs

Posted on: December 19, 2024 | Author: jarxiv

Summary: Resolving ambiguity is key to effective communication. Humans … conversational grou …

Categories: cs.CL
