月別アーカイブ: 2025年5月

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

投稿日: 2025年5月7日作成者: jarxiv

要約大規模なデータセットで訓練されたテキストツービデオ（T2V）生成モデルの進 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Personalization of Large Language Models: A Survey

投稿日: 2025年5月7日作成者: jarxiv

要約大規模な言語モデル（LLMS）のパーソナライズは、幅広いアプリケーションで … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

投稿日: 2025年5月7日作成者: jarxiv

要約検索された生成（RAG）は、外部ソースから関連するドキュメントを取得し、そ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Applications of Artificial Intelligence for Cross-language Intelligibility Assessment of Dysarthric Speech

投稿日: 2025年5月7日作成者: jarxiv

要約目的：音声明瞭度は、ダイサルリアの評価と管理における重要な結果ですが、ほと … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications

投稿日: 2025年5月7日作成者: jarxiv

要約 VLMベースのWebエージェントアプリケーション用のオープンソーススイート … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.MA | コメントを受け付けていません

Survey of Abstract Meaning Representation: Then, Now, Future

投稿日: 2025年5月7日作成者: jarxiv

要約このホワイトペーパーでは、グラフベースの構造を介して文の意味をキャプチャす … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

投稿日: 2025年5月7日作成者: jarxiv

要約ロータリー位置の埋め込み（ロープ）を改善することにより、言語モデル（LMS … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation

投稿日: 2025年5月7日作成者: jarxiv

要約現代の音声分離技術は、長い混合オーディオ波形を巧みに処理しますが、騒々しい … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback

投稿日: 2025年5月7日作成者: jarxiv

要約大規模な言語モデル（LLM）は、スケーラブルなメンタルヘルスサポートを提供 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers

投稿日: 2025年5月7日作成者: jarxiv

要約視聴覚音声認識（AVSR）の最近の進歩により、この分野では前例のない成果が … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年5月

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Personalization of Large Language Models: A Survey

Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

Applications of Artificial Intelligence for Cross-language Intelligibility Assessment of Dysarthric Speech

LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications

Survey of Abstract Meaning Representation: Then, Now, Future

Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation

Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers

最近の投稿

最近のコメント

アーカイブ

カテゴリー