「cs.AI」カテゴリーアーカイブ

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

投稿日: 2025年3月31日作成者: jarxiv

要約視覚言語モデル（VLM）の急速な進歩は、マルチモーダルの理解を変えましたが … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Using AI to Summarize US Presidential Campaign TV Advertisement Videos, 1952-2012

投稿日: 2025年3月31日作成者: jarxiv

要約このペーパーでは、デジタル形式で入手可能な米国大統領キャンペーンテレビ広告 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation

投稿日: 2025年3月31日作成者: jarxiv

要約目的：膀胱切除患者における内臓脂肪組織（VAT）の分布は、術後合併症の発生 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

USC: Uncompromising Spatial Constraints for Safety-Oriented 3D Object Detectors in Autonomous Driving

投稿日: 2025年3月31日作成者: jarxiv

要約この作業では、自律運転コンテキストでの3Dオブジェクト検出器の安全指向のパ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models

投稿日: 2025年3月31日作成者: jarxiv

要約大規模な言語モデル（LLMS）の開発は、一般的なアシスタントとしてマルチモ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

VidTwin: Video VAE with Decoupled Structure and Dynamics

投稿日: 2025年3月31日作成者: jarxiv

要約ビデオ自動エンコーダー（ビデオAE）の最近の進歩により、ビデオ生成の品質と … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

投稿日: 2025年3月31日作成者: jarxiv

要約トレーニングビジョン言語モデル（VLM）には通常、大規模で高品質の画像テキ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

Evaluation of Machine-generated Biomedical Images via A Tally-based Similarity Measure

投稿日: 2025年3月31日作成者: jarxiv

要約超解像度、インペインティング、全画像の生成、対応のないスタイル移動、ネット … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness

投稿日: 2025年3月31日作成者: jarxiv

要約ほとんどの3Dオブジェクトジェネレーターは、美的品質に焦点を当てており、ア … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Outlier dimensions favor frequent tokens in language models

投稿日: 2025年3月31日作成者: jarxiv

要約最後の層の外れ値の寸法、つまり、大部分の入力に対して極端な活性化を示す寸法 … 続きを読む →

カテゴリー: cs.AI, cs.CL, I.2.7 | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

Using AI to Summarize US Presidential Campaign TV Advertisement Videos, 1952-2012

KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation

USC: Uncompromising Spatial Constraints for Safety-Oriented 3D Object Detectors in Autonomous Driving

RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models

VidTwin: Video VAE with Decoupled Structure and Dynamics

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Evaluation of Machine-generated Biomedical Images via A Tally-based Similarity Measure

DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness

Outlier dimensions favor frequent tokens in language models

最近の投稿

最近のコメント

アーカイブ

カテゴリー