"cs.AI" Category Archive

Boosting Camera Motion Control for Video Diffusion Transformers

Posted: October 15, 2024 | Author: jarxiv

Abstract: Recent advances in diffusion models have significantly improved the quality of video generation. However …

Categories: cs.AI, cs.CV

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Posted: October 15, 2024 | Author: jarxiv

Abstract: Rivaling diffusion models in image generation quality, 1024×1024 …

Categories: cs.AI, cs.CV, cs.LG

Depth Any Video with Scalable Synthetic Data

Posted: October 15, 2024 | Author: jarxiv

Abstract: In video depth estimation, consistent and scalable ground-truth data …

Categories: cs.AI, cs.CV

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Posted: October 15, 2024 | Author: jarxiv

Abstract: The effectiveness of video generation models depends heavily on the quality of their training datasets …

Categories: cs.AI, cs.CV, cs.LG

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Posted: October 15, 2024 | Author: jarxiv

Abstract: For multimodal video understanding and generation, understanding fine-grained temporal dynamics …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Posted: October 15, 2024 | Author: jarxiv

Abstract: Jailbreak attacks, in which users design prompts that circumvent safety measures and misuse model capabilities, …

Categories: cs.AI, cs.CL, cs.LG

SimpleStrat: Diversifying Language Model Generation with Stratification

Posted: October 15, 2024 | Author: jarxiv

Abstract: Generating diverse responses from large language models (LLMs), where diversity …

Categories: cs.AI, cs.CL

DCNet: A Data-Driven Framework for DVL Calibration

Posted: October 15, 2024 | Author: jarxiv

Abstract: Autonomous underwater vehicles (AUVs) are underwater robots used for a variety of applications …

Categories: cs.AI, cs.RO

Learning Representations of Instruments for Partial Identification of Treatment Effects

Posted: October 15, 2024 | Author: jarxiv

Abstract: Reliably estimating treatment effects from observational data, in many fields such as medicine, …

Categories: cs.AI, cs.LG, stat.ML

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Posted: October 15, 2024 | Author: jarxiv

Abstract: Direct Preference Optimization (DPO) …

Categories: cs.AI, cs.CL, cs.LG, stat.ML
