"cs.CV" Category Archive

Detect an Object At Once without Fine-tuning

Posted: November 5, 2024 | Author: jarxiv

Summary: When presented with one or a few photos of a previously unseen object, …

Categories: cs.AI, cs.CV

Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the role of model complexity

Posted: November 5, 2024 | Author: jarxiv

Summary: Overparameterization is known to benefit generalization, but out-of-distribution ( …

Categories: cs.AI, cs.CV, cs.LG, math.ST, stat.ML, stat.TH

Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models

Posted: November 5, 2024 | Author: jarxiv

Summary: The accuracy of face recognition systems, with large amounts of collected data and neural network …

Categories: cs.CV

Fast yet Safe: Early-Exiting with Risk Control

Posted: November 5, 2024 | Author: jarxiv

Summary: Scaling machine learning models significantly improves their performance. However, such …

Categories: cs.AI, cs.CV, cs.LG, stat.ML

GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration

Posted: November 5, 2024 | Author: jarxiv

Summary: Generalist foundation models (GFMs), across diverse tasks and modalities, effectively …

Categories: cs.CL, cs.CV

One VLM to Keep it Learning: Generation and Balancing for Data-free Continual Visual Question Answering

Posted: November 5, 2024 | Author: jarxiv

Summary: Vision-language models (VLMs), web-scale multimodal dataset …

Categories: cs.CV

SIRA: Scalable Inter-frame Relation and Association for Radar Perception

Posted: November 5, 2024 | Author: jarxiv

Summary: Conventional radar feature extraction, with low spatial resolution, noise, multipath reflections, ghost …

Categories: cs.CV, cs.LG

FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training

Posted: November 5, 2024 | Author: jarxiv

Summary: The field of novel view synthesis from images, neural radiance fields (N …

Categories: cs.CV

3D Audio-Visual Segmentation

Posted: November 5, 2024 | Author: jarxiv

Summary: Recognizing sounding objects in a scene is a long-standing challenge in embodied AI, and robo …

Categories: cs.CV, cs.MM, cs.SD, eess.AS

SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation

Posted: November 5, 2024 | Author: jarxiv

Summary: Most prior work on talking face generation has focused on synchronizing lip movements with speech content …

Categories: cs.CV, I.4.5