月別アーカイブ: 2024年6月

ReduceFormer: Attention with Tensor Reduction by Summation

投稿日: 2024年6月12日作成者: jarxiv

要約トランスフォーマーは視覚を含む多くのタスクで優れています。ただし、低レイ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Trim 3D Gaussian Splatting for Accurate Geometry Representation

投稿日: 2024年6月12日作成者: jarxiv

要約このペーパーでは、画像から正確な 3D ジオメトリを再構築するための Tr … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

SPIN: Spacecraft Imagery for Navigation

投稿日: 2024年6月12日作成者: jarxiv

要約宇宙運用のコストと複雑さのため、宇宙運用条件で取得されるデータは不足してい … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions

投稿日: 2024年6月12日作成者: jarxiv

要約画像記述データセットは、画像の理解、テキストから画像への生成、テキストから … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Understanding Visual Concepts Across Models

投稿日: 2024年6月12日作成者: jarxiv

要約安定拡散などの大規模なマルチモーダルモデルでは、たった 1 つの単語の埋 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

RudolfV: A Foundation Model by Pathologists for Pathologists

投稿日: 2024年6月12日作成者: jarxiv

要約人工知能は、臨床診断や生物医学研究に影響を与える組織病理学を変革し始めてい … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Instant 3D Human Avatar Generation using Image Diffusion Models

投稿日: 2024年6月12日作成者: jarxiv

要約画像やテキストプロンプトなどのさまざまな入力モダリティから、生成されたポ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Neural Gaffer: Relighting Any Object via Diffusion

投稿日: 2024年6月12日作成者: jarxiv

要約単一イメージのリライティングは、ジオメトリ、マテリアル、ライティングの間の … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR | コメントを受け付けていません

Hearing Anything Anywhere

投稿日: 2024年6月12日作成者: jarxiv

要約近年、3D コンピュータビジョンとコンピュータグラフィックスが大幅に進 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.SD, eess.AS, I.2.10 | コメントを受け付けていません

Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection

投稿日: 2024年6月12日作成者: jarxiv

要約深層学習テクノロジーの進歩により、毎日新しいモデルが誕生し、スケーラブルな … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

月別アーカイブ: 2024年6月

ReduceFormer: Attention with Tensor Reduction by Summation

Trim 3D Gaussian Splatting for Accurate Geometry Representation

SPIN: Spacecraft Imagery for Navigation

Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions

Understanding Visual Concepts Across Models

RudolfV: A Foundation Model by Pathologists for Pathologists

Instant 3D Human Avatar Generation using Image Diffusion Models

Neural Gaffer: Relighting Any Object via Diffusion

Hearing Anything Anywhere

Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection

最近の投稿

最近のコメント

アーカイブ

カテゴリー