「cs.CV」カテゴリーアーカイブ

MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of Mice

投稿日: 2024年9月6日作成者: jarxiv

要約注釈付きの大規模なデータセットによって可能になったビデオ内のオブジェクトの … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding

投稿日: 2024年9月6日作成者: jarxiv

要約大規模言語モデル (LLM) で 3D 物理世界を理解できるようにすること … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

ChartMoE: Mixture of Expert Connector for Advanced Chart Understanding

投稿日: 2024年9月6日作成者: jarxiv

要約グラフの自動理解は、内容の理解と文書の解析にとって非常に重要です。マルチ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

KAN See In the Dark

投稿日: 2024年9月6日作成者: jarxiv

要約既存の低照度画像強調方法は、不均一な照明とノイズの影響により、通常画像と低 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

TG-LMM: Enhancing Medical Image Segmentation Accuracy through Text-Guided Large Multi-Modal Model

投稿日: 2024年9月6日作成者: jarxiv

要約我々は、臓器のテキストによる説明を活用して医療画像のセグメンテーション精度 … 続きを読む →

カテゴリー: 68T07, cs.CV, physics.med-ph | コメントを受け付けていません

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

投稿日: 2024年9月6日作成者: jarxiv

要約マルチモデル大規模言語モデル (MLLM) は、サポートされているドキュメ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Weight Conditioning for Smooth Optimization of Neural Networks

投稿日: 2024年9月6日作成者: jarxiv

要約この記事では、ニューラルネットワークの重み行列の新しい正規化手法 (重み … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images

投稿日: 2024年9月6日作成者: jarxiv

要約多様な地理的環境、複雑な景観、高密度の集落のため、リモートセンシング画像を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Key-Driven Framework for Identity-Preserving Face Anonymization

投稿日: 2024年9月6日作成者: jarxiv

要約仮想顔はメタバースの重要なコンテンツです。最近、プライバシー保護のために … 続きを読む →

カテゴリー: cs.CR, cs.CV | コメントを受け付けていません

Enhancing Facial Expression Recognition through Dual-Direction Attention Mixed Feature Networks: Application to 7th ABAW Challenge

投稿日: 2024年9月6日作成者: jarxiv

要約私たちは、ECCV 2024 で第 7 回 ABAW チャレンジへの貢献を … 続きを読む →

カテゴリー: cs.CV, I.4 | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of Mice

More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding

ChartMoE: Mixture of Expert Connector for Advanced Chart Understanding

KAN See In the Dark

TG-LMM: Enhancing Medical Image Segmentation Accuracy through Text-Guided Large Multi-Modal Model

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

Weight Conditioning for Smooth Optimization of Neural Networks

UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images

A Key-Driven Framework for Identity-Preserving Face Anonymization

Enhancing Facial Expression Recognition through Dual-Direction Attention Mixed Feature Networks: Application to 7th ABAW Challenge

最近の投稿

最近のコメント

アーカイブ

カテゴリー