Archive of posts by "jarxiv"

Automation of Quantum Dot Measurement Analysis via Explainable Machine Learning

Posted on January 14, 2025 by jarxiv

Summary: With the rapid development of quantum dot (QD) devices for quantum computing, …

Categories: cond-mat.mes-hall, cs.CV, cs.LG

A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion

Posted on January 14, 2025 by jarxiv

Summary: Model compression, for large-scale Computer Vision on embedded devices, …

Categories: 68T45, cs.CV, I.2.10

ScVLM: Enhancing Vision-Language Model for Safety-Critical Event Understanding

Posted on January 14, 2025 by jarxiv

Summary: Traffic safety-critical events (SCE) such as collisions, tire strikes, and near-collisions …

Categories: cs.CV

Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective

Posted on January 14, 2025 by jarxiv

Summary: State-of-the-art Transformer-based semantic segmentation …

Categories: cs.CV, cs.LG

3DGS-to-PC: Convert a 3D Gaussian Splatting Scene into a Dense Point Cloud or Mesh

Posted on January 14, 2025 by jarxiv

Summary: With 3D Gaussian Splatting (3DGS), highly detailed 3D reconstruction …

Categories: cs.CV, cs.GR, I.2.10

Agentic Copyright Watermarking against Adversarial Evidence Forgery with Purification-Agnostic Curriculum Proxy Learning

Posted on January 14, 2025 by jarxiv

Summary: As AI agents proliferate across various domains, AI model …

Categories: cs.CR, cs.CV

Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection Method

Posted on January 14, 2025 by jarxiv

Summary: In weakly supervised violence detection, only video-level labels are used for violence in videos …

Categories: cs.CV

RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision

Posted on January 14, 2025 by jarxiv

Summary: Language-supervised pre-training, for extracting semantically meaningful features from images, …

Categories: cs.CV

Three-view Focal Length Recovery From Homographies

Posted on January 14, 2025 by jarxiv

Summary: In this paper, a new approach for recovering focal length from three-view homographies …

Categories: cs.CV

Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance

Posted on January 14, 2025 by jarxiv

Summary: 3D Gaussian Splatting, which reconstructs detailed 3D scenes in multi-view settings, …

Categories: cs.CV
