月別アーカイブ: 2024年7月

For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives

投稿日: 2024年7月4日作成者: jarxiv

要約ソーシャルネットワークは、人間の顔や体のイメージの認知的、感情的、実用的な … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars

投稿日: 2024年7月4日作成者: jarxiv

要約本論文の目的は、Spoken2Sign翻訳と呼ばれる、音声言語を手話言語に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VCHAR:Variance-Driven Complex Human Activity Recognition framework with Generative Representation

投稿日: 2024年7月4日作成者: jarxiv

要約複雑な人間の活動認識（CHAR）は、ユビキタスコンピューティング、特にスマ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, eess.SP | コメントを受け付けていません

Biomechanics-informed Non-rigid Medical Image Registration and its Inverse Material Property Estimation with Linear and Nonlinear Elasticity

投稿日: 2024年7月4日作成者: jarxiv

要約本論文では、物理情報ニューラルネットワーク（PINN）を用いて、生体力学的 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

投稿日: 2024年7月4日作成者: jarxiv

要約大規模言語モデル（LLM）の開発は、大規模マルチモーダルモデル（LMM）の … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Improved Noise Schedule for Diffusion Training

投稿日: 2024年7月4日作成者: jarxiv

要約拡散モデルは、視覚信号を生成するための事実上の選択肢として登場した。しかし … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery

投稿日: 2024年7月4日作成者: jarxiv

要約ラベル付けされていないデータセットにおいて、継続的に新しい概念を発見するこ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents

投稿日: 2024年7月4日作成者: jarxiv

要約拡散モデル（DM）は生成学習に革命をもたらした。DMは拡散過程を利用して、 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Smart City Surveillance Unveiling Indian Person Attributes in Real Time

投稿日: 2024年7月4日作成者: jarxiv

要約このプロジェクトは、リアルタイムで人の属性を識別・分析できるインドの都市向 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

投稿日: 2024年7月4日作成者: jarxiv

要約デジタル病理学において、ディープラーニングに基づく画像セグメンテーションの … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

月別アーカイブ: 2024年7月

For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars

VCHAR:Variance-Driven Complex Human Activity Recognition framework with Generative Representation

Biomechanics-informed Non-rigid Medical Image Registration and its Inverse Material Property Estimation with Linear and Nonlinear Elasticity

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Improved Noise Schedule for Diffusion Training

Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents

Smart City Surveillance Unveiling Indian Person Attributes in Real Time

HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

最近の投稿

最近のコメント

アーカイブ

カテゴリー