「cs.AI」カテゴリーアーカイブ

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark

投稿日: 2024年2月8日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は最近大きな注目を集めており … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

投稿日: 2024年2月8日作成者: jarxiv

要約このペーパーでは、ソースフリードメイン適応のための拡散モデル (DM-S … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Text or Image? What is More Important in Cross-Domain Generalization Capabilities of Hate Meme Detection Models?

投稿日: 2024年2月8日作成者: jarxiv

要約この論文は、マルチモーダルなヘイトミーム検出におけるクロスドメイン一般化と … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training

投稿日: 2024年2月8日作成者: jarxiv

要約現在の最先端の 6D 姿勢推定は、ますます多くの拡張現実アプリケーションに … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

投稿日: 2024年2月8日作成者: jarxiv

要約私たちは、加速されたセグメント何でもモデルの新しいファミリーである Eff … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

投稿日: 2024年2月8日作成者: jarxiv

要約屋外の視覚と言語のナビゲーション (VLN) では、エージェントが自然言語 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Image captioning for Brazilian Portuguese using GRIT model

投稿日: 2024年2月8日作成者: jarxiv

要約この研究は、ブラジル系ポルトガル語の画像キャプションモデルの初期開発を示し … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Advancing Legal Reasoning: The Integration of AI to Navigate Complexities and Biases in Global Jurisprudence with Semi-Automated Arbitration Processes (SAAPs)

投稿日: 2024年2月8日作成者: jarxiv

要約この研究は、米国、英国、ルワンダ、スウェーデン、香港を含む 5 か国にわた … 続きを読む →

カテゴリー: cs.AI, cs.CY, cs.HC | コメントを受け付けていません

High-dimensional and Permutation Invariant Anomaly Detection

投稿日: 2024年2月8日作成者: jarxiv

要約新しい物理プロセスの異常検出方法は、高次元の確率密度を学習することが難しい … 続きを読む →

カテゴリー: cs.AI, cs.LG, hep-ex, hep-ph | コメントを受け付けていません

Can Generative Agents Predict Emotion?

投稿日: 2024年2月8日作成者: jarxiv

要約大規模言語モデル (LLM) は、人間に似た多くの能力を実証してきましたが … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark

Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

Text or Image? What is More Important in Cross-Domain Generalization Capabilities of Hate Meme Detection Models?

Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Image captioning for Brazilian Portuguese using GRIT model

Advancing Legal Reasoning: The Integration of AI to Navigate Complexities and Biases in Global Jurisprudence with Semi-Automated Arbitration Processes (SAAPs)

High-dimensional and Permutation Invariant Anomaly Detection

Can Generative Agents Predict Emotion?

最近の投稿

最近のコメント

アーカイブ

カテゴリー