「cs.IR」カテゴリーアーカイブ

Semantically Enhanced Hard Negatives for Cross-modal Information Retrieval

投稿日: 2023年2月14日作成者: jarxiv

要約 Visual Semantic Embedding (VSE) は、画像の … 続きを読む →

カテゴリー: cs.CV, cs.IR | コメントを受け付けていません

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

投稿日: 2023年2月10日作成者: jarxiv

要約ビジョンエンコーダー (Flamingo など) を使用して事前トレーニ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

投稿日: 2023年2月7日作成者: jarxiv

要約画像テキスト検索（Image-text retrieval: ITR）は、 … 続きを読む →

カテゴリー: cs.CV, cs.IR | コメントを受け付けていません

Open Problems in Applied Deep Learning

投稿日: 2023年1月27日作成者: jarxiv

要約この作業は、機械学習メカニズムをバイレベル最適化問題として定式化します。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.IR, cs.LG | コメントを受け付けていません

Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study

投稿日: 2023年1月13日作成者: jarxiv

要約クロスモーダル検索 (CMR) へのほとんどのアプローチは、オブジェクト中 … 続きを読む →

カテゴリー: cs.CV, cs.IR, cs.LG, cs.MM | コメントを受け付けていません

Online Backfilling with No Regret for Large-Scale Image Retrieval

投稿日: 2023年1月11日作成者: jarxiv

要約バックフィルは、画像検索システムでアップグレードされたモデルからすべてのギ … 続きを読む →

カテゴリー: cs.CV, cs.IR, cs.LG | コメントを受け付けていません

Retrieving Users’ Opinions on Social Media with Multimodal Aspect-Based Sentiment Analysis

投稿日: 2023年1月10日作成者: jarxiv

要約人々は自分の意見や経験をソーシャルメディアに投稿し、エンドユーザーの感情を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

Saliency-Aware Spatio-Temporal Artifact Detection for Compressed Video Quality Assessment

投稿日: 2023年1月4日作成者: jarxiv

要約圧縮された映像には，Perceivable Encoding Artifa … 続きを読む →

カテゴリー: cs.CV, cs.IR, eess.IV | コメントを受け付けていません

DCC: A Cascade based Approach to Detect Communities in Social Networks

投稿日: 2022年12月22日作成者: jarxiv

要約ソーシャルネットワークのコミュニティ検出は、ネットワークに固有の最も類似 … 続きを読む →

カテゴリー: cs.CV, cs.IR, cs.LG, cs.SI, J.4; G.4; I.6 | コメントを受け付けていません

Reasoning with Language Model Prompting: A Survey

投稿日: 2022年12月20日作成者: jarxiv

要約複雑な問題解決に不可欠な能力である推論は、医療診断、交渉など、さまざまな実 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

「cs.IR」カテゴリーアーカイブ

Semantically Enhanced Hard Negatives for Cross-modal Information Retrieval

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

Open Problems in Applied Deep Learning

Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study

Online Backfilling with No Regret for Large-Scale Image Retrieval

Retrieving Users’ Opinions on Social Media with Multimodal Aspect-Based Sentiment Analysis

Saliency-Aware Spatio-Temporal Artifact Detection for Compressed Video Quality Assessment

DCC: A Cascade based Approach to Detect Communities in Social Networks

Reasoning with Language Model Prompting: A Survey

最近の投稿

最近のコメント

アーカイブ

カテゴリー