月別アーカイブ: 2024年4月

RadRotator: 3D Rotation of Radiographs with Diffusion Models

投稿日: 2024年4月22日作成者: jarxiv

要約 2 次元 (2D) イメージを 3 次元 (3D) ボリュームに変換するこ … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction

投稿日: 2024年4月22日作成者: jarxiv

要約鉄鋼生産分野では、鉄スクラップのリサイクルはエネルギー消費と温室効果ガス排 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

投稿日: 2024年4月22日作成者: jarxiv

要約根拠があり、きめ細かい視覚認識能力を備えたマルチモーダル大規模言語モデル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Optimizing Calibration by Gaining Aware of Prediction Correctness

投稿日: 2024年4月22日作成者: jarxiv

要約モデルのキャリブレーションは、信頼性と予測の正確さを一致させることを目的と … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

BANF: Band-limited Neural Fields for Levels of Detail Reconstruction

投稿日: 2024年4月22日作成者: jarxiv

要約主にその暗黙的な性質により、離散信号処理からのフーリエ解析がこれらの表現に … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

投稿日: 2024年4月22日作成者: jarxiv

要約現実的なオブジェクトのインタラクションは、没入型の仮想体験を作成するために … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LaPA: Latent Prompt Assist Model For Medical Visual Question Answering

投稿日: 2024年4月22日作成者: jarxiv

要約 Medical Visual Question Answering (Me … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Analysis of Classifier-Free Guidance Weight Schedulers

投稿日: 2024年4月22日作成者: jarxiv

要約 Classifier-Free Guide (CFG) は、テキストから画 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Data Alignment for Zero-Shot Concept Generation in Dermatology AI

投稿日: 2024年4月22日作成者: jarxiv

要約皮膚科における AI は急速に進化していますが、信頼できる分類器をトレーニ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Unified Scene Representation and Reconstruction for 3D Large Language Models

投稿日: 2024年4月22日作成者: jarxiv

要約大規模言語モデル (LLM) が 3D 環境と対話できるようにすることは困 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年4月

RadRotator: 3D Rotation of Radiographs with Diffusion Models

Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Optimizing Calibration by Gaining Aware of Prediction Correctness

BANF: Band-limited Neural Fields for Levels of Detail Reconstruction

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

LaPA: Latent Prompt Assist Model For Medical Visual Question Answering

Analysis of Classifier-Free Guidance Weight Schedulers

Data Alignment for Zero-Shot Concept Generation in Dermatology AI

Unified Scene Representation and Reconstruction for 3D Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー