月別アーカイブ: 2025年1月

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

投稿日: 2025年1月9日作成者: jarxiv

要約思考連鎖 (CoT) 推論は、大規模言語モデル (LLM) の数学的推論に … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

投稿日: 2025年1月9日作成者: jarxiv

要約世界とのインタラクションは、多感覚体験です。効果的な汎用インタラクションを … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

EpiCoder: Encompassing Diversity and Complexity in Code Generation

投稿日: 2025年1月9日作成者: jarxiv

要約コード LLM を最適化し、モデルの動作をユーザーの期待に合わせて調整し、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

The Role of Machine Learning in Congenital Heart Disease Diagnosis: Datasets, Algorithms, and Insights

投稿日: 2025年1月9日作成者: jarxiv

要約先天性心疾患は、最も一般的な胎児の異常および先天異常の 1 つです。その … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling

投稿日: 2025年1月9日作成者: jarxiv

要約 3D ガウススプラッティング (3DGS) は、その優れたビジュアル品質 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Efficient Video-Based ALPR System Using YOLO and Visual Rhythm

投稿日: 2025年1月9日作成者: jarxiv

要約自動ナンバープレート認識 (ALPR) には、画像またはビデオキャプチ … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

From Pixels to Titles: Video Game Identification by Screenshots using Convolutional Neural Networks

投稿日: 2025年1月9日作成者: jarxiv

要約このペーパーでは、10 個の畳み込みニューラルネットワーク (CNN) … 続きを読む →

カテゴリー: cs.CV, cs.NE | コメントを受け付けていません

Energy-based Hopfield Boosting for Out-of-Distribution Detection

投稿日: 2025年1月9日作成者: jarxiv

要約機械学習モデルを現実世界に展開する場合、配布外 (OOD) の検出が重要で … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation

投稿日: 2025年1月9日作成者: jarxiv

要約最近の研究では、CLIP を利用して、注釈のない画像のみを利用できる、困難 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time

投稿日: 2025年1月9日作成者: jarxiv

要約自動的に予測された人間のフィードバックを生成モデルのトレーニングプロセス … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年1月

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

EpiCoder: Encompassing Diversity and Complexity in Code Generation

The Role of Machine Learning in Congenital Heart Disease Diagnosis: Datasets, Algorithms, and Insights

Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling

Efficient Video-Based ALPR System Using YOLO and Visual Rhythm

From Pixels to Titles: Video Game Identification by Screenshots using Convolutional Neural Networks

Energy-based Hopfield Boosting for Out-of-Distribution Detection

ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation

Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time

最近の投稿

最近のコメント

アーカイブ

カテゴリー