投稿者「jarxiv」のアーカイブ

Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities

投稿日: 2025年4月3日作成者: jarxiv

要約参照式セグメンテーション（RES）は、記述言語式に一致するエンティティのマ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Scene-Centric Unsupervised Panoptic Segmentation

投稿日: 2025年4月3日作成者: jarxiv

要約監視されていないパノプティックセグメンテーションは、手動で注釈付きのデータ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

投稿日: 2025年4月3日作成者: jarxiv

要約スパースビューから3Dシーンを回復することは、その固有の不適切な問題のため … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GaussianLSS — Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting

投稿日: 2025年4月3日作成者: jarxiv

要約バードアイビュー（BEV）の認識は、複数のビュー画像を融合するための統一さ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis

投稿日: 2025年4月3日作成者: jarxiv

要約 3Dガウスの飛び散（3DG）および神経放射輝度（NERF）の最近の進歩は、 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Learning from Streaming Video with Orthogonal Gradients

投稿日: 2025年4月3日作成者: jarxiv

要約私たちは、自己教師の方法で、入力としての動画の連続的なストリームから学習す … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation

投稿日: 2025年4月3日作成者: jarxiv

要約大規模な現実世界のロボットデータセットは、ジェネラリストのロボットモデルを … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Non-Determinism of ‘Deterministic’ LLM Settings

投稿日: 2025年4月3日作成者: jarxiv

要約 LLM（大規模な言語モデル）開業医は、一般に、出力が決定論的と予想される設 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SE | コメントを受け付けていません

Low-resource Machine Translation: what for? who for? An observational study on a dedicated Tetun language translation service

投稿日: 2025年4月3日作成者: jarxiv

要約低リソースの機械翻訳（MT）は、コミュニティのニーズとアプリケーションの課 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection

投稿日: 2025年4月3日作成者: jarxiv

要約通信詐欺の検出は、オーディオ信号を推論指向のテキスト分析と統合する高品質の … 続きを読む →

カテゴリー: cs.CL, cs.MM | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities

Scene-Centric Unsupervised Panoptic Segmentation

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

GaussianLSS — Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting

Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis

Learning from Streaming Video with Orthogonal Gradients

Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation

Non-Determinism of ‘Deterministic’ LLM Settings

Low-resource Machine Translation: what for? who for? An observational study on a dedicated Tetun language translation service

TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection

最近の投稿

最近のコメント

アーカイブ

カテゴリー