月別アーカイブ: 2024年4月

On the Content Bias in Fréchet Video Distance

投稿日: 2024年4月19日作成者: jarxiv

要約ビデオ生成モデルを評価するための著名な指標である Fr\’ec … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models

投稿日: 2024年4月19日作成者: jarxiv

要約混合一般化線形モデルの目的は、ラベルのない観測から複数の信号を学習すること … 続きを読む →

カテゴリー: cs.IT, cs.LG, math.IT, math.ST, stat.ML, stat.TH | コメントを受け付けていません

Can LLMs perform structured graph reasoning?

投稿日: 2024年4月19日作成者: jarxiv

要約事前トレーニングされた大規模言語モデル (LLM) は、特に非構造化タスク … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

End-To-End Training and Testing Gamification Framework to Learn Human Highway Driving

投稿日: 2024年4月19日作成者: jarxiv

要約現在の自律スタックは十分にモジュール化されており、手作りのフレームワークで … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

FlexMap Fusion: Georeferencing and Automated Conflation of HD Maps with OpenStreetMap

投稿日: 2024年4月19日作成者: jarxiv

要約現在の自動運転車用のソフトウェアスタックは、HD マップに依存して、十分 … 続きを読む →

カテゴリー: cs.RO | コメントを受け付けていません

Predicting Traffic Congestion at Urban Intersections Using Data-Driven Modeling

投稿日: 2024年4月19日作成者: jarxiv

要約都市部では交差点での交通渋滞が重大な問題となっており、通勤時間の増加、安全 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent

投稿日: 2024年4月19日作成者: jarxiv

要約マルチモーダル AI エージェントは、自然言語、視覚、音声入力を含むさまざ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

JointViT: Modeling Oxygen Saturation Levels with Joint Supervision on Long-Tailed OCTA

投稿日: 2024年4月19日作成者: jarxiv

要約血中の酸素飽和度 (SaO2) は、健康にとって、特に睡眠関連の呼吸障害に … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

投稿日: 2024年4月19日作成者: jarxiv

要約テキストアニメーションは表現媒体として機能し、言葉に動きを吹き込んで感情 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

TaCOS: Task-Specific Camera Optimization with Simulation

投稿日: 2024年4月19日作成者: jarxiv

要約アプリケーションにおけるロボットのパフォーマンスは、感覚入力の質に大きく依 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

月別アーカイブ: 2024年4月

On the Content Bias in Fréchet Video Distance

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models

Can LLMs perform structured graph reasoning?

End-To-End Training and Testing Gamification Framework to Learn Human Highway Driving

FlexMap Fusion: Georeferencing and Automated Conflation of HD Maps with OpenStreetMap

Predicting Traffic Congestion at Urban Intersections Using Data-Driven Modeling

Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent

JointViT: Modeling Oxygen Saturation Levels with Joint Supervision on Long-Tailed OCTA

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

TaCOS: Task-Specific Camera Optimization with Simulation

最近の投稿

最近のコメント

アーカイブ

カテゴリー