月別アーカイブ: 2024年8月

Edit As You Wish: Video Caption Editing with Multi-grained User Control

投稿日: 2024年8月9日作成者: jarxiv

要約ユーザーのリクエストに応じて自然言語でビデオを自動的にナレーションすること … 続きを読む →

カテゴリー: cs.CV, cs.MM | コメントを受け付けていません

Enhancing Journalism with AI: A Study of Contextualized Image Captioning for News Articles using LLMs and LMMs

投稿日: 2024年8月9日作成者: jarxiv

要約大規模言語モデル (LLM) と大規模マルチモーダルモデル (LMM) … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

HARMamba: Efficient and Lightweight Wearable Sensor Human Activity Recognition Based on Bidirectional Mamba

投稿日: 2024年8月9日作成者: jarxiv

要約ウェアラブルセンサーベースの人間活動認識 (HAR) は、活動認識にお … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Fast and Accurate Object Detection on Asymmetrical Receptive Field

投稿日: 2024年8月9日作成者: jarxiv

要約物体検出は幅広い業界で使用されています。たとえば、自動運転における物体検 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Self-supervised visual learning from interactions with objects

投稿日: 2024年8月9日作成者: jarxiv

要約自己教師あり学習 (SSL) は視覚表現学習に革命をもたらしましたが、人間 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

AggSS: An Aggregated Self-Supervised Approach for Class-Incremental Learning

投稿日: 2024年8月9日作成者: jarxiv

要約この論文では、自己教師あり学習、特に画像の回転がさまざまなクラス増分学習パ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework

投稿日: 2024年8月9日作成者: jarxiv

要約交通事故は世界のほぼすべての地域で非常に頻繁に発生しており、死亡事故の大多 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

MultiViPerFrOG: A Globally Optimized Multi-Viewpoint Perception Framework for Camera Motion and Tissue Deformation

投稿日: 2024年8月9日作成者: jarxiv

要約移動深度カメラによって捕捉された情報から変形可能な環境の 3D 形状を再構 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GenAD: Generalized Predictive Model for Autonomous Driving

投稿日: 2024年8月9日作成者: jarxiv

要約この論文では、自動運転分野における初の大規模ビデオ予測モデルを紹介します。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

P2LHAP:Wearable sensor-based human activity recognition, segmentation and forecast through Patch-to-Label Seq2Seq Transformer

投稿日: 2024年8月9日作成者: jarxiv

要約従来の深層学習手法では、センサーデータから人間の活動を同時にセグメント化 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年8月

Edit As You Wish: Video Caption Editing with Multi-grained User Control

Enhancing Journalism with AI: A Study of Contextualized Image Captioning for News Articles using LLMs and LMMs

HARMamba: Efficient and Lightweight Wearable Sensor Human Activity Recognition Based on Bidirectional Mamba

Fast and Accurate Object Detection on Asymmetrical Receptive Field

Self-supervised visual learning from interactions with objects

AggSS: An Aggregated Self-Supervised Approach for Class-Incremental Learning

Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework

MultiViPerFrOG: A Globally Optimized Multi-Viewpoint Perception Framework for Camera Motion and Tissue Deformation

GenAD: Generalized Predictive Model for Autonomous Driving

P2LHAP:Wearable sensor-based human activity recognition, segmentation and forecast through Patch-to-Label Seq2Seq Transformer

最近の投稿

最近のコメント

アーカイブ

カテゴリー