「cs.CL」カテゴリーアーカイブ

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)

投稿日: 2025年4月17日作成者: jarxiv

要約ロボット工学の急速に進歩する分野では、デュアルアーム調整と複雑なオブジェク … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.RO | コメントを受け付けていません

Efficient Contrastive Decoding with Probabilistic Hallucination Detection – Mitigating Hallucinations in Large Vision Language Models –

投稿日: 2025年4月17日作成者: jarxiv

要約大規模なビジョン言語モデル（LVLMS）の最近の進歩にもかかわらず、これら … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

投稿日: 2025年4月17日作成者: jarxiv

要約ドキュメント解析は、契約、学術論文、請求書などの非構造化および半構造化され … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Taming Data and Transformers for Audio Generation

投稿日: 2025年4月17日作成者: jarxiv

要約アンビエントサウンドジェネレーターのスケーラビリティは、データ不足、キャプ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

Automatic Item Generation for Personality Situational Judgment Tests with Large Language Models

投稿日: 2025年4月17日作成者: jarxiv

要約特に状況判断テスト（SJTS）を通じて、人格評価は、心理的研究、人材選択、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, I.2.1 | コメントを受け付けていません

Automated Python Translation

投稿日: 2025年4月17日作成者: jarxiv

要約 Pythonは、業界と教育で最も一般的に使用されるプログラミング言語の1つ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis

投稿日: 2025年4月17日作成者: jarxiv

要約大規模なビジョン言語モデルの最近の進歩は、デジタルデバイスの生産性を高める … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.HC | コメントを受け付けていません

CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates

投稿日: 2025年4月16日作成者: jarxiv

要約インテリジェント接続車両（ICV）と統合された車両ロードクラウドシステ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO, I.2.10 | コメントを受け付けていません

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

投稿日: 2025年4月16日作成者: jarxiv

要約既存の学習ベースの自律運転（AD）システムは、高レベルの情報を理解し、まれ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis

投稿日: 2025年4月16日作成者: jarxiv

要約マルチモーダル融合はマルチモーダルセンチメント分析（MSA）で広く研究され … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)

Efficient Contrastive Decoding with Probabilistic Hallucination Detection – Mitigating Hallucinations in Large Vision Language Models –

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

Taming Data and Transformers for Audio Generation

Automatic Item Generation for Personality Situational Judgment Tests with Large Language Models

Automated Python Translation

UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis

CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis

最近の投稿

最近のコメント

アーカイブ

カテゴリー