月別アーカイブ: 2024年1月

Instruct-Imagen: Image Generation with Multi-modal Instruction

投稿日: 2024年1月5日作成者: jarxiv

要約本稿では、異種の画像生成タスクに取り組み、未知のタスクに汎化するモデルであ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Lon-ea at SemEval-2023 Task 11: A Comparison of Activation Functions for Soft and Hard Label Prediction

投稿日: 2024年1月5日作成者: jarxiv

要約我々は、不一致を伴う学習タスクにおける、ソフト及びハードラベル予測のための … 続きを読む →

カテゴリー: cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity

投稿日: 2024年1月5日作成者: jarxiv

要約アライメントアルゴリズムは、現在、事前学習された言語モデルをユーザの嗜好に … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Revisiting Zero-Shot Abstractive Summarization in the Era of Large Language Models from the Perspective of Position Bias

投稿日: 2024年1月5日作成者: jarxiv

要約我々は、ラージ・ランゲージ・モデル（LLM）におけるゼロショット抽象的要約 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

投稿日: 2024年1月5日作成者: jarxiv

要約ラージ・ランゲージ・モデル（LLM）のリフレクション能力は注目されている。 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

VideoChat: Chat-Centric Video Understanding

投稿日: 2024年1月5日作成者: jarxiv

要約本論文では、VideoChatと呼ばれる、エンドツーエンドのチャット中心の … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

投稿日: 2024年1月5日作成者: jarxiv

要約大規模な言語モデルは、分子のテキスト表現を処理することにより、分子科学にお … 続きを読む →

カテゴリー: cs.CL, cs.LG, q-bio.BM | コメントを受け付けていません

Text2MDT: Extracting Medical Decision Trees from Medical Texts

投稿日: 2024年1月5日作成者: jarxiv

要約医療意思決定支援システムを構築するためには、医療意思決定ツリー（MDT）と … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Understanding LLMs: A Comprehensive Overview from Training to Inference

投稿日: 2024年1月5日作成者: jarxiv

要約 ChatGPTの導入により、下流タスクに対応するための大規模言語モデル（L … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Theory of Hallucinations based on Equivariance

投稿日: 2024年1月5日作成者: jarxiv

要約本研究の目的は、幻覚を起こさない超大規模言語モデルを作成するための知識を得 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年1月

Instruct-Imagen: Image Generation with Multi-modal Instruction

Lon-ea at SemEval-2023 Task 11: A Comparison of Activation Functions for Soft and Hard Label Prediction

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity

Revisiting Zero-Shot Abstractive Summarization in the Era of Large Language Models from the Perspective of Position Bias

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

VideoChat: Chat-Centric Video Understanding

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

Text2MDT: Extracting Medical Decision Trees from Medical Texts

Understanding LLMs: A Comprehensive Overview from Training to Inference

Theory of Hallucinations based on Equivariance

最近の投稿

最近のコメント

アーカイブ

カテゴリー