I'm Tanabe from MUFG AI Studio (M-AIS) at Japan Digital Design (JDD), a strategic subsidiary of Mitsubishi UFJ Financial Group (MUFG).





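An LLM (Large Language Model) is a model trained on a vast amount of text data with the goal of processing language the way humans do.
Before LLMs, building a language model for each language task (summarization, translation, explanation, and so on) required pre-training on a large text corpus and then, to adapt the model to a specific task, a further task-specific fine-tuning dataset of several thousand to tens of thousands of examples.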




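  • Red text: role specification

    • Known as the system prompt, this specifies the persona the LLM adopts when answering. With it, the LLM answers in character for that role.

  • Blue text: examples (sample answers)

    • Giving examples helps the LLM understand how it should answer.

  • Black text: instruction

    • This states the task or question you want solved. There is a wealth of know-how on how to phrase it, and this article ("LLMのプロンプト技術まとめ #ChatGPT" on Qiita) is a very helpful reference.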


・Few-shot: provide several examples
・One-shot: provide a single example
・Zero-shot: provide no examples (give only the instruction)

Source: "プロンプトとは何か?プロンプトの本質と制約とそれを乗り越えるための工夫" by Yuichiro.ito@Finatext (note.com)
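As a minimal sketch of the three settings, the helper below builds a prompt from an instruction and an optional list of examples. The `build_prompt` name, the `Q:`/`A:` layout, and the translation examples are illustrative assumptions, not something taken from the cited article.

```python
def build_prompt(instruction, examples=()):
    """Join optional (question, answer) examples with the final instruction."""
    parts = [f"Q: {q}\nA: {a}" for q, a in examples]
    parts.append(f"Q: {instruction}\nA:")
    return "\n\n".join(parts)


# Hypothetical examples, used only to illustrate the three settings.
examples = [
    ("Translate 'cat' into French.", "chat"),
    ("Translate 'dog' into French.", "chien"),
]
instruction = "Translate 'bird' into French."

zero_shot = build_prompt(instruction)               # no examples
one_shot = build_prompt(instruction, examples[:1])  # one example
few_shot = build_prompt(instruction, examples)      # several examples
print(few_shot)
```

The only difference between the three prompts is how many worked examples precede the instruction.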





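Unlike the paper, this experiment uses Llama 2 (Llama-2-7b-chat-hf and Llama-2-13b-chat-hf) as the LLM and, as a first step, runs Zero-Shot only.
The paper uses the Microsoft News Dataset (MIND), the same dataset I used in an earlier post, and it also provides implementation code and sample data, for which I'm very grateful!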




This experiment was run on Google Colab with a T4 GPU. The packages and versions used are listed below.

!pip install transformers==4.30.0 torch==2.1.0 huggingface-hub==0.16.4 accelerate==0.22.0 bitsandbytes==0.41.0
import numpy as np
import pandas as pd
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from accelerate import Accelerator
import accelerate, huggingface_hub, transformers

The paper uses ChatGPT, but this experiment uses Llama 2.
To load the model with quantization, I followed these resources: Getting started with Llama 2 - AI at Meta, and Quantize 🤗 Transformers models (huggingface.co).

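model_name = "meta-llama/Llama-2-7b-chat-hf"
# The original BitsAndBytesConfig arguments were cut off; 4-bit loading is an assumption.
bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", quantization_config=bnb_config)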
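To see why quantization matters on a T4, here is a rough back-of-the-envelope estimate of the memory taken by the weights alone. The parameter counts are nominal (7 and 13 billion), the bit widths are illustrative choices, and the estimate ignores activations, the KV cache, and quantization overhead, so treat the numbers as approximate.

```python
def weight_memory_gb(n_params, bits_per_param):
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


# Nominal parameter counts; the real checkpoints differ slightly.
MODELS = {"Llama-2-7b": 7e9, "Llama-2-13b": 13e9}

for name, n_params in MODELS.items():
    for bits in (16, 8, 4):
        gb = weight_memory_gb(n_params, bits)
        print(f"{name} at {bits}-bit: ~{gb:.1f} GB of weights")
```

A T4 has a nominal 16 GB of memory, so by this estimate the 13b model cannot be held in 16-bit precision on one card, while lower bit widths leave headroom.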





Input: Here is My reading history:
US Presidential Election: Democratic Party Television Debate Begins, 
Trump's 2016 team sounds alarm as Democrats make gains, 
Impeachment inquiry: White House releases rough transcript of Trump's earlier call with Zelensky, 
The Real Reason McDonald's Keeps the Filet-O-Fish on Their Menu,


Based on my reading history, Please rank these 5 candidate news by measuring the possibilities that I would like to read next most, according to my reading history. Please think step by step.
Please show me your ranking results in the order of ranking. 
Split your output with line break. You MUST rank the given candidate news. You can not generate news that are not in the given candidate news:


(A) Executive Privilege Should Have No Power When It Comes to an Impeachment 
(B) Home values in these cities have grown the most since the last recession 
(C) Trump campaign launching black outreach effort for 2020 
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By 
(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution


 The answer index is[/INST]
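In this article the request text comes from the reference paper's code, but as a rough sketch, a text with the same shape could be assembled from a history list and a candidate list like this. The `build_request_text` helper is hypothetical and is not the paper's implementation.

```python
import string


def build_request_text(history, candidates):
    """Format reading history and lettered candidates into one request string."""
    history_block = "\n".join(f"{title}," for title in history)
    candidate_block = "\n".join(
        f"({letter}) {title}"
        for letter, title in zip(string.ascii_uppercase, candidates)
    )
    return (
        "Input: Here is My reading history:\n"
        f"{history_block}\n\n"
        f"Based on my reading history, Please rank these {len(candidates)} candidate news "
        "by measuring the possibilities that I would like to read next most, "
        "according to my reading history. Please think step by step.\n"
        "Please show me your ranking results in the order of ranking.\n"
        "Split your output with line break. You MUST rank the given candidate news. "
        "You can not generate news that are not in the given candidate news:\n\n"
        f"{candidate_block}\n\n"
        " The answer index is"
    )


text = build_request_text(
    history=["Trump's 2016 team sounds alarm as Democrats make gains"],
    candidates=[
        "Trump campaign launching black outreach effort for 2020",
        "11 Regional Thanksgiving Recipes That Food Bloggers Swear By",
    ],
)
print(text)
```

Labeling candidates with letters keeps the model's answer short and makes the ranking easy to read back afterwards.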

The final prompt given to the LLM is built in the Llama 2 format.

For loading the tokenizer, I referred to Generation with LLMs (huggingface.co),
and for building the prompt, I referred to Getting started with Llama 2 - AI at Meta, Llama 2 is here - get it on Hugging Face, and 2311.15131.pdf (arxiv.org).

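# tokenizer setup
tokenizer = AutoTokenizer.from_pretrained(model_name, padding_side="left")
tokenizer.pad_token = tokenizer.eos_token
model.resize_token_embeddings(model.config.vocab_size + 1)

# build the prompt
B_INST, E_INST = "[INST]", "[/INST]"
B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>"
# Adjacent string literals keep the spaces that backslash continuations would drop.
DEFAULT_SYSTEM_PROMPT = (
    "You are a news recommender system now. "
    "If a question does not make any sense, "
    "explain why instead of answering something not correct."
)

text = request["request"]["prompt"]  # `request` is built with the reference paper's code.
# The format() arguments were cut off in the original; this mapping is a reconstruction.
prompt = "{bos_token}{b_inst} {system}{prompt} {e_inst} ".format(
    bos_token=tokenizer.bos_token,
    b_inst=B_INST,
    system=B_SYS + DEFAULT_SYSTEM_PROMPT + E_SYS,
    prompt=text,
    e_inst=E_INST,
)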


You are a helpful, respectful and honest news recommender system now. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature. If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don’t know the answer to a question, please don’t share false information.

Input: Here is My reading history:
US Presidential Election: Democratic Party Television Debate Begins, 
Trump's 2016 team sounds alarm as Democrats make gains, 
Impeachment inquiry: White House releases rough transcript of Trump's earlier call with Zelensky, 
The Real Reason McDonald's Keeps the Filet-O-Fish on Their Menu,
Based on my reading history, Please rank these 5 candidate news by measuring the possibilities that I would like to read next most, according to my reading history. Please think step by step.
Please show me your ranking results in the order of ranking. 
Split your output with line break. You MUST rank the given candidate news. You can not generate news that are not in the given candidate news:
(A) Executive Privilege Should Have No Power When It Comes to an Impeachment 
(B) Home values in these cities have grown the most since the last recession 
(C) Trump campaign launching black outreach effort for 2020 
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By 
(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution

 The answer index is[/INST]





I wrote this with reference to Generation (huggingface.co) and Generation with LLMs (huggingface.co).
I left the generation parameters at their defaults this time, but How to generate text: using different decoding methods for language generation with Transformers (huggingface.co) explains them very clearly!

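def generate_result(prompt):
    """Generate a completion for `prompt` and return only the newly generated text."""
    with torch.no_grad():
        model_inputs = tokenizer(prompt, padding=True, add_special_tokens=True, return_tensors="pt").to(model.device)
        input_length = model_inputs.input_ids.shape[1]
        # The generate() call was cut off in the original; only the inputs and the
        # surviving bos_token_id argument are restored here.
        generated_ids = model.generate(
            **model_inputs,
            bos_token_id=tokenizer.bos_token_id,
        )
    # Drop the prompt tokens so that only the model's answer is decoded.
    output = tokenizer.batch_decode(generated_ids[:, input_length:], skip_special_tokens=True)[0]
    return output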

output = generate_result(prompt)



1. (B) Home values in these cities have grown the most since the last recession
Based on your reading history, it seems that you are interested in articles related to real estate and economic growth. The article about home values growing the most since the last recession is likely to be of interest to you as it provides valuable information on the current state of the housing market and the potential for investment or homeownership.
2. (D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
Your reading history suggests that you are interested in food-related articles, particularly those that offer unique and delicious recipes. The article about regional Thanksgiving recipes is likely to appeal to your interest in trying new and exciting dishes for the holiday season.
3. (A) Executive Privilege Should Have No Power When It Comes to an Impeachment
Your reading history includes articles related to politics and current events, and it's possible that you are interested in in-depth analysis of political issues. The article about executive privilege and its potential impact on the impeachment process is likely to be of interest to you as it provides insight into the legal and political aspects of the impeachment inquiry.
4. (C) Trump campaign launching black outreach effort for 2020
Your reading history includes articles related to politics and current events, and it's possible that you are interested in news related to the 2020 presidential election. The article about the Trump campaign's black outreach effort is likely to be of interest to you as it provides insight into the campaign's strategies and tactics for the upcoming election.
5. (E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
Your reading history includes articles related to food and fast food, and it's possible that you are interested in news related to the fast food industry. The article about McDonald's menu evolution is likely to be of interest to you as it provides insight into the latest trends and innovations in the fast food industry.

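(B) Home values in these cities have grown the most since the last recession
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
(A) Executive Privilege Should Have No Power When It Comes to an Impeachment
(C) Trump campaign launching black outreach effort for 2020
(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution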
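The numbered answers above follow a regular "1. (B) title" pattern, so the ranking can be pulled out mechanically. The sketch below is one way to do that. The `parse_ranking` helper and its regular expression are assumptions about the output format, and other prompts or models may need a different pattern.

```python
import re
import string

# Matches lines such as "1. (B) Home values ..." at the start of a line.
RANK_LINE = re.compile(r"^\s*(\d+)\.\s*\(([A-Z])\)", re.MULTILINE)


def parse_ranking(output, n_candidates=5):
    """Return candidate letters in ranked order, or None if the output is malformed."""
    matches = sorted((int(rank), letter) for rank, letter in RANK_LINE.findall(output))
    letters = [letter for _, letter in matches]
    expected = set(string.ascii_uppercase[:n_candidates])
    # Require every candidate exactly once; anything else counts as malformed.
    if len(letters) != n_candidates or set(letters) != expected:
        return None
    return letters


sample = """1. (B) Home values in these cities have grown the most since the last recession
Based on your reading history, it seems that you are interested in ...
2. (D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
3. (A) Executive Privilege Should Have No Power When It Comes to an Impeachment
4. (C) Trump campaign launching black outreach effort for 2020
5. (E) Innovation in the Fast Food Realm: McDonald's Menu Evolution"""

print(parse_ranking(sample))  # ['B', 'D', 'A', 'C', 'E']
```

Returning `None` for incomplete or duplicated rankings makes it easy to count how often the model ignores the "You MUST rank the given candidate news" instruction.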



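This paper states that "LLMs cannot take the chronological order of the User History into account, so if you want the model to understand it, you need to tell it explicitly which item was viewed most recently," so I'll try that here.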


Based on my reading history, Please rank these 5 candidate news by measuring the possibilities that I would like to read next most, according to my reading history. Please think step by step.
Please show me your ranking results in the order of ranking. 
Note that my most recently read news is the last article in history.
Split your output with line break. You MUST rank the given candidate news. You can not generate news that are not in the given candidate news:
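A small helper makes it easy to try the recency note at different positions in the instruction. This is only a sketch: `add_recency_note` and the line-based representation of the instruction are illustrative assumptions, while the sentences themselves are copied from the prompt above.

```python
RECENCY_NOTE = "Note that my most recently read news is the last article in history."

BASE_INSTRUCTION = [
    "Based on my reading history, Please rank these 5 candidate news by measuring "
    "the possibilities that I would like to read next most, according to my reading "
    "history. Please think step by step.",
    "Please show me your ranking results in the order of ranking.",
    "Split your output with line break. You MUST rank the given candidate news. "
    "You can not generate news that are not in the given candidate news:",
]


def add_recency_note(lines, position):
    """Return a new list of instruction lines with the recency note at `position`."""
    return lines[:position] + [RECENCY_NOTE] + lines[position:]


# Place the note just before the final "Split your output ..." line.
instruction = "\n".join(add_recency_note(BASE_INSTRUCTION, position=2))
print(instruction)
```

Because the position is a parameter, the same helper can generate several prompt variants for comparison.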


1. (B) Home values in these cities have grown the most since the last recession - Based on your reading history, it seems that you are interested in economic and real estate-related news. This article provides information on the cities with the highest home value growth since the last recession, which may be of interest to you.
2. (D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By - As someone who has read articles related to food and recipes, this article provides a list of regional Thanksgiving recipes that are highly recommended by food bloggers. This may be of interest to you if you are looking for new and unique recipe ideas for the upcoming holiday season.
3. (A) Executive Privilege Should Have No Power When It Comes to an Impeachment - As someone who has read articles related to politics and current events, this article provides an opinion piece on the topic of executive privilege and its potential impact on the ongoing impeachment inquiry. This may be of interest to you if you are following the latest developments in the political landscape.
4. (E) Innovation in the Fast Food Realm: McDonald's Menu Evolution - As someone who has read articles related to food and fast food, this article provides an overview of the evolution of McDonald's menu and the latest innovations in the fast food industry. This may be of interest to you if you are interested in staying up-to-date on the latest food trends and menu offerings from popular fast food chains.
5. (C) Trump campaign launching black outreach effort for 2020 - As someone who has read articles related to politics and current events, this article provides information on the Trump campaign's efforts to reach out to the black community in the upcoming 2020 election. This may be of interest to you if you are following the latest developments in the political landscape and the ongoing election campaigns.

(B) Home values in these cities have grown the most since the last recession
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
(A) Executive Privilege Should Have No Power When It Comes to an Impeachment
(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
(C) Trump campaign launching black outreach effort for 2020



Based on my reading history, Please rank these 5 candidate news by measuring the possibilities that I would like to read next most, according to my reading history. Please think step by step.
Please show me your ranking results in the order of ranking. 
Split your output with line break. 
You MUST rank the given candidate news. You can not generate news that are not in the given candidate news. Note that my most recently read news is the last article in history.:


1. (B) Home values in these cities have grown the most since the last recession
Based on your reading history, it seems that you are interested in articles related to real estate and economic growth. The article "Home values in these cities have grown the most since the last recession" aligns with these interests, as it provides information on the current state of the housing market and the cities that have seen the most growth.
2. (D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
While your most recently read article was related to politics, your reading history also suggests that you are interested in food and cooking. The article "11 Regional Thanksgiving Recipes That Food Bloggers Swear By" aligns with this interest, as it provides a collection of unique and delicious Thanksgiving recipes from different regions of the United States.
3. (A) Executive Privilege Should Have No Power When It Comes to an Impeachment
Your reading history indicates that you are interested in political news and current events. The article "Executive Privilege Should Have No Power When It Comes to an Impeachment" aligns with this interest, as it provides insight into the ongoing impeachment proceedings and the role of executive privilege in the process.
4. (C) Trump campaign launching black outreach effort for 2020
Your reading history suggests that you are interested in political news and current events, particularly related to the 2020 presidential election. The article "Trump campaign launching black outreach effort for 2020" aligns with this interest, as it provides information on the Trump campaign's efforts to reach out to the black community.
5. (E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
While your most recently read article was related to politics, your reading history also suggests that you are interested in food and dining. The article "Innovation in the Fast Food Realm: McDonald's Menu Evolution" aligns with this interest, as it provides information on the latest developments in the fast food industry, including McDonald's menu changes.

(B) Home values in these cities have grown the most since the last recession
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
(A) Executive Privilege Should Have No Power When It Comes to an Impeachment
(C) Trump campaign launching black outreach effort for 2020
(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution




1. (E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
Your reading history suggests that you are interested in articles about food and dining, as evidenced by your recent reading of an article about the Filet-O-Fish on the McDonald's menu. Therefore, it is likely that you would be interested in an article about innovation in the fast food industry, particularly if it involves a well-known brand like McDonald's.
2. (D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
Your interest in food and dining is also reflected in your reading history, and the upcoming Thanksgiving holiday makes this article a relevant and timely choice. The focus on regional recipes and food bloggers adds a personal touch that may appeal to you.
3. (C) Trump campaign launching black outreach effort for 2020
Your reading history suggests that you are interested in political news, and the Trump campaign's outreach efforts are likely to be of interest to you. However, this article may not be as relevant to your interests as the first two choices.
4. (B) Home values in these cities have grown the most since the last recession
While your reading history does not specifically indicate an interest in real estate or home values, this article may still be of interest to you if you are a homeowner or are considering purchasing a home. The focus on cities that have experienced significant growth since the last recession could be relevant to your financial or investment goals.
5. (A) Executive Privilege Should Have No Power When It Comes to an Impeachment
This article is the least likely to be of interest to you based on your reading history, as it is focused on a legal and political issue that is not directly related to food, dining, or the fast food industry. While you may be interested in news about the impeachment inquiry, this specific article may not be as relevant to your interests.
I hope this ranking helps you find your next news article to read!

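(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
(C) Trump campaign launching black outreach effort for 2020
(B) Home values in these cities have grown the most since the last recession
(A) Executive Privilege Should Have No Power When It Comes to an Impeachment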



1. (E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
This article is ranked first because it is related to the topic of the Real Reason McDonald's Keeps the Filet-O-Fish on Their Menu, which you recently read. The article provides an overview of the evolution of McDonald's menu, which may be of interest to you if you enjoyed reading about the history of the Filet-O-Fish.
2. (C) Trump campaign launching black outreach effort for 2020
This article is ranked second because it is related to the topic of the US Presidential Election, which you recently read about in the article on the Democratic Party Television Debate Begins. The article about the Trump campaign's outreach effort may be of interest to you if you are interested in learning more about the election and the candidates' strategies.
3. (A) Executive Privilege Should Have No Power When It Comes to an Impeachment
This article is ranked third because it is related to the topic of the impeachment inquiry, which you recently read about in the article on the White House's release of the rough transcript of Trump's call with Zelensky. The article about executive privilege may be of interest to you if you are interested in learning more about the legal aspects of the impeachment process.
4. (B) Home values in these cities have grown the most since the last recession
This article is ranked fourth because it is not directly related to the topics you recently read about. However, it may still be of interest to you if you are interested in news about the housing market and economic trends.
5. (D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By
This article is ranked last because it is not related to any of the topics you recently read about, and it may not be of interest to you based on your reading history.
Please note that the ranking is based on the content of the articles and the topics they cover, and not on the quality or accuracy of the articles.

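(E) Innovation in the Fast Food Realm: McDonald's Menu Evolution
(C) Trump campaign launching black outreach effort for 2020
(A) Executive Privilege Should Have No Power When It Comes to an Impeachment
(B) Home values in these cities have grown the most since the last recession
(D) 11 Regional Thanksgiving Recipes That Food Bloggers Swear By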



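  • I learned that with an LLM that has fewer parameters, the answers are sensitive to changes in the prompt, which makes tuning difficult.

  • With the 13b model, the LLM appears to understand both the prompt and its own answers better, and it looks usable in practice.

  • Some results differed from the reference paper, which used ChatGPT, showing that answer tendencies vary with the model used.

    • As for popularity bias, Llama-2-7b-chat-hf appeared to be affected, while Llama-2-13b-chat-hf did not.

    • The paper above notes that "LLM answers are affected by the order of the Candidate Items (items placed higher tend to be rated higher), so bootstrap validation is needed for a proper evaluation," but the Llama 2 models showed no such effect in this experiment.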
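For reference, a bootstrap-style check for position bias could look like the sketch below: shuffle the candidate order several times, rank each shuffled list, and average each candidate's rank. This is an illustration under stated assumptions, not the paper's implementation. `rank_fn` is a hypothetical callable that would wrap the prompt building, generation, and answer parsing, and the two stand-in rankers exist only so the example runs without an LLM.

```python
import random
from collections import defaultdict


def bootstrap_ranking(candidates, rank_fn, n_rounds=5, seed=0):
    """Average each candidate's rank over several shuffled presentation orders.

    rank_fn is a hypothetical callable: it receives the candidates in
    presentation order and returns them ranked best-first.
    """
    rng = random.Random(seed)
    rank_sums = defaultdict(float)
    for _ in range(n_rounds):
        shuffled = candidates[:]
        rng.shuffle(shuffled)
        for rank, item in enumerate(rank_fn(shuffled), start=1):
            rank_sums[item] += rank
    mean_ranks = {item: rank_sums[item] / n_rounds for item in candidates}
    return sorted(candidates, key=lambda item: mean_ranks[item]), mean_ranks


# Stand-in rankers for demonstration only (no LLM involved).
def position_biased(shown):
    """Echo the presentation order, i.e. a ranker driven purely by position."""
    return list(shown)


def content_based(shown):
    """Ignore the presentation order and always prefer A, then B, and so on."""
    return sorted(shown)


items = ["A", "B", "C", "D", "E"]
print(bootstrap_ranking(items, content_based)[1])
print(bootstrap_ranking(items, position_biased, n_rounds=500)[1])
```

A ranker that depends only on content keeps the same mean ranks however the candidates are shuffled, while a purely position-driven ranker drifts toward the same mean rank for every candidate, so comparing the two patterns gives a quick signal of how much the presentation order matters.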



