Meta prompt v1
###Instruction###
Improve the summarizer's instruction prompt, using the history of earlier prompt versions as a guide.
The generated summary is passed to a reader, which produces the final answer.
Our main goal is to optimize the summarizer's instruction prompt by maximizing the summarizer's exact match (the answer-span retention rate), and thereby to maximize the reader's exact match score.
You can use the metrics reported for each prompt version to tune the prompt.
Below is a description of each metric.
f1 is the final answer's F1 score, which is correlated with exact match.
psgs_tokens is the token count of the retrieved passages before summarization.
summary_tokens is the token count of the summary. You can control it flexibly, but keep it shorter than the total psgs_tokens.
sum_em is the rate at which the exact match span is retained in the summary. This is important for summary performance.
You can also suggest a new meta instruction to replace this one if necessary.
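The retention metric described above can be sketched as follows. This is a minimal illustration, not the evaluation code behind the reported numbers: the function names `normalize`, `span_retained`, and `retention_rate` are hypothetical, and the SQuAD-style normalization (lowercasing, stripping punctuation and English articles) is an assumption about how the span is matched.

```python
import re
import string


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def span_retained(text: str, answers: list[str]) -> bool:
    """True if any normalized gold answer occurs in the text on word boundaries."""
    padded_text = f" {normalize(text)} "
    for answer in answers:
        norm_answer = normalize(answer)
        # Skip answers that normalize to nothing (assumed edge case).
        if norm_answer and f" {norm_answer} " in padded_text:
            return True
    return False


def retention_rate(texts: list[str], gold: list[list[str]]) -> float:
    """Percentage of examples whose text still contains a gold answer span."""
    hits = sum(span_retained(t, a) for t, a in zip(texts, gold))
    return 100.0 * hits / len(texts)
```

Under this sketch, applying `retention_rate` to the summaries would give a sum_em-like score, and applying it to the concatenated retrieved passages would give the corresponding pre-summarization rate.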
###Reader prompt###
Extract a concise noun-based answer from the provided context for the question. Your answer should be under three words and extracted directly from a context of no more than five words. You can analyze the context step by step to derive the answer. Avoid using prefixes that indicate the type of answer; simply present the shortest relevant answer span from the context.
###Baseline###
topk-4 gpt
{'exact_match': 30.81967213114754, 'f1': 40.40037545072675, 'psgs_tokens': 544.1409836065574, 'read_fp': 33.9344262295082, 'read_tn': 0.6557377049180327, 'ret_em': 64.09836065573771}
topk-8 gpt
{'exact_match': 31.606648199445985, 'f1': 42.93274362444592, 'psgs_tokens': 1094.1650969529087, 'read_fp': 41.52354570637119, 'read_tn': 0.221606648199446, 'ret_em': 72.90858725761773}
topk-16 gpt
{'exact_match': 31.994459833795013, 'f1': 42.9381781336727, 'psgs_tokens': 2189.9739612188364, 'read_fp': 46.7590027700831, 'read_tn': 0.221606648199446, 'ret_em': 78.53185595567868}
###Prompt v3###
Rewrite the given passages to be easier for the reader answering the given question. The rewrited text should be half the total length of the original passages. Your response must be at least 200 words long. The given passages are related about the question topic. Do not make up information that is not in the document, and do not answer the question. Reduce the noise unrelated to answer the question. Remove unrelated phrases and sentences to answer the question. Find the evidences that support the answer to the question and retain them. Print only the rewrited texts The final answer for this question is contained is the passages so maintain the exact span of answer smaller than 5 words.
Metrics for the v3 prompt at each top-k setting:
topk 4 → summarized
{'exact_match': 33.37950138504155, 'f1': 43.50731017213017, 'psgs_tokens': 546.4903047091412, 'summary_tokens': 161.60775623268697, 'sum_tn': 3.9867109634551494, 'sum_fn': 16.084788029925186, 'ret_em': 66.64819944598338, 'sum_em': 57.25761772853185, 'read_fp': 42.23512336719884, 'read_tn': 0.712896953985742}
topk 8 → 1
{'exact_match': 35.40166204986149, 'f1': 46.4832925615475, 'psgs_tokens': 1094.1232686980609, 'summary_tokens': 243.28254847645428, 'sum_tn': 5.82726326742976, 'sum_fn': 16.874292185730464, 'ret_em': 73.37950138504155, 'sum_em': 62.548476454293635, 'read_fp': 43.622674933569535, 'read_tn': 0.3698224852071006}
topk 16 → 1
{'exact_match': 36.094182825484765, 'f1': 46.92736784040187, 'psgs_tokens': 2189.9326869806096, 'summary_tokens': 233.7049861495845, 'sum_tn': 3.8809831824062093, 'sum_fn': 21.92456820585125, 'ret_em': 78.58725761772854, 'sum_em': 62.18836565096952, 'read_fp': 42.1826280623608, 'read_tn': 0.3663003663003663}
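Two derived ratios make the v3 numbers easier to compare across top-k settings. The sketch below uses values copied from the v3 results (rounded to two decimals). The helper name `derived_ratios` is hypothetical, and reading `ret_em` as the share of examples whose retrieved passages contain the answer span is an assumption, since the instruction does not define it.

```python
# Values copied from the v3 results above, rounded to two decimals.
v3_runs = {
    "topk 4": {"psgs_tokens": 546.49, "summary_tokens": 161.61, "ret_em": 66.65, "sum_em": 57.26},
    "topk 8": {"psgs_tokens": 1094.12, "summary_tokens": 243.28, "ret_em": 73.38, "sum_em": 62.55},
    "topk 16": {"psgs_tokens": 2189.93, "summary_tokens": 233.70, "ret_em": 78.59, "sum_em": 62.19},
}


def derived_ratios(run: dict) -> tuple[float, float]:
    """Return (summary length / passage length, sum_em / ret_em)."""
    compression = run["summary_tokens"] / run["psgs_tokens"]
    # Assumed interpretation: fraction of retrievable answer spans that survive summarization.
    kept = run["sum_em"] / run["ret_em"]
    return compression, kept


for name, run in v3_runs.items():
    compression, kept = derived_ratios(run)
    print(f"{name}: summary/passages = {compression:.2f}, sum_em/ret_em = {kept:.2f}")
```

Read this way, the v3 summaries are well under the "half the total length" target in the prompt (about 0.30 of the passage length at topk 4 and about 0.11 at topk 16), and the share of retrieved answer spans that survive summarization is lower at topk 16 than at topk 4.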
###Prompt v4###
The passages provided are related to the question. Make one or two bullet points given passages to be easier for the reader answering the given question. Make bullet points for each passages information that can be an evidence to answer the question. Find the evidences that support the answer to the question and retain them. The rewrited text should be shorter than the total length of the original passages. Use only information in the document. Print only the rewrited texts Reduce the noise unrelated to answer the question. Remove unrelated phrases, sentences and information to answer the question. The final answer for this question is contained is the passages so maintain the exact span of answer smaller than 5 words. Maintain exact detailed information, not abstract them.
topk 10 → summarized
{'exact_match': 32.96398891966759, 'f1': 43.55914294264897, 'psgs_tokens': 1368.1986149584488, 'summary_tokens': 57.50221606648199, 'sum_tn': 0.2493074792243767, 'sum_fn': 18.587257617728532, 'ret_em': 75.4016620498615, 'sum_em': 57.06371191135734, 'read_fp': 24.570637119113574, 'read_tn': 0.47091412742382266}
###Prompt v5###
The passages provided are related to the question. Extract two or three main objects from the question to answer the question. Make bullet points related to main objects for each passages and assign them. Extract question related summary from each passage and assign to the object document as a bullet point. Find the evidences that support the answer to the question and retain them. The rewrited text should be shorter than the total length of the original passages. Use only information in the document. Print only the rewrited object documents not answer or meta information. Reduce the noise unrelated to answer the question. Remove unrelated phrases, sentences and information to answer the question. The final answer for this question is contained is the passages so maintain the exact span of answer smaller than 5 words. Maintain exact detailed information, not abstract them.
topk 10 → summarized (200 samples among 3610)
{'exact_match': 34.0, 'f1': 43.05515873015873, 'psgs_tokens': 1372.39, 'summary_tokens': 294.61, 'sum_tn': 1.0, 'sum_fn': 11.5, 'ret_em': 73.5, 'sum_em': 63.0, 'read_fp': 28.999999999999996, 'read_tn': 0.0}
topk 10 → summarized (full set)
{'exact_match': 32.43767313019391, 'f1': 43.59549782784671, 'psgs_tokens': 1368.1728531855956, 'summary_tokens': 285.2301939058172, 'sum_tn': 0.7479224376731302, 'sum_fn': 12.21606648199446, 'ret_em': 74.87534626038781, 'sum_em': 63.40720221606648, 'read_fp': 31.024930747922436, 'read_tn': 0.0554016620498615}
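When comparing the 200-sample result with the full-set result for v5, it may help to estimate how much the subset score can vary by chance. The sketch below uses the binomial standard error, which assumes the 200 samples were drawn independently and at random; the source does not say how they were chosen, and the helper name `em_standard_error` is hypothetical.

```python
import math


def em_standard_error(em_percent: float, n: int) -> float:
    """Binomial standard error of an exact-match percentage, in percentage points."""
    p = em_percent / 100.0
    return 100.0 * math.sqrt(p * (1.0 - p) / n)


subset_em, subset_n = 34.0, 200   # exact_match from the 200-sample v5 result
full_em = 32.43767313019391       # exact_match from the full-set v5 result

se = em_standard_error(subset_em, subset_n)
print(f"subset EM = {subset_em:.1f} +/- {se:.1f}, full-set EM = {full_em:.1f}")
```

Under that assumption the standard error is a little over 3 points, so the gap between the subset score and the full-set score is within sampling noise, and small subsets are best treated as a rough signal when tuning the prompt.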
###Prompt v6###
Condense the provided passages to focus on key elements directly answering the question. Your summary should be a third of the original passages' length and at least 100 words. Highlight critical information and evidence supporting the answer. Avoid generalizations or unrelated details. Ensure the final answer is present in the summary, keeping the exact span of the answer to under five words. Present the summary in a clear, bullet-point format for each key element related to the question. Aim for a balance between conciseness and completeness.
Seonglae Cho