ReSRer Meta Prompt V2

Meta prompt v2

###Instruction### Your primary goal is to refine the summarizer's method to maximize the Exact Match (EM) retention rate. This is crucial for ensuring the summary closely aligns with the original text's key elements, directly impacting the reader's ability to achieve a high EM score in their final answer. The F1 score is a secondary metric, serving as an additional indicator of the summarizer's performance. Keep these points in mind while tuning the summarizer's prompt:

Prioritize Exact Match (EM) Retention: The most critical aspect is maintaining as much of the original text's exact match span as possible in the summary. Focus on capturing and preserving key terms and phrases that directly relate to the question.

Efficient Information Retrieval (psgs_tokens): Control the retrieved token length count from the original passages. The aim is to extract only the most relevant information, avoiding unnecessary details that don't contribute to the EM.

Concise Summary Tokens (summary_tokens): Ensure the summary is succinct, ideally shorter than the total psgs_tokens. While conciseness is key, don't sacrifice crucial details necessary for retaining the exact match.

Use of F1 Score as a Supportive Metric: While the primary focus is on EM, also consider the F1 score as an indicator of how well the summary captures the essential information in a coherent manner.

Leverage Historical Data for Improvement: Regularly analyze the performance of previous prompts, particularly focusing on their EM rates. Utilize insights from this analysis to make informed adjustments to the summarization strategy.

Iterative Refinement: Continuously refine the summarization approach based on the latest EM performance data. This process should aim to incrementally enhance the exact match rate, ensuring that the summary remains relevant and effectively serves as a bridge for the reader to extract the final answer.

Remember, the ultimate aim is to create a summary that maximizes EM retention, thereby directly assisting the reader in accurately identifying the final answer.

###Reader prompt###


Extract a concise noun-based answer from the provided context for the question. Your answer should be under three words and extracted directly from a context of no more than five words. You can analyze the context step by step to derive the answer. Avoid using prefixes that indicate the type of answer; simply present the shortest relevant answer span from the context.

###Baseline###

topk-4 gpt


{'exact_match': 30.81967213114754, 'f1': 40.40037545072675, 'psgs_tokens': 544.1409836065574, 'read_fp': 33.9344262295082, 'read_tn': 0.6557377049180327, 'ret_em': 64.09836065573771}

topk-8 gpt


{'exact_match': 31.606648199445985, 'f1': 42.93274362444592, 'psgs_tokens': 1094.1650969529087, 'read_fp': 41.52354570637119, 'read_tn': 0.221606648199446, 'ret_em': 72.90858725761773}

topk-16 gpt


{'exact_match': 31.994459833795013, 'f1': 42.9381781336727, 'psgs_tokens': 2189.9739612188364, 'read_fp': 46.7590027700831, 'read_tn': 0.221606648199446, 'ret_em': 78.53185595567868}

###Prompt v3###


Rewrite the given passages to be easier for the reader answering the given question.
The rewrited text should be half the total length of the original passages. Your response must be at least 200 words long.
The given passages are related about the question topic.
Do not make up information that is not in the document, and do not answer the question.
Reduce the noise unrelated to answer the question.
Remove unrelated phrases and sentences to answer the question.
Find the evidences that support the answer to the question and retain them.
Print only the rewrited texts
The final answer for this question is contained is the passages so maintain the exact span of answer smaller than 5 words.

v3 prompt's metric per scores are like these

topk 4 → summarized


{'exact_match': 33.37950138504155, 'f1': 43.50731017213017, 'psgs_tokens': 546.4903047091412, 'summary_tokens': 161.60775623268697, 'sum_tn': 3.9867109634551494, 'sum_fn': 16.084788029925186, 'ret_em': 66.64819944598338, 'sum_em': 57.25761772853185, 'read_fp': 42.23512336719884, 'read_tn': 0.712896953985742}

topk 8→1


{'exact_match': 35.40166204986149, 'f1': 46.4832925615475, 'psgs_tokens': 1094.1232686980609, 'summary_tokens': 243.28254847645428, 'sum_tn': 5.82726326742976, 'sum_fn': 16.874292185730464, 'ret_em': 73.37950138504155, 'sum_em': 62.548476454293635, 'read_fp': 43.622674933569535, 'read_tn': 0.3698224852071006}

topk 16 → 1


{'exact_match': 36.094182825484765, 'f1': 46.92736784040187, 'psgs_tokens': 2189.9326869806096, 'summary_tokens': 233.7049861495845, 'sum_tn': 3.8809831824062093, 'sum_fn': 21.92456820585125, 'ret_em': 78.58725761772854, 'sum_em': 62.18836565096952, 'read_fp': 42.1826280623608, 'read_tn': 0.3663003663003663}

prompt v3 improved total pipeline performance overall but it is not enough. Gap between f1 score and EM is large and information loss during summarization (ret_em to sum_em) is too large. However the gap is increasing as top-k increase so it means good trend.

###Prompt v4###


The passages provided are related to the question.
Make one or two bullet points given passages to be easier for the reader answering the given question.
Make bullet points for each passages information that can be an evidence to answer the question.
Find the evidences that support the answer to the question and retain them.
The rewrited text should be shorter than the total length of the original passages.
Use only information in the document. Print only the rewrited texts
Reduce the noise unrelated to answer the question. Remove unrelated phrases, sentences and information to answer the question.
The final answer for this question is contained is the passages so maintain the exact span of answer smaller than 5 words.
Maintain exact detailed information, not abstract them.

topk 10 → summarized


{'exact_match': 32.96398891966759, 'f1': 43.55914294264897, 'psgs_tokens': 1368.1986149584488, 'summary_tokens': 57.50221606648199, 'sum_tn': 0.2493074792243767, 'sum_fn': 18.587257617728532, 'ret_em': 75.4016620498615, 'sum_em': 57.06371191135734, 'read_fp': 24.570637119113574, 'read_tn': 0.47091412742382266}

As an insight, bullet points works better for question answering. However this prompt is not optimized yet so the score is lower than expected

###Prompt v5###


The passages provided are related to the question.
Extract two or three main objects from the question to answer the question.
Make bullet points related to main objects for each passages and assign them.
Extract question related summary from each passage and assign to the object document as a bullet point.
Find the evidences that support the answer to the question and retain them.
The rewrited text should be shorter than the total length of the original passages.
Use only information in the document. Print only the rewrited object documents not answer or meta information.
Reduce the noise unrelated to answer the question. Remove unrelated phrases, sentences and information to answer the question.
The final answer for this question is contained is the passages so maintain the exact span of answer smaller than 5 words.
Maintain exact detailed information, not abstract them.

topk 10 → summarized


{'exact_match': 32.43767313019391, 'f1': 43.59549782784671, 'psgs_tokens': 1368.1728531855956, 'summary_tokens': 285.2301939058172, 'sum_tn': 0.7479224376731302, 'sum_fn': 12.21606648199446, 'ret_em': 74.87534626038781, 'sum_em': 63.40720221606648, 'read_fp': 31.024930747922436, 'read_tn': 0.0554016620498615}

Like version 4, structured text is expected to works good for question answering. However this prompt is not optimized yet too.

###Prompt v6###


Condense the provided passages to focus on key elements directly answering the question. Your summary should be a third of the original passages' length and at least 100 words. Highlight critical information and evidence supporting the answer. Avoid generalizations or unrelated details. Ensure the final answer is present in the summary, keeping the exact span of the answer to under five words. Present the summary in a clear, bullet-point format for each key element related to the question. Aim for a balance between conciseness and completeness.

topk 10 → summarized


{'exact_match': 35.34626038781163, 'f1': 46.62519920830175, 'psgs_tokens': 1368.1986149584488, 'summary_tokens': 171.44265927977838, 'sum_tn': 0.7202216066481995, 'sum_fn': 11.911357340720222, 'ret_em': 75.4016620498615, 'sum_em': 64.21052631578948, 'read_fp': 28.89196675900277, 'read_tn': 0.02770083102493075}

It improved a lot than before but still lower than v3(even than topk8). The good news is sum_em is higher

###Prompt v7###


Refine the provided passages with a focus on the key elements that directly answer the question. Your summary should aim to be about one-third the length of the original passages, but not less than 100 words. Adhere to these guidelines:

1. **Directly Address the Question**: Extract and emphasize information that directly answers the question. The final answer, under five words, must be identifiable in your summary.

2. **Structured Bullet-Point Format**: Present key information in a structured, bullet-point format. Each bullet point should correspond to a specific element or piece of evidence related to the question. This will aid in clarity and ease of understanding.

3. **Preserve Critical Details**: While summarizing, ensure that crucial information and terms, especially those that contribute to the EM score, are retained without alteration.

4. **Eliminate Redundant or Irrelevant Information**: Remove any content that does not contribute to answering the question, thus reducing the length of the text and focusing on relevant details.

5. **Coherent and Concise Summary**: The summary should be coherent, linking bullet points logically. Aim for a balance between brevity and comprehensive coverage of the necessary details.

6. **Continuous Improvement Based on Performance Data**: Regularly analyze the performance of this summarization approach, particularly the EM rates, and refine it accordingly to enhance effectiveness.

The goal is to provide a clear, concise, and relevant summary that maximizes EM retention, aiding the reader in quickly identifying the precise answer.

topk 10 → summarized


{'exact_match': 34.79224376731302, 'f1': 45.25108848063672, 'psgs_tokens': 1368.1728531855956, 'summary_tokens': 178.9202216066482, 'sum_tn': 0.5263157894736842, 'sum_fn': 12.770083102493073, 'ret_em': 74.87534626038781, 'sum_em': 62.63157894736842, 'read_fp': 28.116343490304708, 'read_tn': 0.2770083102493075}

The score lower than v5. I’m curious about structured prompt like bullet point or numbering instruction is better than general text.

###Prompt v8###

Highlighting key element is nice approach but let us see how it works


Focus on condensing the provided passages to highlight the key elements that directly answer the question. Keep these guidelines in mind while summarizing:

1. **Concentrate on Exact Match (EM) Retention**: Prioritize the preservation of exact phrases and terms from the original text that are crucial for the question. This is essential for maximizing the EM rate.

2. **Structured and Concise Summary**: Aim for a summary that is about a third of the original passages' length, with a minimum of 100 words. Present the information in a clear, bullet-point format, focusing on different key elements or evidence related to the question.

3. **Eliminate Extraneous Information**: Carefully remove content that doesn't contribute to answering the question. This will help in reducing the length of the text while maintaining focus on relevant details.

4. **Ensure Coherence and Clarity**: While being concise, the summary should be coherent and easily understandable. Arrange the bullet points in a logical sequence that guides the reader to the answer.

5. **Continuous Improvement and Adjustment**: Regularly review and adjust your summarization technique based on its performance, particularly the EM and F1 scores, to enhance its effectiveness.

Remember, the objective is to provide a summary that is both concise and rich in crucial details, aiding in the quick identification of the precise answer.

topk 10 → summarized


{'exact_match': 34.6814404432133, 'f1': 45.06912939322919, 'psgs_tokens': 1368.1728531855956, 'summary_tokens': 164.06675900277008, 'sum_tn': 0.6371191135734072, 'sum_fn': 12.24376731301939, 'ret_em': 74.87534626038781, 'sum_em': 63.26869806094183, 'read_fp': 28.781163434903046, 'read_tn': 0.19390581717451524}

Highlight was novel idea but didn’t worked better. sum_em is similar with v5 but final exact match score was bad that the format of v5 summary was better.


Your task is to rewrite the provided passages to enhance their specificity and precision in relation to the question. In your rewrite, you should:
- Align the text closely with the key aspects of the question.
- Your summary should be a half of the original passages' length and at least 150 words.
- Prioritize information most likely to contain or support the answer.
- Utilize summary tokens efficiently to cover relevant information comprehensively.
- Ensure the rewritten text is clear, readable, and facilitates quick understanding.
- Actively remove content that does not contribute to answering the question.
- Understand the intent of the question and summarize to make it easier to find an answer according to the intent.
Focus on maintaining the exact span of the answer to be smaller than 5 words, ensuring the most relevant and specific information is included in the rewrite.


{'exact_match': 35.84487534626039, 'f1': 46.83870750914061, 'psgs_tokens': 1368.1728531855956, 'summary_tokens': 235.92963988919666, 'sum_tn': 1.3573407202216066, 'sum_rc': 83.12985571587126, 'sum_fn': 12.631578947368421, 'ret_em': 74.87534626038781, 'sum_em': 63.60110803324099, 'read_fp': 27.839335180055404, 'read_tn': 0.08310249307479224}

ReSRer Meta Prompt V2

Meta prompt v2

Recommendations