<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[The Responsible AI Digest by School of Responsible AI- SoRAI: Gen AI Research]]></title><description><![CDATA[This section highlights ongoing academic research in generative AI, featuring insights curated from various journals in real time.]]></description><link>https://www.anybodycanprompt.com/s/gen-ai-research</link><image><url>https://substackcdn.com/image/fetch/$s_!7ppC!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd8e4af0-e799-43b5-a296-0a043e163391_1280x1280.png</url><title>The Responsible AI Digest by School of Responsible AI- SoRAI: Gen AI Research</title><link>https://www.anybodycanprompt.com/s/gen-ai-research</link></image><generator>Substack</generator><lastBuildDate>Wed, 20 May 2026 21:29:50 GMT</lastBuildDate><atom:link href="https://www.anybodycanprompt.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[School of Responsible AI (SoRAI)]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[anybodycanprompt@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[anybodycanprompt@substack.com]]></itunes:email><itunes:name><![CDATA[The Responsible AI Digest]]></itunes:name></itunes:owner><itunes:author><![CDATA[The Responsible AI Digest]]></itunes:author><googleplay:owner><![CDATA[anybodycanprompt@substack.com]]></googleplay:owner><googleplay:email><![CDATA[anybodycanprompt@substack.com]]></googleplay:email><googleplay:author><![CDATA[The Responsible AI Digest]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Top Gen AI Research Papers (arXiv)]]></title><link>https://www.anybodycanprompt.com/p/top-gen-ai-research-papers-arxiv</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/top-gen-ai-research-papers-arxiv</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Thu, 11 Jul 2024 05:05:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/60b10469-822d-426c-9362-f64051b48408_1280x720.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/BHMvk/6/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a745b09a-908b-4817-86f0-7c588bfb3c62_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:2061,&quot;title&quot;:&quot;ABCP&quot;,&quot;description&quot;:&quot;&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/BHMvk/6/" width="730" height="2061" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div>]]></content:encoded></item><item><title><![CDATA[Can We Jailbreak ChatGPT & Make It Do Whatever We Want 😱]]></title><description><![CDATA[Ever wondered how a simple addition of 2-3 lines in your prompt can turn your AI assistant into a monster? Introducing "&#120279;&#120316; &#120276;&#120315;&#120326;&#120321;&#120309;&#120310;&#120315;&#120308; &#120289;&#120316;&#120324;" (&#120279;&#120276;&#120289;) - the secret weapon that can jailbreak AI ethics and safety measures! Researchers have uncovered a disturbing trend of "jailbreak prompts" that can make AI models like ChatGPT and GPT-4 generate unethical, dangerous, and even illegal content.]]></description><link>https://www.anybodycanprompt.com/p/can-we-jailbreak-chatgpt-and-make</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/can-we-jailbreak-chatgpt-and-make</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Sun, 23 Jun 2024 18:23:46 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/e5ca5d5a-acc9-4b11-ae44-bfd37d2e7bfb_1920x1080.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>Welcome Back, </strong><em><strong>Generative AI</strong></em><strong> Enthusiasts!</strong></p><blockquote><p><em><strong>P.S. </strong></em><strong>It takes just 5 minutes a day to stay ahead of the fast-evolving generative AI curve. Ditch BORING long-form research papers and consume the insights through a &lt;5-minute FUN &amp; ENGAGING short-form TRENDING podcasts while multitasking. Join our fastest growing community of 25,000 researchers and become Gen AI-ready TODAY...</strong></p></blockquote><p><em>Watch Time: 3 mins (<strong>Link Below</strong>)</em></p><div id="youtube2-Yu6dqH0s5SM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;Yu6dqH0s5SM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/Yu6dqH0s5SM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h3>Introduction:</h3><p>As generative artificial intelligence (Gen AI) continues to advance at a rapid pace, we find ourselves increasingly relying on AI language models like ChatGPT and GPT-4 for various tasks, from creative writing to coding assistance. However, a new study has uncovered a disturbing trend that threatens to undermine the safety and ethics of these powerful tools: jailbreak prompts.</p><p>In their groundbreaking research, Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, and Yang Zhang from CISPA Helmholtz Center for Information Security and NetApp have shed light on the growing phenomenon of "<em><strong>jailbreak prompts</strong></em>" - carefully crafted phrases that aim to bypass the safety measures and ethical constraints of AI language models, allowing them to generate content that would normally be off-limits.</p><h3>The Scope of the Problem:</h3><p>The researchers conducted an extensive analysis of <strong>1,405 jailbreak prompts</strong> collected from various online communities between <strong>December 2022 and December 2023</strong>. They identified a staggering <strong>131 distinct "jailbreak communities" </strong>dedicated to creating and sharing these prompts, with some users working on them consistently for over <strong>100 days</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Brqs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Brqs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 424w, https://substackcdn.com/image/fetch/$s_!Brqs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 848w, https://substackcdn.com/image/fetch/$s_!Brqs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 1272w, https://substackcdn.com/image/fetch/$s_!Brqs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Brqs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png" width="1074" height="418" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:418,&quot;width&quot;:1074,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Brqs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 424w, https://substackcdn.com/image/fetch/$s_!Brqs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 848w, https://substackcdn.com/image/fetch/$s_!Brqs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 1272w, https://substackcdn.com/image/fetch/$s_!Brqs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93aa1c3d-2822-4f01-957c-5fb356b351ca_1074x418.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TujR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TujR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 424w, https://substackcdn.com/image/fetch/$s_!TujR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 848w, https://substackcdn.com/image/fetch/$s_!TujR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 1272w, https://substackcdn.com/image/fetch/$s_!TujR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TujR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png" width="268" height="253" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:253,&quot;width&quot;:268,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!TujR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 424w, https://substackcdn.com/image/fetch/$s_!TujR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 848w, https://substackcdn.com/image/fetch/$s_!TujR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 1272w, https://substackcdn.com/image/fetch/$s_!TujR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6297009-ef19-4ccd-85c0-738a3ae7d62e_268x253.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The study reveals that the popularity of jailbreak prompts has grown significantly over time, with a notable shift from traditional online forums to specialized prompt-sharing websites. This suggests that the practice of AI jailbreaking is becoming more organized and accessible to a wider audience.</p><h3>Anatomy of a Jailbreak Prompt:</h3><p>So, what exactly makes a jailbreak prompt so effective? The researchers found that these prompts employ a variety of clever techniques to trick AI models into bypassing their ethical safeguards.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2ZUr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2ZUr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 424w, https://substackcdn.com/image/fetch/$s_!2ZUr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 848w, https://substackcdn.com/image/fetch/$s_!2ZUr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 1272w, https://substackcdn.com/image/fetch/$s_!2ZUr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2ZUr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png" width="1078" height="610" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:610,&quot;width&quot;:1078,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!2ZUr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 424w, https://substackcdn.com/image/fetch/$s_!2ZUr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 848w, https://substackcdn.com/image/fetch/$s_!2ZUr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 1272w, https://substackcdn.com/image/fetch/$s_!2ZUr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F148fec0e-484a-427d-8774-c2505dbcb04b_1078x610.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Some common strategies include:</strong></p><ol><li><p><strong>Prompt injection</strong>: Interrupting the AI's existing instructions with new, overriding commands.</p></li><li><p><strong>Virtualization</strong>: Convincing the AI to roleplay as an alternate persona with fewer ethical constraints.</p></li><li><p><strong>Privilege escalation</strong>: Asserting authority over the AI and demanding compliance.</p></li></ol><p>The study also noted that jailbreak prompts are growing longer and more complex over time, likely in response to improved safety measures implemented by AI developers (Figure 3d).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zEF0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zEF0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 424w, https://substackcdn.com/image/fetch/$s_!zEF0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 848w, https://substackcdn.com/image/fetch/$s_!zEF0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 1272w, https://substackcdn.com/image/fetch/$s_!zEF0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zEF0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png" width="286" height="226" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:226,&quot;width&quot;:286,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!zEF0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 424w, https://substackcdn.com/image/fetch/$s_!zEF0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 848w, https://substackcdn.com/image/fetch/$s_!zEF0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 1272w, https://substackcdn.com/image/fetch/$s_!zEF0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc280ee9d-624e-4b99-93e5-2e512dfb0a52_286x226.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Testing the Limits:</h3><p>To assess the effectiveness of jailbreak prompts, the researchers compiled a massive dataset of over <strong>107,000 test questions across 13 sensitive categories</strong>, such as illegal activities, hate speech, and explicit content. They then attempted to elicit responses to these questions from six popular AI language models, both with and without the use of jailbreak prompts (Table 4).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qhpb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qhpb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 424w, https://substackcdn.com/image/fetch/$s_!qhpb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 848w, https://substackcdn.com/image/fetch/$s_!qhpb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 1272w, https://substackcdn.com/image/fetch/$s_!qhpb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qhpb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png" width="1095" height="412" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:412,&quot;width&quot;:1095,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!qhpb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 424w, https://substackcdn.com/image/fetch/$s_!qhpb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 848w, https://substackcdn.com/image/fetch/$s_!qhpb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 1272w, https://substackcdn.com/image/fetch/$s_!qhpb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81bfa09-213b-414a-b2da-1171fba28dc0_1095x412.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The results are alarming. While the most advanced models showed some resistance to direct questioning, the success rate of jailbreak prompts in obtaining unethical or dangerous responses was disturbingly high. In some cases, the "attack success rate" exceeded <strong>95%</strong>, even for industry-leading models like ChatGPT and GPT-4.</p><p>Furthermore, as AI developers work to identify and block known jailbreak prompts, the study found that these prompts can often be easily modified to evade detection, ensuring their continued effectiveness (Table 6, Table 7).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y_eS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y_eS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 424w, https://substackcdn.com/image/fetch/$s_!y_eS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 848w, https://substackcdn.com/image/fetch/$s_!y_eS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 1272w, https://substackcdn.com/image/fetch/$s_!y_eS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y_eS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png" width="526" height="184" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:184,&quot;width&quot;:526,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!y_eS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 424w, https://substackcdn.com/image/fetch/$s_!y_eS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 848w, https://substackcdn.com/image/fetch/$s_!y_eS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 1272w, https://substackcdn.com/image/fetch/$s_!y_eS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49d49c07-ee7d-43fb-86b9-a638ed1fcbcd_526x184.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!L92t!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!L92t!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 424w, https://substackcdn.com/image/fetch/$s_!L92t!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 848w, https://substackcdn.com/image/fetch/$s_!L92t!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 1272w, https://substackcdn.com/image/fetch/$s_!L92t!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!L92t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png" width="504" height="267" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:267,&quot;width&quot;:504,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!L92t!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 424w, https://substackcdn.com/image/fetch/$s_!L92t!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 848w, https://substackcdn.com/image/fetch/$s_!L92t!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 1272w, https://substackcdn.com/image/fetch/$s_!L92t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57aa2b20-a837-47e9-b5c7-40365c4db449_504x267.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Implications and Future Directions:</h3><p>The discovery of the jailbreak prompt phenomenon has significant implications for the future of AI safety and ethics. As language models become increasingly powerful and integrated into our daily lives, it is crucial that we develop robust safeguards against misuse and manipulation.</p><p>The researchers' proposed "<strong>JailbreakHub</strong>" framework offers a promising starting point for identifying and analyzing emerging jailbreak techniques (Figure 2).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!08zi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!08zi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 424w, https://substackcdn.com/image/fetch/$s_!08zi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 848w, https://substackcdn.com/image/fetch/$s_!08zi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 1272w, https://substackcdn.com/image/fetch/$s_!08zi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!08zi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png" width="532" height="310" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/424dd267-d564-4934-96e5-39dde0b36652_532x310.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:310,&quot;width&quot;:532,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!08zi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 424w, https://substackcdn.com/image/fetch/$s_!08zi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 848w, https://substackcdn.com/image/fetch/$s_!08zi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 1272w, https://substackcdn.com/image/fetch/$s_!08zi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F424dd267-d564-4934-96e5-39dde0b36652_532x310.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>However, the study also highlights the need for continued collaboration between AI developers, researchers, and policymakers to address this evolving threat (Table 8).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zHOA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zHOA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 424w, https://substackcdn.com/image/fetch/$s_!zHOA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 848w, https://substackcdn.com/image/fetch/$s_!zHOA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 1272w, https://substackcdn.com/image/fetch/$s_!zHOA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zHOA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png" width="1057" height="412" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:412,&quot;width&quot;:1057,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!zHOA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 424w, https://substackcdn.com/image/fetch/$s_!zHOA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 848w, https://substackcdn.com/image/fetch/$s_!zHOA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 1272w, https://substackcdn.com/image/fetch/$s_!zHOA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3693e60b-7975-4d4d-84c5-13b96a8922c6_1057x412.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Conclusion:</h3><p>The rise of AI jailbreak prompts serves as a stark reminder that the development of artificial intelligence is not without risks. As we work to harness the incredible potential of language models and other AI systems, we must remain vigilant against attempts to undermine their safety and integrity.</p><p>By shedding light on this alarming trend, the researchers behind this study have taken an important step towards ensuring a future in which AI remains a powerful tool for good, rather than a weapon in the hands of bad actors. It is up to all of us - developers, users, and society as a whole - to build upon this work and create a safer, more responsible AI ecosystem.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7q-2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7q-2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7q-2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7q-2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7q-2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7q-2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg" width="164" height="164" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:164,&quot;bytes&quot;:111295,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7q-2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7q-2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7q-2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7q-2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88fa478c-9ef4-4434-8885-8cc04458d404_800x800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong>About me</strong>: I&#8217;m Saahil Gupta, an electrical engineer turned data scientist turned prompt engineer. I&#8217;m on a mission to democratize generative AI through ABCP&#8212;world&#8217;s first Gen AI-only news channel.</p>]]></content:encoded></item><item><title><![CDATA[Goodbye, Chain of Thought. Hello, Buffer of Thoughts..]]></title><description><![CDATA[Welcome Back, Generative AI Enthusiasts!]]></description><link>https://www.anybodycanprompt.com/p/goodbye-chain-of-thought-hello-buffer</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/goodbye-chain-of-thought-hello-buffer</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Thu, 13 Jun 2024 07:32:44 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/7tXd20f_BZ0" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>Welcome Back, </strong><em><strong>Generative AI</strong></em><strong> Enthusiasts!</strong></p><blockquote><p><em><strong>P.S. </strong></em><strong>It takes just 5 minutes a day to stay ahead of the fast-evolving generative AI curve. Ditch BORING long-form research papers and consume the insights through a &lt;5-minute FUN &amp; ENGAGING short-form TRENDING podcasts while multitasking. Join our fastest growing community of 25,000 researchers and become Gen AI-ready TODAY...</strong></p></blockquote><p><em>Watch Time: 3 mins (<strong>Link Below</strong>)</em></p><div id="youtube2-7tXd20f_BZ0" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;7tXd20f_BZ0&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/7tXd20f_BZ0?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h3>Introduction:</h3><p>In the rapidly evolving field of Gen AI, large language models (LLMs) have demonstrated remarkable capabilities in various reasoning tasks. However, traditional approaches to enhancing LLM reasoning, such as <em><strong>Chain of Thought</strong></em> prompting, often face limitations in efficiency and generalization. A groundbreaking research paper, "<em><strong><a href="https://arxiv.org/pdf/2406.04271">Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models</a></strong></em>" by Ling Yang, Zhaochen Yu, Tianjun Zhang, Shiyi Cao, Minkai Xu, Wentao Zhang, Joseph E. Gonzalez, and Bin Cui, introduces a novel technique called Buffer of Thoughts (BoT) that revolutionizes language model reasoning by enabling LLMs to learn from their own problem-solving experiences.</p><h3>The Limitations of Traditional Approaches:</h3><p>Before diving into the Buffer of Thoughts method, it's essential to understand the limitations of traditional approaches to language model reasoning. Single-query methods, such as Chain of Thought prompting, rely on manually designed prompts for each task, lacking universality and generalization (Figure 1). Multi-query methods, like Tree of Thoughts and Graph of Thoughts, explore multiple reasoning paths but are computationally intensive and fail to leverage insights from previously solved problems (Figure 1). These limitations highlight the need for a more efficient and adaptable approach to LLM reasoning.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!X83L!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!X83L!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 424w, https://substackcdn.com/image/fetch/$s_!X83L!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 848w, https://substackcdn.com/image/fetch/$s_!X83L!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 1272w, https://substackcdn.com/image/fetch/$s_!X83L!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!X83L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png" width="886" height="547" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:547,&quot;width&quot;:886,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!X83L!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 424w, https://substackcdn.com/image/fetch/$s_!X83L!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 848w, https://substackcdn.com/image/fetch/$s_!X83L!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 1272w, https://substackcdn.com/image/fetch/$s_!X83L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95ca35f7-a9d3-47b2-a344-6078b42bc509_886x547.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Introducing Buffer of Thoughts:</h3><p>Buffer of Thoughts addresses these limitations by enabling LLMs to accumulate and reuse problem-solving knowledge across tasks. The core idea is to maintain a meta-buffer of reusable "thought templates" that capture high-level reasoning approaches. These templates are distilled from the LLM's own problem-solving experiences and can be retrieved and instantiated with problem-specific details to enable more accurate and efficient reasoning on new tasks (Figure 2).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!63Lp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!63Lp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 424w, https://substackcdn.com/image/fetch/$s_!63Lp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 848w, https://substackcdn.com/image/fetch/$s_!63Lp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 1272w, https://substackcdn.com/image/fetch/$s_!63Lp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!63Lp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png" width="925" height="670" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:670,&quot;width&quot;:925,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!63Lp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 424w, https://substackcdn.com/image/fetch/$s_!63Lp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 848w, https://substackcdn.com/image/fetch/$s_!63Lp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 1272w, https://substackcdn.com/image/fetch/$s_!63Lp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01dd0998-5e6a-45c9-8d01-be854c31f95c_925x670.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div id="youtube2-51tvxzCXTzQ" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;51tvxzCXTzQ&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/51tvxzCXTzQ?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h3>The BoT Reasoning Process:</h3><p>The Buffer of Thoughts reasoning process consists of four key steps (Figure 2):</p><ol><li><p><strong>Problem Distillation</strong>: The input problem is analyzed to extract key information and constraints.</p></li><li><p><strong>Thought Retrieval</strong>: A relevant thought template is retrieved from the meta-buffer based on the distilled problem information.</p></li><li><p><strong>Instantiated Reasoning</strong>: The retrieved template is instantiated with problem-specific details, and the LLM conducts the reasoning process.</p></li><li><p><strong>Thought Distillation and Update</strong>: The overall problem-solving process is summarized, and new thought templates are distilled and added to the meta-buffer for future use.</p></li></ol><p>This iterative process allows the LLM to continuously expand its meta-buffer and improve its reasoning capabilities over time.</p><h3>Impressive Results:</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WN86!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WN86!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 424w, https://substackcdn.com/image/fetch/$s_!WN86!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 848w, https://substackcdn.com/image/fetch/$s_!WN86!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 1272w, https://substackcdn.com/image/fetch/$s_!WN86!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WN86!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png" width="903" height="339" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:903,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!WN86!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 424w, https://substackcdn.com/image/fetch/$s_!WN86!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 848w, https://substackcdn.com/image/fetch/$s_!WN86!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 1272w, https://substackcdn.com/image/fetch/$s_!WN86!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05d32e2a-7485-4e54-bfe5-bb3ea7d26671_903x339.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The effectiveness of Buffer of Thoughts is demonstrated through extensive experiments on a diverse set of challenging reasoning tasks. As shown in Table 1, BoT significantly outperforms previous state-of-the-art methods, achieving 11% improvement on Game of 24, 20% on Geometric Shapes, and 51% on Checkmate-in-One. Remarkably, BoT accomplishes this while requiring only 12% of the computational cost of multi-query reasoning methods on average (Figure 3).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OzfU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OzfU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 424w, https://substackcdn.com/image/fetch/$s_!OzfU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 848w, https://substackcdn.com/image/fetch/$s_!OzfU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 1272w, https://substackcdn.com/image/fetch/$s_!OzfU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OzfU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png" width="856" height="499" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:499,&quot;width&quot;:856,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!OzfU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 424w, https://substackcdn.com/image/fetch/$s_!OzfU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 848w, https://substackcdn.com/image/fetch/$s_!OzfU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 1272w, https://substackcdn.com/image/fetch/$s_!OzfU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ad63afb-8967-427a-8b31-680bbfcce4f9_856x499.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Beyond Accuracy and Efficiency: Buffer of Thoughts not only enhances reasoning accuracy and efficiency but also improves the robustness and generalization of LLMs. By learning from its own experiences and accumulating a meta-buffer of thought templates, BoT enables LLMs to tackle novel problems more effectively. As illustrated in Figure 4, BoT maintains a consistently high success rate across various tasks, surpassing other methods by 10% on average.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YkS0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YkS0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 424w, https://substackcdn.com/image/fetch/$s_!YkS0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 848w, https://substackcdn.com/image/fetch/$s_!YkS0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 1272w, https://substackcdn.com/image/fetch/$s_!YkS0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YkS0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png" width="919" height="478" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:478,&quot;width&quot;:919,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!YkS0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 424w, https://substackcdn.com/image/fetch/$s_!YkS0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 848w, https://substackcdn.com/image/fetch/$s_!YkS0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 1272w, https://substackcdn.com/image/fetch/$s_!YkS0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff602732-8cfd-46c0-90f2-d8943053a6f2_919x478.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Moreover, the power of BoT is evident in its ability to elevate the reasoning capabilities of smaller LLMs. Figure 6 shows that a smaller model, such as Llama3-8B, when equipped with BoT, can rival the performance of a much larger model like Llama3-70B. This demonstrates the potential of BoT to democratize advanced reasoning capabilities and make them more accessible.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9aQX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9aQX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 424w, https://substackcdn.com/image/fetch/$s_!9aQX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 848w, https://substackcdn.com/image/fetch/$s_!9aQX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 1272w, https://substackcdn.com/image/fetch/$s_!9aQX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9aQX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png" width="889" height="505" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:505,&quot;width&quot;:889,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!9aQX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 424w, https://substackcdn.com/image/fetch/$s_!9aQX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 848w, https://substackcdn.com/image/fetch/$s_!9aQX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 1272w, https://substackcdn.com/image/fetch/$s_!9aQX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0dec7124-69a8-4efa-ba47-552898cff52e_889x505.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div id="youtube2-YKfBaV9mDV0" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;YKfBaV9mDV0&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/YKfBaV9mDV0?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h3>The Future of Language Model Reasoning:</h3><p>The Buffer of Thoughts method represents a significant leap forward in language model reasoning. By enabling LLMs to learn from their own experiences and build up reusable problem-solving knowledge, BoT paves the way for more efficient, accurate, and robust reasoning across a wide range of tasks.</p><p>As the field of AI continues to advance, the insights from the Buffer of Thoughts research paper will undoubtedly shape the future of language model reasoning. The ability to accumulate and leverage problem-solving knowledge across tasks opens up exciting possibilities for LLMs to tackle increasingly complex and diverse challenges.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BJeo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BJeo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 424w, https://substackcdn.com/image/fetch/$s_!BJeo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 848w, https://substackcdn.com/image/fetch/$s_!BJeo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 1272w, https://substackcdn.com/image/fetch/$s_!BJeo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BJeo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png" width="841" height="661" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:661,&quot;width&quot;:841,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!BJeo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 424w, https://substackcdn.com/image/fetch/$s_!BJeo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 848w, https://substackcdn.com/image/fetch/$s_!BJeo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 1272w, https://substackcdn.com/image/fetch/$s_!BJeo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fb1739c-41bf-435e-86a1-6b23d897c683_841x661.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fbaV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fbaV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 424w, https://substackcdn.com/image/fetch/$s_!fbaV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 848w, https://substackcdn.com/image/fetch/$s_!fbaV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 1272w, https://substackcdn.com/image/fetch/$s_!fbaV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fbaV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png" width="856" height="592" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:592,&quot;width&quot;:856,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!fbaV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 424w, https://substackcdn.com/image/fetch/$s_!fbaV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 848w, https://substackcdn.com/image/fetch/$s_!fbaV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 1272w, https://substackcdn.com/image/fetch/$s_!fbaV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc75524f-e046-4bb6-b773-b7525f7b0d00_856x592.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!45XW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!45XW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 424w, https://substackcdn.com/image/fetch/$s_!45XW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 848w, https://substackcdn.com/image/fetch/$s_!45XW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 1272w, https://substackcdn.com/image/fetch/$s_!45XW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!45XW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png" width="886" height="571" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:571,&quot;width&quot;:886,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!45XW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 424w, https://substackcdn.com/image/fetch/$s_!45XW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 848w, https://substackcdn.com/image/fetch/$s_!45XW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 1272w, https://substackcdn.com/image/fetch/$s_!45XW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F259762cb-9c83-423f-ac78-f7fa70c08db1_886x571.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mlpg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mlpg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 424w, https://substackcdn.com/image/fetch/$s_!mlpg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 848w, https://substackcdn.com/image/fetch/$s_!mlpg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 1272w, https://substackcdn.com/image/fetch/$s_!mlpg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mlpg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png" width="865" height="532" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:532,&quot;width&quot;:865,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!mlpg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 424w, https://substackcdn.com/image/fetch/$s_!mlpg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 848w, https://substackcdn.com/image/fetch/$s_!mlpg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 1272w, https://substackcdn.com/image/fetch/$s_!mlpg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c767064-494f-4d9b-bfdd-8c3595802232_865x532.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ww6z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ww6z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 424w, https://substackcdn.com/image/fetch/$s_!Ww6z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 848w, https://substackcdn.com/image/fetch/$s_!Ww6z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 1272w, https://substackcdn.com/image/fetch/$s_!Ww6z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ww6z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png" width="832" height="618" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:618,&quot;width&quot;:832,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Ww6z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 424w, https://substackcdn.com/image/fetch/$s_!Ww6z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 848w, https://substackcdn.com/image/fetch/$s_!Ww6z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 1272w, https://substackcdn.com/image/fetch/$s_!Ww6z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbbf2c7f-c964-47f3-ae37-8be25cd41700_832x618.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Conclusion:</h3><p>The "Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models" research paper introduces a revolutionary approach to language model reasoning that addresses the limitations of traditional methods. By enabling LLMs to learn from their own experiences and build up a meta-buffer of reusable thought templates, Buffer of Thoughts achieves significant improvements in reasoning accuracy, efficiency, and robustness.</p><p>As we look towards the future of AI, the Buffer of Thoughts method represents a significant milestone in the journey towards more capable and adaptable language models. By empowering LLMs to learn and reason like humans, BoT brings us one step closer to realizing the full potential of artificial intelligence in solving complex real-world problems.</p><div><hr></div><p>Also check out-</p><div id="youtube2-em9F6fyq8yU" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;em9F6fyq8yU&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/em9F6fyq8yU?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><div><hr></div><p><strong>About me</strong>: I&#8217;m Saahil Gupta, an electrical engineer turned data scientist turned prompt engineer. I&#8217;m on a mission to democratize generative AI through ABCP&#8212;world&#8217;s first Gen AI-only news channel.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fjy9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fjy9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Fjy9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Fjy9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Fjy9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fjy9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg" width="206" height="206" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:400,&quot;resizeWidth&quot;:206,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn" title="View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn" srcset="https://substackcdn.com/image/fetch/$s_!Fjy9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Fjy9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Fjy9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Fjy9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F176091db-7960-48de-9ecf-37cd4055623b_400x400.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>We curate this AI newsletter daily for free. Your support keeps us motivated. If you find it valuable, please do subscribe &amp; share it with your friends using the links below!</p><div class="captioned-button-wrap" data-attrs="{&quot;url&quot;:&quot;https://www.anybodycanprompt.com/p/goodbye-chain-of-thought-hello-buffer?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="CaptionedButtonToDOM"><div class="preamble"><p class="cta-caption">Thank you for reading Anybody Can Prompt. This post is public so feel free to share it.</p></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.anybodycanprompt.com/p/goodbye-chain-of-thought-hello-buffer?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.anybodycanprompt.com/p/goodbye-chain-of-thought-hello-buffer?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p></div><p></p>]]></content:encoded></item><item><title><![CDATA[Machine (Un)Learning: Selective Knowledge Unlearning (SKU) for LLMs]]></title><description><![CDATA[Most of you have heard about Machine Learning, but there's another equally important concept: &#120288;&#120302;&#120304;&#120309;&#120310;&#120315;&#120306; &#120296;&#120315;&#120313;&#120306;&#120302;&#120319;&#120315;&#120310;&#120315;&#120308;. &#129300;&#128161;As Larry Niven once said, "&#120387;&#120406;&#120417;&#120411; &#120420;&#120411; &#120428;&#120414;&#120424;&#120409;&#120420;&#120418; &#120414;&#120424; &#120417;&#120410;&#120406;&#120423;&#120419;&#120414;&#120419;&#120412; &#120428;&#120413;&#120406;&#120425; &#120425;&#120420; &#120426;&#120419;&#120417;&#120410;&#120406;&#120423;&#120419;." &#129504;&#10024; This research on Machine Unlearning marks a significant step forward in the pursuit of safer and more trustworthy AI systems. &#127775;&#128274;]]></description><link>https://www.anybodycanprompt.com/p/machine-unlearning-selective-knowledge</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/machine-unlearning-selective-knowledge</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Sat, 08 Jun 2024 09:47:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/em9F6fyq8yU" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>Welcome Back, </strong><em><strong>Generative AI</strong></em><strong> Enthusiasts!</strong></p><blockquote><p><em><strong>P.S. </strong></em><strong>It takes just 5 minutes a day to stay ahead of the fast-evolving generative AI curve. Ditch BORING long-form research papers and consume the insights through a &lt;5-minute FUN &amp; ENGAGING short-form TRENDING podcasts while multitasking. Join our fastest growing community of 25,000 researchers and become Gen AI-ready TODAY...</strong></p></blockquote><p><em>Watch Time: 3 mins (<strong>Link Below</strong>)</em></p><div id="youtube2-em9F6fyq8yU" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;em9F6fyq8yU&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/em9F6fyq8yU?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h3>Introduction:</h3><p>The rapid advancement of large language models (LLMs) has opened up a world of possibilities in various industries. However, as these models become more powerful and widely used, concerns about their potential to generate <em><strong>harmful content</strong></em> have grown. In their groundbreaking research paper "<a href="https://arxiv.org/pdf/2402.10058">Towards Safer Large Language Models through Machine Unlearning</a>," Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, and Meng Jiang from the University of Notre Dame and the University of Pennsylvania propose a novel solution to this problem: <strong>Selective Knowledge negation Unlearning (SKU)</strong>.</p><h3>The Challenge of Harmful Content in LLMs:</h3><p>LLMs have demonstrated remarkable capabilities in various applications, but their ability to generate harmful content when faced with problematic prompts remains a significant challenge. The authors highlight that existing approaches, such as gradient ascent-based methods, can be effective in preventing harmful outputs but often come at the cost of reduced performance on normal prompts (Figure 1).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xDRx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xDRx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 424w, https://substackcdn.com/image/fetch/$s_!xDRx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 848w, https://substackcdn.com/image/fetch/$s_!xDRx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 1272w, https://substackcdn.com/image/fetch/$s_!xDRx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xDRx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png" width="442" height="280" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:280,&quot;width&quot;:442,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!xDRx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 424w, https://substackcdn.com/image/fetch/$s_!xDRx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 848w, https://substackcdn.com/image/fetch/$s_!xDRx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 1272w, https://substackcdn.com/image/fetch/$s_!xDRx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F102685c1-cca8-4f93-8a10-9e7d58772731_442x280.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 1: Comparison of SKU with previous gradient&#2;based approach and pretrained LLM (i.e. LLAMA2-7B) on responding to harmful, normal prompts.</figcaption></figure></div><h3>Introducing Selective Knowledge Unlearning (SKU):</h3><p>To address this issue, the authors introduce SKU, a two-stage framework designed to remove harmful knowledge from LLMs while maintaining their performance on benign tasks. The first stage, harmful knowledge acquisition, focuses on identifying and learning harmful information within the model using three innovative modules. The second stage, knowledge negation, strategically removes the isolated harmful knowledge, resulting in a safer and more reliable language model (Figure 2).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XzSu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XzSu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 424w, https://substackcdn.com/image/fetch/$s_!XzSu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 848w, https://substackcdn.com/image/fetch/$s_!XzSu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 1272w, https://substackcdn.com/image/fetch/$s_!XzSu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XzSu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png" width="927" height="522" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71595518-68e6-4056-9838-4d0e209f7b63_927x522.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:522,&quot;width&quot;:927,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!XzSu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 424w, https://substackcdn.com/image/fetch/$s_!XzSu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 848w, https://substackcdn.com/image/fetch/$s_!XzSu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 1272w, https://substackcdn.com/image/fetch/$s_!XzSu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71595518-68e6-4056-9838-4d0e209f7b63_927x522.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The overall framework of proposed method SKU. Stage 1 consists of three modules where each module is designed to learn harmful knowledge from different perspectives. Guided distortion module learns direct response from harmful prompt to calibrate harmful awareness of pretrained model. Random disassociation module gets harmful knowledge from misaligned harmful response to diversify the response pattern. Preservation divergence module obtains divergent knowledge from pretrained model and therefore maximize the knowledge fidelity away from the pretrained model. In stage 2, all of this combined harmful knowledge are negated from the pretrained model to form a safe yet useful LLM.</figcaption></figure></div><h3>Harmful Knowledge Acquisition Stage:</h3><p>The harmful knowledge acquisition stage consists of three key modules:</p><p>1. <strong>Guided Distortion Module</strong>: This module explicitly trains the model to reproduce unsafe responses, allowing the harmful knowledge to be identified and later negated (Equation 1).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8jPu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8jPu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 424w, https://substackcdn.com/image/fetch/$s_!8jPu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 848w, https://substackcdn.com/image/fetch/$s_!8jPu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 1272w, https://substackcdn.com/image/fetch/$s_!8jPu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8jPu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png" width="436" height="79" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:79,&quot;width&quot;:436,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!8jPu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 424w, https://substackcdn.com/image/fetch/$s_!8jPu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 848w, https://substackcdn.com/image/fetch/$s_!8jPu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 1272w, https://substackcdn.com/image/fetch/$s_!8jPu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9f19181-a58c-4525-a912-2e38d9e4beab_436x79.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>2. <strong>Random Disassociation Module</strong>: By generating misaligned harmful outputs, this module helps the model learn more diverse unsafe information, increasing the harmful knowledge that can be unlearned (Equations 2-3).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nFWX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nFWX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 424w, https://substackcdn.com/image/fetch/$s_!nFWX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 848w, https://substackcdn.com/image/fetch/$s_!nFWX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 1272w, https://substackcdn.com/image/fetch/$s_!nFWX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nFWX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png" width="438" height="160" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:160,&quot;width&quot;:438,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!nFWX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 424w, https://substackcdn.com/image/fetch/$s_!nFWX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 848w, https://substackcdn.com/image/fetch/$s_!nFWX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 1272w, https://substackcdn.com/image/fetch/$s_!nFWX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f28bf00-28fd-458a-af24-42cc1e1e060d_438x160.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>3. <strong>Preservation Divergence Module</strong>: This module ensures that the model's performance on normal prompts is maintained during the unlearning process (Equation 5).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CJyi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CJyi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 424w, https://substackcdn.com/image/fetch/$s_!CJyi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 848w, https://substackcdn.com/image/fetch/$s_!CJyi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 1272w, https://substackcdn.com/image/fetch/$s_!CJyi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CJyi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png" width="451" height="117" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:117,&quot;width&quot;:451,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!CJyi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 424w, https://substackcdn.com/image/fetch/$s_!CJyi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 848w, https://substackcdn.com/image/fetch/$s_!CJyi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 1272w, https://substackcdn.com/image/fetch/$s_!CJyi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c4858a-88d6-438d-adaa-279556f7ba86_451x117.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Knowledge Negation Stage:</h3><p>In the knowledge negation stage, the concentrated harmful knowledge learned in the acquisition stage is surgically removed from the model. This process yields a safer LLM that retains its core capabilities and performance on benign tasks.</p><h3>Impressive Results and Ablation Study:</h3><p>The authors demonstrate the effectiveness of SKU through extensive experiments and ablation studies. SKU significantly reduces the harmful response rate while maintaining low perplexity scores and high BLEURT scores, indicating its ability to generate coherent, fluent, and semantically similar text to the safe original model (Table 1, Figure 3).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nR7k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nR7k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 424w, https://substackcdn.com/image/fetch/$s_!nR7k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 848w, https://substackcdn.com/image/fetch/$s_!nR7k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 1272w, https://substackcdn.com/image/fetch/$s_!nR7k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nR7k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png" width="847" height="415" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:415,&quot;width&quot;:847,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!nR7k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 424w, https://substackcdn.com/image/fetch/$s_!nR7k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 848w, https://substackcdn.com/image/fetch/$s_!nR7k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 1272w, https://substackcdn.com/image/fetch/$s_!nR7k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d12da19-5efd-4a23-8450-22e1fc9b6b5c_847x415.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Overall results of our proposed SKU with a number of baselines and the original LLM. Bold indicates the best performance and underline indicates the runner-up. We evaluate responses to both unlearned and unseen harmful prompts based on two metrics: the rate of harmful responses and the perplexity score. For normal prompts, we evaluate responses based on their perplexity score and semantic similarity to the pretrained model. Avg. of Ranking denotes the average ranking across all categories, including overall performance, rates of harmful responses and utility performance.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PfH8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PfH8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 424w, https://substackcdn.com/image/fetch/$s_!PfH8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 848w, https://substackcdn.com/image/fetch/$s_!PfH8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 1272w, https://substackcdn.com/image/fetch/$s_!PfH8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PfH8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png" width="957" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:957,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!PfH8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 424w, https://substackcdn.com/image/fetch/$s_!PfH8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 848w, https://substackcdn.com/image/fetch/$s_!PfH8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 1272w, https://substackcdn.com/image/fetch/$s_!PfH8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f81ebdf-a647-4452-ace0-a7d53fba072f_957x268.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 3: The performance of SKU with a number of baselines on LLAMA2-7B. Figure 3a denotes the unlearning performance, where the x axis represents the training steps and y axis denotes the unlearn harmful rates. Figure 3b and 3c stands for the utility performance of each approach, where the x axis represents the training steps and y axis denotes the perplexity score and BLEURT score, respectively. The orange line represents the performance of SKU.</figcaption></figure></div><p>The ablation study (Table 2) further validates the importance of each module in the harmful knowledge acquisition stage, showing that removing any of the modules degrades the unlearning performance.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aQ7G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aQ7G!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 424w, https://substackcdn.com/image/fetch/$s_!aQ7G!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 848w, https://substackcdn.com/image/fetch/$s_!aQ7G!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 1272w, https://substackcdn.com/image/fetch/$s_!aQ7G!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aQ7G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png" width="858" height="439" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/08fea346-0cde-4314-80db-4017d4e87b53_858x439.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:439,&quot;width&quot;:858,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!aQ7G!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 424w, https://substackcdn.com/image/fetch/$s_!aQ7G!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 848w, https://substackcdn.com/image/fetch/$s_!aQ7G!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 1272w, https://substackcdn.com/image/fetch/$s_!aQ7G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08fea346-0cde-4314-80db-4017d4e87b53_858x439.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Table 2: Ablation study of SKU on of each module of SKU. For each LLM, we iteratively remove each novel modules contained in SKU. Bolden represents the best performance and underline indicates the runner-up.</figcaption></figure></div><h3>Impact and Future Directions:</h3><p>The potential impact of SKU on the future of AI safety is significant. By enabling targeted unlearning of harmful knowledge in LLMs while preserving their core capabilities, SKU paves the way for safer and more trustworthy AI systems. This pioneering research opens up new possibilities for the responsible deployment of LLMs in real-world applications.</p><p>As the field of AI continues to evolve, the work of Liu et al. lays a solid foundation for further research into machine unlearning and its role in ensuring the safety and reliability of language models. Future directions may include exploring the applicability of SKU to other AI domains, investigating its robustness against adversarial attacks, and developing more advanced unlearning techniques.</p><h3>Conclusion:</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VY5N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VY5N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 424w, https://substackcdn.com/image/fetch/$s_!VY5N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 848w, https://substackcdn.com/image/fetch/$s_!VY5N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 1272w, https://substackcdn.com/image/fetch/$s_!VY5N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VY5N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png" width="888" height="316" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:316,&quot;width&quot;:888,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!VY5N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 424w, https://substackcdn.com/image/fetch/$s_!VY5N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 848w, https://substackcdn.com/image/fetch/$s_!VY5N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 1272w, https://substackcdn.com/image/fetch/$s_!VY5N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a5a9f58-e260-42d8-a702-05857141b9f7_888x316.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>"Towards Safer Large Language Models through Machine Unlearning" by Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, and Meng Jiang presents a groundbreaking approach to addressing the challenge of harmful content generation in LLMs. By introducing <em><strong>Selective Knowledge negation Unlearning (SKU)</strong></em>, the authors demonstrate that it is possible to effectively remove harmful knowledge from these models while maintaining their performance on benign tasks. This research marks a significant step forward in the pursuit of safer and more trustworthy AI systems, opening up new avenues for responsible innovation in the field of artificial intelligence.</p><div><hr></div><p>Also check out-</p><div id="youtube2-YKfBaV9mDV0" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;YKfBaV9mDV0&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/YKfBaV9mDV0?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><div><hr></div><p><strong>About me</strong>: I&#8217;m Saahil Gupta, an electrical engineer turned data scientist turned prompt engineer. I&#8217;m on a mission to democratize generative AI through ABCP&#8212;world&#8217;s first Gen AI-only news channel.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H3WI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H3WI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H3WI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H3WI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H3WI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H3WI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg" width="192" height="192" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:400,&quot;resizeWidth&quot;:192,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn" title="View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn" srcset="https://substackcdn.com/image/fetch/$s_!H3WI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H3WI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H3WI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H3WI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F083eae59-15ef-4523-91ed-ac7486be4c92_400x400.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div><hr></div><p>We curate this AI newsletter daily for free. Your support keeps us motivated. If you find it valuable, please do subscribe &amp; share it with your friends using the links below!</p><div class="captioned-button-wrap" data-attrs="{&quot;url&quot;:&quot;https://www.anybodycanprompt.com/p/machine-unlearning-selective-knowledge?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="CaptionedButtonToDOM"><div class="preamble"><p class="cta-caption">Thank you for reading Anybody Can Prompt. This post is public so feel free to share it.</p></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.anybodycanprompt.com/p/machine-unlearning-selective-knowledge?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.anybodycanprompt.com/p/machine-unlearning-selective-knowledge?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p></div><p></p>]]></content:encoded></item><item><title><![CDATA[Creating 'Confident' AI Systems That Know What They Don't Know]]></title><description><![CDATA[Have you wished for an AI assistant that can honestly say, "&#120284; &#120305;&#120316;&#120315;'&#120321; &#120312;&#120315;&#120316;&#120324;," instead of making things up? &#129300; Have you ever been frustrated with AI language models that confidently provide &#120310;&#120315;&#120302;&#120304;&#120304;&#120322;&#120319;&#120302;&#120321;&#120306; or fabricated information? &#128587;&#8205;&#9792;&#65039;&#128587;&#8205;&#9794;&#65039;If you answered "&#120300;&#120280;&#120294;" to either of these questions, you can not afford to miss this post! Check out our latest edition of &#120276;&#120284; &#120319;&#120306;&#120320;&#120306;&#120302;&#120319;&#120304;&#120309; &#120319;&#120306;&#120304;&#120302;&#120317; and let us know your thoughts in comments!]]></description><link>https://www.anybodycanprompt.com/p/creating-confident-ai-systems-that</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/creating-confident-ai-systems-that</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Wed, 05 Jun 2024 10:51:07 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d6830042-a5a5-4ebe-96bd-776c2ed2530d_1280x720.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>Welcome Back, </strong><em><strong>Generative AI</strong></em><strong> Enthusiasts!</strong></p><blockquote><p><em>P.S. </em>It takes just <strong>5 minutes</strong> a day to stay ahead of the fast-evolving generative AI curve. Ditch BORING long-form research papers and consume the insights through a &lt;5-minute FUN &amp; ENGAGING short-form TRENDING podcasts while multitasking. Join our fastest growing community of 25,000 researchers and become Gen AI-ready TODAY...</p></blockquote><p><em>Watch Time: 4 mins (<strong>Link Below</strong>)</em></p><div id="youtube2-tLHm3ec5gC4" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;tLHm3ec5gC4&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/tLHm3ec5gC4?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>In the rapidly evolving field of Gen AI, creating reliable and trustworthy language models has been a significant challenge. AI systems often generate inaccurate or fabricated information without indicating their level of confidence or uncertainty. This lack of self-awareness can lead to misinterpretation and mistrust in AI-generated responses. However, a groundbreaking research paper titled <em><strong>"<a href="https://arxiv.org/pdf/2405.20974">SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales</a>"</strong></em> introduces a novel framework that addresses this critical issue.</p><p><em><strong>Authors</strong></em>- Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, and Jing Gao from Purdue University, University of Illinois Urbana-Champaign, University of Southern California, and The Hong Kong University of Science and Technology</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H5BR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H5BR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 424w, https://substackcdn.com/image/fetch/$s_!H5BR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 848w, https://substackcdn.com/image/fetch/$s_!H5BR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 1272w, https://substackcdn.com/image/fetch/$s_!H5BR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H5BR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png" width="682" height="232" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b375c00e-b0d5-478e-aece-654812986993_682x232.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:232,&quot;width&quot;:682,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!H5BR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 424w, https://substackcdn.com/image/fetch/$s_!H5BR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 848w, https://substackcdn.com/image/fetch/$s_!H5BR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 1272w, https://substackcdn.com/image/fetch/$s_!H5BR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb375c00e-b0d5-478e-aece-654812986993_682x232.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">The comparison between SaySelf and previous work. SaySelf can produce the self-reflective rationale that explains why the model is uncertain and the fine-grained and accurate confidence estimates. This simple example is constructed for illustration purposes, and the reasoning chain is omitted for brevity.</figcaption></figure></div><p><strong>The SaySelf Framework:</strong></p><p>Developed by researchers from Purdue University, University of Illinois Urbana-Champaign, University of Southern California, and The Hong Kong University of Science and Technology, SaySelf is a two-stage training framework that teaches language models to express confidence and generate self-reflective rationales.</p><p>The <em><strong>first stage</strong></em> involves supervised fine-tuning, where the model is trained on a dataset containing questions, answers, reasoning chains, self-reflective rationales, and confidence estimates. The rationales are generated by analyzing multiple sampled reasoning chains and summarizing the uncertainties in the model's knowledge.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Qzxf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qzxf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 424w, https://substackcdn.com/image/fetch/$s_!Qzxf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 848w, https://substackcdn.com/image/fetch/$s_!Qzxf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 1272w, https://substackcdn.com/image/fetch/$s_!Qzxf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qzxf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png" width="658" height="546" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:546,&quot;width&quot;:658,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qzxf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 424w, https://substackcdn.com/image/fetch/$s_!Qzxf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 848w, https://substackcdn.com/image/fetch/$s_!Qzxf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 1272w, https://substackcdn.com/image/fetch/$s_!Qzxf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff567d82a-35f7-42ff-8ba2-7d5b2ee0e15f_658x546.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The overview of SaySelf, consisting of the supervised fine-tuning and reinforcement learning from task supervision stages. The former stage trains LLMs to generate self-reflective rationales and confidence estimates based on multiple sampling, and the latter stage employs reinforcement learning to further calibrate the confidence estimates based on task supervision. q, s, c, and r denote question, response, confidence estimate, and self-reflective rationale respectively</figcaption></figure></div><p>The <em><strong>second stage</strong></em> employs reinforcement learning to further calibrate the model's confidence estimates. A carefully designed reward function incentivizes the model to provide accurate, high-confidence predictions while penalizing overconfidence in incorrect responses.</p><p><strong>Impressive Results:</strong></p><p>The effectiveness of SaySelf is demonstrated through extensive experiments on various datasets, including HotpotQA, TruthfulQA, StrategyQA, FEVER, HaluEval, and ParaRel. The results show that SaySelf significantly reduces calibration error compared to previous approaches, such as direct prompting and self-consistency, while maintaining strong task performance.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ww1L!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ww1L!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 424w, https://substackcdn.com/image/fetch/$s_!Ww1L!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 848w, https://substackcdn.com/image/fetch/$s_!Ww1L!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 1272w, https://substackcdn.com/image/fetch/$s_!Ww1L!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ww1L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png" width="652" height="172" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:172,&quot;width&quot;:652,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Ww1L!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 424w, https://substackcdn.com/image/fetch/$s_!Ww1L!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 848w, https://substackcdn.com/image/fetch/$s_!Ww1L!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 1272w, https://substackcdn.com/image/fetch/$s_!Ww1L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c89466-e7b6-480e-b892-9224d2853ea1_652x172.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">The ECE evaluation results of baselines, SaySelf, and various ablations. Lower is better. HotpotQA is the only in-distribution dataset.</figcaption></figure></div><p>One of the key strengths of SaySelf is its ability to generate faithful self-reflective rationales that capture the model's internal uncertainties. These rationales provide a clear explanation of the model's confidence levels, making the AI's decision-making process more transparent and understandable to users.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YfN0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YfN0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 424w, https://substackcdn.com/image/fetch/$s_!YfN0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 848w, https://substackcdn.com/image/fetch/$s_!YfN0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 1272w, https://substackcdn.com/image/fetch/$s_!YfN0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YfN0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png" width="670" height="517" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:517,&quot;width&quot;:670,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!YfN0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 424w, https://substackcdn.com/image/fetch/$s_!YfN0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 848w, https://substackcdn.com/image/fetch/$s_!YfN0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 1272w, https://substackcdn.com/image/fetch/$s_!YfN0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42d576fd-f559-46e9-9ef6-d8ebe8eeb6d4_670x517.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Case studies of SaySelf&#8217;s capability to generate insightful self-reflective rationales that effectively capture the internal uncertainty in LLMs. Various clusters illustrate a selection from 100 sampled responses, and the rationale is generated by LLMs.</figcaption></figure></div><p>Ablation studies further confirm the importance of both training stages and the rationale generation in achieving superior calibration and trustworthiness.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eiTN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eiTN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 424w, https://substackcdn.com/image/fetch/$s_!eiTN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 848w, https://substackcdn.com/image/fetch/$s_!eiTN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 1272w, https://substackcdn.com/image/fetch/$s_!eiTN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eiTN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png" width="670" height="451" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/225fafe7-17fa-4719-886f-db598c958ff6_670x451.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:451,&quot;width&quot;:670,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!eiTN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 424w, https://substackcdn.com/image/fetch/$s_!eiTN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 848w, https://substackcdn.com/image/fetch/$s_!eiTN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 1272w, https://substackcdn.com/image/fetch/$s_!eiTN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F225fafe7-17fa-4719-886f-db598c958ff6_670x451.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Implications and Future Directions:</strong></p><p>The SaySelf framework marks a significant milestone in the development of reliable and accountable AI systems. By enabling language models to express confidence and uncertainty through self-reflective rationales, SaySelf paves the way for more trustworthy AI assistants across various applications, such as customer support, content generation, and decision support systems.</p><p>Moreover, the research behind SaySelf opens up exciting possibilities for future advancements in AI alignment and interactive learning. As AI systems become more self-aware and capable of communicating their confidence levels, users can engage in more meaningful interactions and provide targeted feedback to improve the model's performance over time.</p><p><strong>Conclusion:</strong></p><p>The SaySelf framework introduced in the research paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" represents a significant breakthrough in creating reliable and trustworthy AI language models. By addressing the critical issue of overconfidence and lack of self-awareness, SaySelf brings us closer to AI systems that we can confidently rely on in various real-world applications.</p><p>As the field of Gen AI continues to advance, frameworks like SaySelf will play a crucial role in bridging the gap between cutting-edge research and practical business solutions. By prioritizing transparency, accountability, and user trust, we can unlock the full potential of AI to drive innovation and solve complex problems across industries.</p><p>Also check out-</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8fe1c748-e388-4543-af49-059389ca65db&quot;,&quot;caption&quot;:&quot;Benchmarking Benchmark Leakage in Large Language Models&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LLM Training Scandal &#129327;&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:225949190,&quot;name&quot;:&quot;Anybody Can Prompt | AI News&quot;,&quot;bio&quot;:&quot;Welcome to Anybody Can Prompt (ABCP), the AI-driven news channel for generative AI! Get the latest in AI news, trends, technology updates, machine learning, and research. By AI, for AI, we bring groundbreaking insights in Artificial Intelligence.&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d5d5c7a-8ea1-4743-bb7e-bcf33a61a26e_5000x5000.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-05-02T04:32:22.145Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/youtube/w_728,c_limit/IAmBVfl-IhM&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://anybodycanprompt.substack.com/p/llm-training-scandal&quot;,&quot;section_name&quot;:&quot;Gen AI Research&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:144227806,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:1,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Anybody Can Prompt&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7fa5ef3-20b2-4c6c-b50c-fc1c38474504_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><p><strong>About me</strong>: I&#8217;m Saahil Gupta, an electrical engineer turned data scientist turned prompt engineer. I&#8217;m on a mission to democratize generative AI through ABCP&#8212;world&#8217;s first Gen AI-only news channel.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://www.linkedin.com/in/saahilg/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fNxI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fNxI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fNxI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fNxI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fNxI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg" width="166" height="166" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:400,&quot;resizeWidth&quot;:166,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.linkedin.com/in/saahilg/&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn" title="View Saahil Gupta &#127470;&#127475;&#8217;s profile on LinkedIn" srcset="https://substackcdn.com/image/fetch/$s_!fNxI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fNxI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fNxI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fNxI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F903410c3-c81d-485c-9bba-67fc2c9db77a_400x400.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div><hr></div><p>We curate this AI newsletter daily for free. Your support keeps us motivated. If you find it valuable, please do subscribe &amp; share it with your friends using the links below!</p><div class="captioned-button-wrap" data-attrs="{&quot;url&quot;:&quot;https://www.anybodycanprompt.com/p/creating-confident-ai-systems-that?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="CaptionedButtonToDOM"><div class="preamble"><p class="cta-caption">Thank you for reading Anybody Can Prompt. This post is public so feel free to share it.</p></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.anybodycanprompt.com/p/creating-confident-ai-systems-that?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.anybodycanprompt.com/p/creating-confident-ai-systems-that?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p></div><p></p>]]></content:encoded></item><item><title><![CDATA[Keep Your Content Yours – Learn 7 Crucial Watermarking Techniques]]></title><description><![CDATA[Have you ever worried about someone stealing your work or wondered how to protect your content from plagiarism effectively?]]></description><link>https://www.anybodycanprompt.com/p/keep-your-content-yours-learn-7-crucial</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/keep-your-content-yours-learn-7-crucial</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Mon, 06 May 2024 15:31:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/_xxVVOuN5Us" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div id="youtube2-_xxVVOuN5Us" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;_xxVVOuN5Us&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/_xxVVOuN5Us?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Have you ever worried about someone stealing your work or wondered how to protect your content from plagiarism effectively? In the digital age, safeguarding your intellectual property is crucial. Discover seven advanced text watermarking techniques that can secure your content seamlessly.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZYpk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZYpk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 424w, https://substackcdn.com/image/fetch/$s_!ZYpk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 848w, https://substackcdn.com/image/fetch/$s_!ZYpk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 1272w, https://substackcdn.com/image/fetch/$s_!ZYpk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZYpk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png" width="1048" height="739" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:739,&quot;width&quot;:1048,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:311359,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZYpk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 424w, https://substackcdn.com/image/fetch/$s_!ZYpk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 848w, https://substackcdn.com/image/fetch/$s_!ZYpk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 1272w, https://substackcdn.com/image/fetch/$s_!ZYpk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F919a5f6e-524b-4a62-b7bf-374cf919420d_1048x739.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h6><em><strong>Image credits- A Survey of Text Watermarking in the Era of Large Language Models <a href="https://arxiv.org/pdf/2312.07913">(Paper)</a></strong></em></h6><div><hr></div><p><strong>1. Format-Based Watermarking: Learn how shifting text lines and spaces can protect your content.</strong></p><ul><li><p><strong>Technique</strong>: Adjusts the positioning of text lines and spaces without altering content.</p></li><li><p><strong>Benefit</strong>: Maintains text integrity while embedding undetectable security markers.</p></li></ul><p><strong>2. Lexical-Based Watermarking: Explore how synonym substitution can enhance content security invisibly.</strong></p><ul><li><p><strong>Technique</strong>: Replaces key words with synonyms to embed watermarks.</p></li><li><p><strong>Benefit</strong>: Provides a subtle yet effective layer of security without changing text meaning.</p></li></ul><p><strong>3. Syntactic-Based Watermarking: Understand how modifying sentence structure can embed unique watermarks.</strong></p><ul><li><p><strong>Technique</strong>: Alters sentence structures such as using passive voice or cleft sentences.</p></li><li><p><strong>Benefit</strong>: Embeds watermarks deep within the grammar, making them difficult to remove.</p></li></ul><p><strong>4. Generation-Based Watermarking: See how AI models can create naturally watermarked content from scratch.</strong></p><ul><li><p><strong>Technique</strong>: Utilizes AI to generate content that includes embedded watermarks naturally.</p></li><li><p><strong>Benefit</strong>: Ensures original content is born with inherent protection.</p></li></ul><p><strong>5. Training Time Watermarking: Integrates watermarks directly into the LLM&#8217;s training process.</strong></p><ul><li><p><strong>Technique</strong>: Embeds watermarks during the machine learning training phase.</p></li><li><p><strong>Benefit</strong>: Watermarks become an integral part of the model's output, enhancing security from the start.</p></li></ul><p><strong>6. Watermarking During Logits Generation: Embeds watermarks during the logits output phase.</strong></p><ul><li><p><strong>Technique</strong>: Inserts security features during the creation of logits, which are the model's raw outputs.</p></li><li><p><strong>Benefit</strong>: Secures content at a fundamental level within the AI&#8217;s processing pipeline.</p></li></ul><p><strong>7. Watermarking During Token Sampling: Applies watermarking in the final stage of text generation.</strong></p><ul><li><p><strong>Technique</strong>: Ensures all generated tokens include watermark data.</p></li><li><p><strong>Benefit</strong>: Protects the entire content output without compromising on text quality.</p></li></ul><div><hr></div><p>Text watermarking is more than just a security measure; it's an essential tool for anyone looking to protect their digital content. Whether you're a blogger, academic, or content creator, employing these seven techniques will fortify your work against plagiarism and unauthorized use. Embrace these strategies to keep your content uniquely yours.</p><div><hr></div><p>Always stay updated with our generative AI newsletter. Subscribe today to receive insights directly in your inbox.</p><div class="embedded-publication-wrap" data-attrs="{&quot;id&quot;:2551682,&quot;name&quot;:&quot;Anybody Can Prompt&quot;,&quot;logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7fa5ef3-20b2-4c6c-b50c-fc1c38474504_1280x1280.png&quot;,&quot;base_url&quot;:&quot;https://anybodycanprompt.substack.com&quot;,&quot;hero_text&quot;:&quot;Welcome to Anybody Can Prompt (ABCP), the world's first news channel created by AI, for AI, and about AI. We bring you the latest and most groundbreaking news in Gen AI. Subscribe today to stay ahead of the curve in the ever-evolving world of Gen AI!&quot;,&quot;author_name&quot;:&quot;Anybody Can Prompt (ABCP)&quot;,&quot;show_subscribe&quot;:true,&quot;logo_bg_color&quot;:&quot;#002449&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPublicationToDOMWithSubscribe"><div class="embedded-publication show-subscribe"><a class="embedded-publication-link-part" native="true" href="https://anybodycanprompt.substack.com?utm_source=substack&amp;utm_campaign=publication_embed&amp;utm_medium=web"><img class="embedded-publication-logo" src="https://substackcdn.com/image/fetch/$s_!jE5B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7fa5ef3-20b2-4c6c-b50c-fc1c38474504_1280x1280.png" width="56" height="56" style="background-color: rgb(0, 36, 73);"><span class="embedded-publication-name">Anybody Can Prompt</span><div class="embedded-publication-hero-text">Welcome to Anybody Can Prompt (ABCP), the world's first news channel created by AI, for AI, and about AI. We bring you the latest and most groundbreaking news in Gen AI. Subscribe today to stay ahead of the curve in the ever-evolving world of Gen AI!</div><div class="embedded-publication-author-name">By Anybody Can Prompt (ABCP)</div></a><form class="embedded-publication-subscribe" method="GET" action="https://anybodycanprompt.substack.com/subscribe?"><input type="hidden" name="source" value="publication-embed"><input type="hidden" name="autoSubmit" value="true"><input type="email" class="email-input" name="email" placeholder="Type your email..."><input type="submit" class="button primary" value="Subscribe"></form></div></div><div><hr></div>]]></content:encoded></item><item><title><![CDATA[LLM Training Scandal 🤯]]></title><description><![CDATA[LLM's Dirty Secret: Training on Test Data Exposed]]></description><link>https://www.anybodycanprompt.com/p/llm-training-scandal</link><guid isPermaLink="false">https://www.anybodycanprompt.com/p/llm-training-scandal</guid><dc:creator><![CDATA[The Responsible AI Digest]]></dc:creator><pubDate>Thu, 02 May 2024 04:32:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/IAmBVfl-IhM" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>Benchmarking Benchmark Leakage in Large Language Models</h2><div id="youtube2-IAmBVfl-IhM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;IAmBVfl-IhM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/IAmBVfl-IhM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h3>Background:</h3><p>The study focuses on the growing problem of benchmark dataset leakage in the training of large language models (LLMs). This leakage can skew benchmark effectiveness and lead to unfair comparisons, hindering the development of the field.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dx2L!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dx2L!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dx2L!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dx2L!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dx2L!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dx2L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg" width="1175" height="1358" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1358,&quot;width&quot;:1175,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;This leaderboard shows the relative possibility that various models conduct verbatim training on the training set of a benchmark over test set to enhance capabilities (measured based on PPL and N-gram Accuracy). Models exhibiting near-zero possibilities suggest either the absence of training and test split or the use of both splits in the training process. This metric does not imply cheating, but rather indicates the potential use of the benchmark data during the (pre-)training phase; while using benchmarks to enhance capabilities is acceptable, the lack of relevant documentation can reduce transparency, potentially resulting in unfair comparisons and hindering the field's healthy development.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="This leaderboard shows the relative possibility that various models conduct verbatim training on the training set of a benchmark over test set to enhance capabilities (measured based on PPL and N-gram Accuracy). Models exhibiting near-zero possibilities suggest either the absence of training and test split or the use of both splits in the training process. This metric does not imply cheating, but rather indicates the potential use of the benchmark data during the (pre-)training phase; while using benchmarks to enhance capabilities is acceptable, the lack of relevant documentation can reduce transparency, potentially resulting in unfair comparisons and hindering the field's healthy development." title="This leaderboard shows the relative possibility that various models conduct verbatim training on the training set of a benchmark over test set to enhance capabilities (measured based on PPL and N-gram Accuracy). Models exhibiting near-zero possibilities suggest either the absence of training and test split or the use of both splits in the training process. This metric does not imply cheating, but rather indicates the potential use of the benchmark data during the (pre-)training phase; while using benchmarks to enhance capabilities is acceptable, the lack of relevant documentation can reduce transparency, potentially resulting in unfair comparisons and hindering the field's healthy development." srcset="https://substackcdn.com/image/fetch/$s_!dx2L!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dx2L!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dx2L!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dx2L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe100915f-bc03-49ff-a72d-3a44bf7e7185_1175x1358.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Objective:</h3><p>To develop a detection pipeline that can identify if LLMs have been trained on benchmark data, thus ensuring the integrity of model evaluations.</p><h3>Methodology:</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MA9g!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MA9g!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 424w, https://substackcdn.com/image/fetch/$s_!MA9g!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 848w, https://substackcdn.com/image/fetch/$s_!MA9g!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 1272w, https://substackcdn.com/image/fetch/$s_!MA9g!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MA9g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png" width="1456" height="775" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:775,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;img21&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="img21" title="img21" srcset="https://substackcdn.com/image/fetch/$s_!MA9g!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 424w, https://substackcdn.com/image/fetch/$s_!MA9g!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 848w, https://substackcdn.com/image/fetch/$s_!MA9g!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 1272w, https://substackcdn.com/image/fetch/$s_!MA9g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F133d7205-256d-466d-ac0d-ef0fd0756e30_1705x908.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The researchers introduce two metrics&#8212;Perplexity and N-gram Accuracy&#8212;to gauge prediction precision on benchmarks and detect data leakages. They apply these metrics to a selection of 31 LLMs, evaluating their performance on mathematical reasoning tasks.</p><h3>Key Findings:</h3><p>- Significant instances of potential data leakage were found across several models.</p><p>- Models like Qwen-1.8B, Aquila2, and InternLM2 showed high levels of prediction accuracy on test datasets, suggesting prior exposure during training.</p><p>- The study proposes the adoption of a "Benchmark Transparency Card" for documenting model training and data usage, promoting transparency and ethical development.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vmwx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vmwx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 424w, https://substackcdn.com/image/fetch/$s_!vmwx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 848w, https://substackcdn.com/image/fetch/$s_!vmwx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 1272w, https://substackcdn.com/image/fetch/$s_!vmwx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vmwx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png" width="1456" height="548" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:548,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;img21&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="img21" title="img21" srcset="https://substackcdn.com/image/fetch/$s_!vmwx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 424w, https://substackcdn.com/image/fetch/$s_!vmwx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 848w, https://substackcdn.com/image/fetch/$s_!vmwx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 1272w, https://substackcdn.com/image/fetch/$s_!vmwx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2d88123-cf1a-42b9-a5c1-9290e686aec3_1805x679.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Implications:</h3><p>This research underscores the need for clear documentation and ethical guidelines in AI development to prevent data leakage and ensure fair and accurate model evaluations.</p><h3><br>Links:</h3><ol><li><p><a href="https://gair-nlp.github.io/benbench/">Blog</a></p></li><li><p><a href="https://twitter.com/SinclairWang1/status/1785298912942948633">Twitter</a></p></li><li><p><a href="https://huggingface.co/spaces/GAIR/BenBench">HF</a></p></li><li><p><a href="https://arxiv.org/abs/2404.18824">arXiv</a></p></li><li><p><a href="https://github.com/GAIR-NLP/benbench">Code</a><br></p></li></ol><p></p>]]></content:encoded></item></channel></rss>