{"id":7810,"date":"2024-09-19T11:37:58","date_gmt":"2024-09-19T11:37:58","guid":{"rendered":"https:\/\/digitaltradecenter.com\/index.php\/2024\/09\/19\/public-asked-to-help-create-humanitys-last-exam-to-spot-when-ai-achieves-peak-intelligence\/"},"modified":"2024-09-19T11:37:58","modified_gmt":"2024-09-19T11:37:58","slug":"public-asked-to-help-create-humanitys-last-exam-to-spot-when-ai-achieves-peak-intelligence","status":"publish","type":"post","link":"https:\/\/digitaltradecenter.com\/index.php\/2024\/09\/19\/public-asked-to-help-create-humanitys-last-exam-to-spot-when-ai-achieves-peak-intelligence\/","title":{"rendered":"Public asked to help create \u2018humanity\u2019s last exam\u2019 to spot when AI achieves peak intelligence"},"content":{"rendered":"<p>Scientists are creating &#8220;humanity&#8217;s last exam&#8221; to test AI and see when it has reached expert-level intelligence.<\/p>\n<p>People are being asked to submit their questions and create &#8220;the world&#8217;s most difficult <strong>artificial intelligence<\/strong> test&#8221; by the Center for AI Safety (CAIS) and Scale AI.<\/p>\n<div class=\"sdc-site-outbrain sdc-site-outbrain--AR_6\" aria-hidden=\"true\" data-component-name=\"sdc-site-outbrain\" data-target=\"\" data-widget-mapping=\"\" data-installation-keys=\"\">    <\/div>\n<p>&#8220;Existing tests now have become too easy and we can no longer track AI developments well, or how far they are from becoming expert-level,&#8221; said the quiz creators in a statement about the test.<\/p>\n<p>A few years ago, AI was giving almost random answers to questions on exams &#8211; that&#8217;s no longer the case.<\/p>\n<p>Last week, <strong>OpenAI&#8217;s<\/strong> newest model, known as OpenAI o1, &#8220;destroyed the most popular reasoning benchmarks&#8221;, according to Dan Hendrycks, executive director of CAIS.<\/p>\n<div class=\"ad ad--teads\">        <\/div>\n<p>However, AI still isn&#8217;t able to answer difficult research questions and other intellectual questions.<\/p>\n<p>It also appears to score poorly on tests involving planning and visual pattern-recognition puzzles, according to Stanford University&#8217;s AI Index Report from April.<\/p>\n<p>Consequently, &#8220;humanity&#8217;s last exam&#8221; will require abstract reasoning to test how clever AI really is.<\/p>\n<p>The submissions shouldn&#8217;t be any ordinary quiz questions.<\/p>\n<p>&#8220;We found questions written by undergraduates tend to be too easy for the models,&#8221; the creators of the quiz said.<\/p>\n<p>Instead, they recommend that question writers have five or more years of experience in a technical industry job like SpaceX, or are a PhD student or above.<\/p>\n<p>The submissions should be difficult for non-experts to answer and &#8220;not easily answerable via a quick online search&#8221;, and trick questions should be avoided.<\/p>\n<p>&#8220;As a rule of thumb, if a randomly selected undergraduate can understand what is being asked, it is likely too easy for the frontier LLMs of today and tomorrow,&#8221; said the quiz creators.<\/p>\n<p>People who submit successful questions will be invited as co-authors on the paper and have a chance to win money from a $500,000 (\u00a3378,400) prize pool, with the writers of the best questions earning $5,000 (\u00a33,780) each.<\/p>\n<p>Questions should be submitted by 1 November.<\/p>\n<\/p>\n<div>This post appeared first on sky.com<\/div>\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scientists are creating &#8220;humanity&#8217;s last exam&#8221; to test AI and see when it has reached expert-level intelligence. People are being asked to submit their questions and create &#8220;the world&#8217;s most difficult artificial intelligence test&#8221; by the Center for AI Safety (CAIS) and Scale AI. &#8220;Existing tests now have become too easy and we can no <\/p>\n","protected":false},"author":1,"featured_media":7811,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24],"tags":[],"class_list":{"0":"post-7810","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-science"},"_links":{"self":[{"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/posts\/7810","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/comments?post=7810"}],"version-history":[{"count":0,"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/posts\/7810\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/media\/7811"}],"wp:attachment":[{"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/media?parent=7810"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/categories?post=7810"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/digitaltradecenter.com\/index.php\/wp-json\/wp\/v2\/tags?post=7810"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}