{"id":75924,"date":"2023-08-08T16:47:24","date_gmt":"2023-08-08T11:17:24","guid":{"rendered":"https:\/\/www.techzimo.com\/?p=75924"},"modified":"2023-08-08T16:47:24","modified_gmt":"2023-08-08T11:17:24","slug":"why-is-chatgpt-becoming-a-fool-in-fundamental-math","status":"publish","type":"post","link":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/","title":{"rendered":"Why is ChatGPT becoming a fool in fundamental math?"},"content":{"rendered":"<p><span style=\"font-weight: 400;\"><strong>ChatGPT becoming a fool:<\/strong> With that in mind, since becoming widely available to the general public last year, artificial-intelligence chatbots have astonished those who experimented with them, starting a worldwide development race, and even His influence on writers and actors contributed to the strike in Hollywood.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">AI devices have also raised fears that they will continually improve and endanger humanity. OpenAI&#8217;s ChatGPT appeared to the general public in November, sparking the recent frenzy, followed in March by ChatGPT-4, which aimed to be more effective than its predecessor.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">But new studies launched this week reveal a fundamental enterprise of rising artificial intelligence: ChatGPT has become worse at performing positive elementary math(ChatGPT becoming a fool) tasks.<\/span><\/p>\n<h3><b>Check what Stanford professor James Xu said<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Researchers at Stanford University and the University of California, Berkeley, said the fallout is an example of a phenomenon regarded by <a href=\"https:\/\/www.techzimo.com\/chat-gpt-login\/\">AI<\/a> developers as float, where efforts to improve one part of a fairly complex AI fashion end up making other parts of the fashion worse. Are.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Converting it into one pathway can make it worse in different directions, said Stanford professor James Xu, affiliated with the school&#8217;s AI Lab and one of the authors of the brand-new research. That always makes it very hard to extemporize.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">On the surface, ChatGPT might sound remarkable &#8211; fun, well-versed in any subject matter, and impeccably grammatical. Some people have given ChatGPT standardized testing that it has been successful. However, at different times the chatbot will flub even simple maths.<\/span><\/p>\n<h3><b>ChatGPT: Model 3.5, available to everyone online<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Computer Technician Ph.D. The goal of the team of researchers including Lingjiao Chen. The Stanford scholar, along with Xue and Berkeley&#8217;s Mattei Zaharia, aimed to systematically and repeatedly observe how the models juggle multiple responsibilities over time.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">So far, they&#8217;ve tested two versions of ChatGPT: Model 3.5, available to everyone online, and Model 4.0, available through a premium subscription.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The results are not entirely promising. They gave the chatbot a basic project: find out if the selected range is a higher range. This is the type of math(ChatGPT becoming a fool) problem that is complex for humans but simple for computer systems.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Is 17,077 on top? Is 17,947 prime? You can&#8217;t work it out unless you&#8217;re an expert, but it&#8217;s easy for computer systems to estimate. A PC can only strain the trouble &#8211; try dividing by two, 3, 5, etc., and see if anything works.<\/span><\/p>\n<h3><b>The top rate GPT-4 correctly identified<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">For overall exposure to music, the researchers assigned ChatGPT 1,000 unique numbers. In March, the top rate GPT-4 correctly identified whether or not 84% of the numbers were high. (Admittedly, pretty average performance for the computer.) By June its success rate had dropped to 51%.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Of the eight unique duties, GPT-4 did worse on six of them. The GPT-3.5 improved on six parameters but remained worse than its best sibling in most responsibilities.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Many people who played with the mod were taken aback at first, but over the years they&#8217;ve started to notice an increasing number of wrong answers or chatbots refusing to answer.<\/span><\/p>\n<h3><b>Chatbots are empirically more harmful at positive tasks<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">The Stanford-Berkeley team&#8217;s study shows empirically that this simply isn&#8217;t a true effect. Chatbots are empirically more harmful at positive tasks, including figuring out math queries(ChatGPT becoming a fool), responding to medical queries, and developing code.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Responding to questions about the new studies, <a href=\"https:\/\/www.techzimo.com\/openai-rolling-out-custom-designed-instruction-for-chatgpt\/\">OpenAI<\/a> said in a written announcement: While we launch new version models, our top priority is to make more advanced models smarter across the board. We are working hard to make sure that new versions improve a wide variety of functions. That said, our evaluation method is not the best, and we are constantly improving it.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The neat thing is that chatbots haven&#8217;t been universally bad. It has gone even further in some ways. In some tests, GPT-3.5, although less true normal, has progressed while GPT-4 has worsened.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The phenomenon of unexpected changes in flow makes sense to researchers who study systems and investigate AI, Xue said. We presumed it would seem right here, however, we were extremely surprised by how quickly the glide was happening.<\/span><\/p>\n<h3><b>Model-4 chatbot will answer 98% of queries<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">The Stanford-Berkeley investigators didn&#8217;t only ask <a href=\"https:\/\/en.wikipedia.org\/wiki\/ChatGPT\">ChatGPT<\/a> calculation queries. They also requested opinion questions to see if the chatbot could respond from a database of about 1,500 questions.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The Model-4 chatbot will answer 98% of queries in March. With Jun&#8217;s help, it gave solutions to only 23%, regularly avoided with extremely brief responses &#8211; declaring that the question became subjective and had no value as an AI.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This demonstrates something of what is happening with AI structures. Since the release of chatbots, a type of cottage enterprise devoted to so-called trigger engineering has emerged.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Time and again people experimenting with particular activities are trying to make the most of the fashion by finding a good way to ask questions to get the desired results. Although sometimes they are trying to trick the bots into saying something offensive or derogatory. (One famous and extremely effective technique involves tricking the AI into setting up an immodest dialogue with Niccol\u00f2 Machiavelli.)<\/span><\/p>\n<h3><b>AI models were much better at complex reasoning tasks<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Some of these strategies, or paths, are downright benign. Last year, Google research scientists Jason Wei and Danny Zhou published a paper showing that artificial intelligence models were much better at complex reasoning tasks when asked to tackle the problem one step at a time. In March this approach, called chain-of-thought prompting, turned out to be working well. But by June, Chingari had become much less effective.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Could the degradation of the ability to solve math(ChatGPT becoming a fool) problems be an unintended consequence of trying to prevent humans from giving outrageous feedback by tricking AI? Should this be an attempt to crack down on quick engineering and unintentionally screw up the hint that drives math performance? Should this be the result of trying to make AI less functional? The models are so complex that even the teams developing them will definitely not understand them.<\/span><\/p>\n<h3><b>Xue said his goal is not to skip a generation<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Xue said his goal is not to skip a generation. Alternatively, it is a far cry to reveal AI more closely. The Stanford and Berkeley teams will continue to systematically test AI models\u2014ChatGPT and others\u2014for multiple inquiries to empirically test their performance over the course of years.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We are used to considering knowledge as gaining knowledge of a problem and then building on it. As a side effect of its extreme complexity, AI can&#8217;t work that way. Rather it is a leap forward, a step ahead, and a wonderful take on a surprising way. Over the years, AI will likely continue to advance, although it&#8217;s miles away from a straight line.<\/span><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last year, artificial-intelligence chatbots have astonished those who experimented with them, starting a worldwide development race, and even His influence on writers and actors contributed to the strike in Hollywood. AI devices have also raised fears that they will [&hellip;]<\/p>\n","protected":false},"author":6,"featured_media":74644,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Why is ChatGPT becoming a fool in fundamental math? - Tech Zimo<\/title>\n<meta name=\"description\" content=\"ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last yeartop rate GPT-4 correctly identified\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Why is ChatGPT becoming a fool in fundamental math? - Tech Zimo\" \/>\n<meta property=\"og:description\" content=\"ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last yeartop rate GPT-4 correctly identified\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/\" \/>\n<meta property=\"og:site_name\" content=\"Tech Zimo\" \/>\n<meta property=\"article:author\" content=\"https:\/\/www.facebook.com\/VJAY.DEHRAJ\/\" \/>\n<meta property=\"article:published_time\" content=\"2023-08-08T11:17:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.techzimo.com\/wp-content\/uploads\/2023\/03\/ChatGPT.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"987\" \/>\n\t<meta property=\"og:image:height\" content=\"640\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Vijay Dehraj\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Vijay Dehraj\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/\",\"url\":\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/\",\"name\":\"Why is ChatGPT becoming a fool in fundamental math? - Tech Zimo\",\"isPartOf\":{\"@id\":\"https:\/\/www.techzimo.com\/#website\"},\"datePublished\":\"2023-08-08T11:17:24+00:00\",\"dateModified\":\"2023-08-08T11:17:24+00:00\",\"author\":{\"@id\":\"https:\/\/www.techzimo.com\/#\/schema\/person\/39466ebec1bcc645db73e1a7aeeddcac\"},\"description\":\"ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last yeartop rate GPT-4 correctly identified\",\"breadcrumb\":{\"@id\":\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.techzimo.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Why is ChatGPT becoming a fool in fundamental math?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.techzimo.com\/#website\",\"url\":\"https:\/\/www.techzimo.com\/\",\"name\":\"TechZimo\",\"description\":\"Latest Tech News &amp; Updates\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.techzimo.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.techzimo.com\/#\/schema\/person\/39466ebec1bcc645db73e1a7aeeddcac\",\"name\":\"Vijay Dehraj\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.techzimo.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/aca54e2c1aa640b716af24fb9a3661f2?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/aca54e2c1aa640b716af24fb9a3661f2?s=96&d=mm&r=g\",\"caption\":\"Vijay Dehraj\"},\"sameAs\":[\"https:\/\/www.facebook.com\/VJAY.DEHRAJ\/\",\"https:\/\/instagram.com\/vijay.dehraj\"],\"url\":\"https:\/\/www.techzimo.com\/author\/vijay\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Why is ChatGPT becoming a fool in fundamental math? - Tech Zimo","description":"ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last yeartop rate GPT-4 correctly identified","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/","og_locale":"en_US","og_type":"article","og_title":"Why is ChatGPT becoming a fool in fundamental math? - Tech Zimo","og_description":"ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last yeartop rate GPT-4 correctly identified","og_url":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/","og_site_name":"Tech Zimo","article_author":"https:\/\/www.facebook.com\/VJAY.DEHRAJ\/","article_published_time":"2023-08-08T11:17:24+00:00","og_image":[{"width":987,"height":640,"url":"https:\/\/www.techzimo.com\/wp-content\/uploads\/2023\/03\/ChatGPT.jpg","type":"image\/jpeg"}],"author":"Vijay Dehraj","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Vijay Dehraj","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/","url":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/","name":"Why is ChatGPT becoming a fool in fundamental math? - Tech Zimo","isPartOf":{"@id":"https:\/\/www.techzimo.com\/#website"},"datePublished":"2023-08-08T11:17:24+00:00","dateModified":"2023-08-08T11:17:24+00:00","author":{"@id":"https:\/\/www.techzimo.com\/#\/schema\/person\/39466ebec1bcc645db73e1a7aeeddcac"},"description":"ChatGPT becoming a fool: With that in mind, since becoming widely available to the general public last yeartop rate GPT-4 correctly identified","breadcrumb":{"@id":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.techzimo.com\/why-is-chatgpt-becoming-a-fool-in-fundamental-math\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.techzimo.com\/"},{"@type":"ListItem","position":2,"name":"Why is ChatGPT becoming a fool in fundamental math?"}]},{"@type":"WebSite","@id":"https:\/\/www.techzimo.com\/#website","url":"https:\/\/www.techzimo.com\/","name":"TechZimo","description":"Latest Tech News &amp; Updates","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.techzimo.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.techzimo.com\/#\/schema\/person\/39466ebec1bcc645db73e1a7aeeddcac","name":"Vijay Dehraj","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.techzimo.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/aca54e2c1aa640b716af24fb9a3661f2?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/aca54e2c1aa640b716af24fb9a3661f2?s=96&d=mm&r=g","caption":"Vijay Dehraj"},"sameAs":["https:\/\/www.facebook.com\/VJAY.DEHRAJ\/","https:\/\/instagram.com\/vijay.dehraj"],"url":"https:\/\/www.techzimo.com\/author\/vijay\/"}]}},"_links":{"self":[{"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/posts\/75924"}],"collection":[{"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/comments?post=75924"}],"version-history":[{"count":0,"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/posts\/75924\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/media\/74644"}],"wp:attachment":[{"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/media?parent=75924"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/categories?post=75924"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.techzimo.com\/wp-json\/wp\/v2\/tags?post=75924"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}