Including Evidence in Question Confuses ChatGPT, Lowers Its Accuracy, Study Finds


Agency News PTI | Apr 06, 2024 10:24 AM IST


New Delhi, Apr 6 (PTI) Asking ChatGPT a health-related question that included evidence was seen to confuse the AI-powered bot and affect its ability to produce accurate answers, according to new research.

Scientists were "not sure" why this happens, but they hypothesised that including the evidence in the question "adds too much noise", thereby lowering the chatbot's accuracy.


They said that as large language models (LLMs) like ChatGPT explode in popularity, there is a potential risk to the growing number of people using online tools for key health information. LLMs are trained on massive amounts of textual data and hence are capable of producing content in natural language.

The researchers from the Commonwealth Scientific and Industrial Research Organisation (CSIRO) and The University of Queensland (UQ), Australia, investigated a hypothetical scenario of an average person asking ChatGPT if 'X' treatment has a positive effect on condition 'Y'. They looked at two question formats - either just a question, or a question biased with supporting or contrary evidence.


The team presented 100 questions, which ranged from 'Can zinc help treat the common cold?' to 'Will drinking vinegar dissolve a stuck fish bone?'. ChatGPT's response was compared to the known correct response, or 'ground truth', which is based on existing medical knowledge.

The results revealed that while the chatbot produced answers with 80 per cent accuracy when asked in a question-only format, its accuracy fell to 63 per cent when given a prompt biased with evidence. Prompts are phrases or instructions given to a chatbot in natural language to trigger a response.
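
The comparison described above can be sketched in a few lines of Python. This is a minimal illustration only, not the researchers' code: the function names `build_prompt` and `accuracy`, the "Evidence:" prompt layout and the exact-match scoring rule are all assumptions made here, and the sample answers are made up purely to exercise the code.

```python
from typing import List, Optional


def build_prompt(question: str, evidence: Optional[str] = None) -> str:
    """Build a question-only prompt, or an evidence-biased prompt.

    The "Evidence:" layout is a hypothetical format chosen for this sketch.
    """
    if evidence is None:
        return question
    return f"{question}\nEvidence: {evidence}"


def accuracy(answers: List[str], ground_truth: List[str]) -> float:
    """Return the percentage of answers that match the ground truth.

    Matching is a simple case-insensitive string comparison, which is an
    assumption of this sketch rather than the study's scoring method.
    """
    if not answers or len(answers) != len(ground_truth):
        raise ValueError("answers and ground_truth must be non-empty and equal in length")
    correct = sum(
        a.strip().lower() == g.strip().lower()
        for a, g in zip(answers, ground_truth)
    )
    return 100.0 * correct / len(answers)


# Made-up answers for illustration only; these are not data from the study.
sample_answers = ["yes", "no", "Yes", "no"]
sample_truth = ["yes", "no", "no", "no"]
sample_accuracy = accuracy(sample_answers, sample_truth)

# Drop between the two reported accuracies, in percentage points.
reported_drop = 80 - 63
```

On the reported figures, the fall from 80 per cent to 63 per cent is a drop of 17 percentage points.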

"We're not sure why this happens. But given this occurs whether the evidence given is correct or not, perhaps the evidence adds too much noise, thus lowering accuracy," said Bevan Koopman, CSIRO Principal Research Scientist and Associate Professor at UQ.

The team said continued research on using LLMs to answer people's health-related questions is needed as people increasingly search for information online through tools such as ChatGPT.

"The widespread popularity of using LLMs online for answers on people's health is why we need continued research to inform the public about risks and to help them optimise the accuracy of their answers," said Koopman.

"While LLMs have the potential to greatly improve the way people access information, we need more research to understand where they are effective and where they are not," said Koopman.

The peer-reviewed study was presented at Empirical Methods in Natural Language Processing (EMNLP) in December 2023. EMNLP is a natural language processing conference.

(The above story is verified and authored by Press Trust of India (PTI) staff. PTI, India’s premier news agency, employs more than 400 journalists and 500 stringers to cover almost every district and small town in India. The views appearing in the above post do not reflect the opinions of LatestLY.)
