Your privacy, your choice

We use essential cookies to make sure the site can function. We also use optional cookies for advertising, personalisation of content, usage analysis, and social media.

By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection.

See our privacy policy for more information on the use of your personal data.

for further information and to change your choices.

Skip to main content

Table 5 Significance testing of different models on simple questions

From: Effectiveness of various general large language models in clinical consensus and case analysis in dental implantology: a comparative study

Group comparison

p-value

ChatGPT-4 vs. Qwen 2.0 72B

0.035

ChatGPT-4 vs. Claude 3 Opus

0.752

ChatGPT-4 vs. Gemini Pro 1.5(0801)

0.316

Qwen 2.0 72B vs. Claude 3 Opus

0.074

Qwen 2.0 72B vs. Gemini Pro 1.5(0801)

0.002

Claude 3 Opus vs. Gemini Pro 1.5(0801)

0.187