Register Now
Evaluating LLM Accuracy at Answering Quantitative Occupational Hygiene Questions
Anna Lee
Poster Presenter
University of Minnesota
Minneapolis, MN
Tue, 6/2: 10:00 AM - 11:00 AM CDT
1676
Ernest N. Morial New Orleans Convention Center
How effectively do different large language models respond to occupational hygiene queries? This research looked at three commonly used models - OpenAI's ChatGPT, Anthropic's Claude, and Google's Gemini - to assess accuracy and compare model performance when tasked with open-ended quantitative questions. Variations in prompts were applied when engaging the models to determine the impact of prompt engineering. Test methodology included a comparison between model engagement with standalone queries versus multi-turn conversations to provide insights on model effectiveness based on user tendencies.
P. Raynor, University of Minnesota, Minneapolis, MN, USA.
none
Keywords
Education and training
You have unsaved changes.