Register Now

Evaluating LLM Accuracy at Answering Quantitative Occupational Hygiene Questions

Anna Lee Poster Presenter
University of Minnesota
Minneapolis, MN 
 
Tue, 6/2: 10:00 AM - 11:00 AM CDT
1676 
Ernest N. Morial New Orleans Convention Center 

Description

How effectively do different large language models respond to occupational hygiene queries? This research looked at three commonly used models - OpenAI's ChatGPT, Anthropic's Claude, and Google's Gemini - to assess accuracy and compare model performance when tasked with open-ended quantitative questions. Variations in prompts were applied when engaging the models to determine the impact of prompt engineering. Test methodology included a comparison between model engagement with standalone queries versus multi-turn conversations to provide insights on model effectiveness based on user tendencies.

Co-Authors

P. Raynor, University of Minnesota, Minneapolis, MN, USA. 

Acknowledgements & References

none 

Keywords

Education and training