Print Close

Evaluating LLM Accuracy at Answering Quantitative Occupational Hygiene Questions

Presented During: Poster Session 2 - Student

Anna Lee Poster Presenter
University of Minnesota
Minneapolis, MN

Tue, 6/2: 10:00 AM - 11:00 AM CDT
1676

Ernest N. Morial New Orleans Convention Center

Room: Exhibit Hall - Start of the 1500 Aisle

Description

How effectively do different large language models respond to occupational hygiene queries? This research looked at three commonly used models - OpenAI's ChatGPT, Anthropic's Claude, and Google's Gemini - to assess accuracy and compare model performance when tasked with open-ended quantitative questions. Variations in prompts were applied when engaging the models to determine the impact of prompt engineering. Test methodology included a comparison between model engagement with standalone queries versus multi-turn conversations to provide insights on model effectiveness based on user tendencies.

Co-Authors

P. Raynor, University of Minnesota, Minneapolis, MN, USA.

Acknowledgements & References

none

View Proposal 1676

Keywords

Education and training