
September 11 at 4:00PM
Harvard TH Chan School of Public Health, FXB-301
Junwei Lu, PhD
Associate Professor of Biostatistics, Harvard TH Chan School of Public Health
Human preference alignment has been shown to be effective in training the large language models (LMs). It allows the LLM to understand human feedback and preferences. Despite the extensive literature dealing with algorithms aligning the rank of human preference, uncertainty quantification for the ranking estimation still needs to be explored and is of great practical significance. For example, it is important to overcome the problem of hallucination for LLM in the medical domain, and an inferential method for the ranking of LM answers becomes necessary. In this talk, we will present a novel framework called “Fisher random walk” to conduct semi-parametric efficient preference inference for language models and illustrate its application in the language models for medical knowledge.