Evaluation of Fairness and Factualness with LLM

Developed a zero-shot evaluation framework using Microsoft Phi-2 and Mistral-7B, incorporating GPT-generated evidence and chain-of-thought reasoning to assess fairness and factuality in textual claims, achieving a 76% accuracy rate.