Diagnosing Bias and Instability in LLM Evaluation: A Scalable Pairwise Meta-Evaluator | Synapse