The trade-off between robustness and reliability in chinese legal large language models: an empirical study | Synapse