Large language models (LLMs) present new opportunities for decision support in geotechnical engineering but remain prone to uncertainty and domain-specific hallucinations. This paper develops a semantic entropy framework to diagnose and mitigate such uncertainty in geotechnical design. The proposed semantic entropy framework decomposes LLM outputs into atomic factoids, evaluates their entropy through entailment-based clustering of diverse sampled answers, and refines uncertain factoids via targeted reprompting, knowledge-based verification, and calibrated hedging. A Reliability Index (RI) is also proposed to merge aggregated entropy measures with BERT similarity to empirical sources, enabling task-specific reliability estimation. Experiments in foundation design and soil consolidation testing domains are adopted for validation, which highlights distinct uncertainty distribution profiles linked to domain standardization. Results also show successful remediation of 82-88% of initially hallucinated responses while preserving 100% of clean outputs. Reprompting emerges as the most effective mitigation strategy, achieving 39-46% entropy reduction. The RI demonstrates strong predictive capability (R²=0.88) in identifying reliable LLM responses. Overall, the framework provides a practical decision-support mechanism for integrating uncertainty-aware LLM outputs into geotechnical design practice.
Building similarity graph...
Analyzing shared references across papers
Loading...
Njock et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69fd7f25bfa21ec5bbf07815 — DOI: https://doi.org/10.1139/cgj-2025-0648
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:
Pierre Guy Atangana Njock
Zhen-Yu Yin
Hao-Ruo Xu
Canadian Geotechnical Journal
Hong Kong Polytechnic University
Istituto di Scienze Marine del Consiglio Nazionale delle Ricerche
Building similarity graph...
Analyzing shared references across papers
Loading...