February 19, 2024Open Access

Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Key Points

Significant limitations in temporal reasoning were found, suggesting that these models struggle with understanding event sequences.
Models exhibited knowledge gaps more often, with a notable trend linked to their level of uncertainty awareness.
Assessment using the TempUN dataset reveals that fine-tuning does not result in major performance improvements for these models' temporal reasoning capabilities.

Abstract

Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited. This hinders their application in real-world scenarios where understanding the sequential nature of events is crucial. This paper experiments with state-of-the-art models on a novel, large-scale temporal dataset, TempUN, to reveal significant limitations in temporal retention and reasoning abilities. Interestingly, closed-source models indicate knowledge gaps more frequently, potentially suggesting a trade-off between uncertainty awareness and incorrect responses. Further, exploring various fine-tuning approaches yielded no major performance improvements. The associated dataset and code are available at the following URL (https: //github. com/lingoiitgn/TempUN).

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Himanshu Beniwal

Kowsik Nandagopan D

Mayank Singh

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider