February 23, 2024 · Open Access

Using early LLMs for corpus linguistics: Examining ChatGPT's potential and limitations

Abstract

This study evaluates the extent to which information can be obtained from early Large Language Models (LLMs) for corpus linguistic research. Various tasks were conducted using ChatGPT 3.5, such as generating word frequency lists, collocations, words that fit certain grammatical patterns, and identifying genres. These were then compared with the search results from a large-scale general corpus (COCA). While favorable results were not achieved in identifying the genres of words or paragraphs, there was notable congruence in the frequency lists (75.0%), collocations (42.8%), and grammatical patterns (53.0%) for the top 20 items. Even when the generated items did not perfectly match those from COCA, it was evident that high-frequency items were produced. Although LLMs may not be sufficient for rigorous academic research, the results are adequate for discerning overall trends or assisting learners. In addition, the results of this study show that the ability to search at the phrase level is an advantage of using LLMs for corpus research.
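The congruence figures above compare the top 20 items from ChatGPT with the top 20 from COCA. A minimal sketch of one way such a figure could be computed follows, assuming congruence means the percentage of generated items that also appear in the reference list. The abstract does not spell out the exact matching procedure, so the function name `top_n_overlap`, the lowercasing and deduplication steps, and the toy word lists are illustrative assumptions rather than the author's method or data.

```python
def top_n_overlap(generated, reference, n=20):
    """Percentage of the first n generated items that also occur
    among the first n reference items (hypothetical helper)."""
    # Normalize and deduplicate the generated list, preserving its order.
    gen = list(dict.fromkeys(w.strip().lower() for w in generated))[:n]
    # The reference side only needs membership tests, so a set suffices.
    ref = {w.strip().lower() for w in reference[:n]}
    if not gen:
        return 0.0
    shared = [w for w in gen if w in ref]
    return 100.0 * len(shared) / len(gen)


# Toy lists invented for illustration; not taken from the study or from COCA.
llm_items = ["the", "of", "and", "to"]
corpus_items = ["the", "be", "and", "a"]
print(top_n_overlap(llm_items, corpus_items, n=4))  # 50.0
```

This measure ignores rank order inside the top n, so two lists holding the same items in a different order still score 100%. A rank-sensitive comparison would need a different statistic.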

Cite this study

Satoru Uchida studied this question.

www.synapsesocial.com/papers/68e77c94b6db6435876f0fcd — DOI: https://doi.org/10.1016/j.acorp.2024.100089

Also consider

Synapse has enriched 5 closely related papers on similar research questions. Consider them for comparative context:

Exploring the potential of using an AI language model for automated essay scoring · 2023 · 516 citations
The Routledge Handbook of Corpus Linguistics · 2022 · 140 citations
Do large language models resemble humans in language use? · 2023 · 49 citations
Developing the Academic Collocation List (ACL) – A corpus-driven and expert-judged approach · 2013 · 283 citations
Writing with ChatGPT: An Illustration of its Capacity, Limitations & Implications for Academic Writers · 2023 · 123 citations

Authors

Satoru Uchida

Journals

Applied Corpus Linguistics

Institutions

Kyushu University
