Anyanyelv, MI és jog: Hogyan tud majd magyarul a mesterséges intelligencia?

Mihály Gábor Kolláth

doi:10.59648/filologia.2025.3-4.10

Language, AI and Law

How Much Hungarian Will Artificial Intelligence Speak?

Mihály Gábor Kolláth

doi: 10.59648/filologia.2025.3-4.10

Abstract

Artificial intelligence is one of the most promising technologies of our time, but also one that poses many risks. In recent years, we have ceded more and more of the control and organisation of our lives to this technology. There is an increasing number of AI-generated content, text and images on the Internet, and we are using it more and more often for study, work and leisure. This paper presents the history, functioning and market relevance of one of the most common types of AI, the large language models, and describes the geopolitical risks of the technology, with a particular focus on language preservation and maintenance. Such risks include the uncontrolled nature of the texts used to train AI, their linguistic homogeneity, the preference for certain cultures, and the impact on the thinking and behaviour of users of AIs taught in this way, which favour certain languages and worldviews.

Keywords:

artificial intelligence large language models language policy geopolitics law

How to Cite

Kolláth, M. G. (2025). Language, AI and Law: How Much Hungarian Will Artificial Intelligence Speak?. Filológia.Hu, 16(3–4), 134–146. https://doi.org/10.59648/filologia.2025.3-4.10

References

Szakirodalom

Alammar, Jay – Grootendorst, Maarten 2024: Hands-On Large Language Models – Language Understanding and Generation. Sebastopol: O’Reilly Media.

Alkaissi, Hussam – McFarlane, Samy I. 2023: Artificial Hallucinations in ChatGPT: Implications in Scientific Writing, Cureus, 15/2: e35179. Online: https://doi.org/10.7759/cureus.35179

Bass, Dina – Ghaffary Shirin 2025: Microsoft Probing If DeepSeek-Linked Group Improperly Obtained OpenAI Data. Bloomberg, 2025. január 29. Online: https://www.bloomberg.com/news/articles/2025-01-29/microsoft-probing-if-deepseek-linked-group-improperly-obtained-openai-data?embedded-checkout=true

Bender, Emily M. – Friedman, Batya 2018: Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science. Transactions of the Association for Computational Linguistics 2018/6: 587–604. Online: https://doi.org/10.1162/tacl_a_00041

Berryman, John – Ziegler, Albert 2025: Prompt Engineering for LLMs – The Art and Science of Building Large Language Model–Based Applications. Sebastopol: O’Reilly Media.

Cabello, Adria 2023: The Evolution of Language Models: A Journey Through Time. Medium, 2023. november 2. Online: https://medium.com/@adria.cabello/the-evolution-of-language-models-a-journey-through-time-3179f72ae7eb

Caelen, Olivier – Blete, Marie-Alice 2024: Developing Apps with GPT-4 and ChatGPT – Build Intelligent Chatbots, Content Generators, and More. Sebastopol: O’Reilly Media.

Chang, Susan Shu 2024: Machine Learning Interviews Kickstart Your Machine Learning and Data Career. Sebastopol: O’Reilly Media.

Cheng, Adam – Calhoun, Aaron – Reedy, Gabriel 2025: Artificial intelligence assisted academic writing: recommendations for ethical use. Advances in Simulation 10/22. Online: https://doi.org/10.1186/s41077-025-00350-6

Common Craw [é. n.]: Statistics of Common Crawl Monthly Archives, Common Craw, [é. n.]. Online: https://commoncrawl.github.io/cc-crawl-statistics/plots/languages

Copeland, B. J. 2025: History of artificial intelligence. Encyclopedia Britannica, 2025. február 25. Online: https://www.britannica.com/science/history-of-artificial-intelligence

Dwivedi, Yogesh K. – Kshetri, Nir – Hughes, Laurie – Slade, Emma Louise – Jeyaraj, Anand – Kar, Arpan Kumar – Baabdullah, Abdullah M. – Koohang, Alex – Raghavan, Vishnupriya – Ahuja, Manju – Albanna, Hanaa – Albashrawi, Mousa Ahmad – Al-Busaidi, Adil S. – Balakrishnan, Janarthanan – Barlette, Yves – Basu, Sriparna – Bose, Indranil – Brooks, Laurence – Buhalis, Dimitrios... Wright, Ryan, 2023: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy. International Journal of Information Management 71: 102642. Online: https://doi.org/10.1016/j.ijinfomgt.2023.102642

Fan, Shelly 2020: Lecserél-e minket a mesterséges intelligencia? Budapest: Scolar Kiadó.

Foster, David 2023: Generative Deep Learning – Teaching Machines to Paint, Write, Compose, and Play. Sebastopol: O’Reilly Media.

Gazdasági Versenyhivatal 2023: A GVH megvizsgálja, hogy a Microsoft miként tájékoztatja a fogyasztókat új keresőszolgáltatásáról. Gazdasági Versenyhivatal, 2023. július 25. Online: https://gvh.hu/sajtoszoba/sajtokozlemenyek/2023-as-sajtokozlemenyek/a-gvh-megvizsgalja-hogy-a-microsoft-mikent-tajekoztatja-a-fogyasztokat-uj-keresoszolgaltatasarol

Gervai Pál – Trautmann László 2003: Filozófiai irányzatok és jövőképek – Az Ókor. In: Nováky Erzsébet – Kristóf Tamás: Ókori jövőképek és jövőkutatás a vállalati gyakorlatban. Budapest: Budapesti Közgazdaságtudományi és Államigazgatási Egyetem Jövőkutatási Kutatóközpont.

Gyekiczky Tamás 2021: A digitális társadalom olvasatai – VII. Jogászvilág, 2021. január 7. Online: https://jogaszvilag.hu/vilagjogasz/a-digitalis-tarsadalom-olvasatai-vii/

Jelencsik-Mátyus Kinga – Héja Enikő – Varga Zsófia – Váradi Tamás 2022: D1.18 Report on the Hungarian Language. European Language Equality, 2022. február 28. Online: https://european-language-equality.eu/wp-content/uploads/2022/03/ELE___Deliverable_D1_18__Language_Report_Hungarian.pdf

Kiss Tamás 2024: A jövő marketingosztálya: az AI és emberi vezetés harmóniája. Kontext, 2024. szeptember 16. Online: https://kontext.hu/blog/marketingosztaly-ai/?utm_term=&utm_campaign=SN+-+DSA&utm_source=adwords&utm_medium=ppc&hsa_acc=6228439794&hsa_cam=12063221206&hsa_grp=121917755691&hsa_ad=491046641655&hsa_src=g&hsa_tgt=dsa-39587879683&hsa_kw=&hsa_mt=&hsa_net=adwords&hsa_ver=3&gad_source=1&gbraid=0AAAAABq50Fdg4d8GtrSAvqocXnvvmkgpS&gclid=Cj0KCQjw2ZfABhDBARIsAHFTxGzB0pIsQvI1Mh8Q6SHt-Q_sC99o3lVTxfI7abejrrcGQgaPTcfIgkgaAjRlEALw_wcB

Koltay András 2019: A social media platformok jogi státusa a szólásszabadság nézőpontjából. In: Medias Res 2019/1: 1–56.

Manning, Christopher D. 2022: Human Language Understanding & Reasoning, Daedalus 151/2: 127–138. Online: https://doi.org/10.1162/daed_a_01905

Mayor, Adrienne 2018: Tyrants and Robots in Ancient Greece. History Today, 2018. november 11. Online: https://www.historytoday.com/archive/feature/tyrants-and-robots-ancient-greece

Mayor, Adrienne 2020: Gods and Robots Myths, Machines, and Ancient Dreams of Technology. Princeton: Princeton University Press.

Natale, Simone 2018: If software is narrative: Joseph Weizenbaum, artificial intelligence and the biographie of ELIZA. New Media & Society 21/3: 712–728. Online: https://doi.org/10.1177/1461444818804980

Prószéky Gábor – Olaszy Gábor – Váradi Tamás 2006: Nyelvtechnológia. In: Kiefer Ferenc (szerk.): Magyar nyelv. Budapest: Akadémiai Kiadó.

Prószéky Gábor 2018: A nyelvtudomány találkozásai a gépi intelligenciával In: Tolcsvai Nagy Gábor (szerk.): A humán tudományok és a gépi intelligencia. Budapest: Gondolat Kiadó.

Szalay Klára 2023: Mesterséges intelligencia, szabályozási irányok. Elemzés Országgyűlési képviselők részére. Parlament, 2023. szeptember 20. Online: https://www.parlament.hu/documents/d/guest/mesterseges-intelligencia

Tóth András 2022: A digitális állam információbiztonsági kihívásai. Budapest: Ludovika Egyetemi Kiadó.

Tranquillin, Marco – Lakshmanan, Valliappa – Tekiner, Firat 2023: Architecting Data and Machine Learning Platforms – Enable Analytics and AI-Driven Innovation in the Cloud. Sebastopol: O’Reilly Media.

Turing, Alan 1936: On Computable Numbers, with an Application to the Entscheidungsproblem. Proceedings of the London Mathematical Society 2/42: 230–267. Online: https://doi.org/10.1112/plms/s2-42.1.230

Uspenskyi, Serhii 2025: Large Language Model Statistics and Numbers. Springs, 2025. február 21. Online: https://springsapps.com/knowledge/large-language-model-statistics-and-numbers-2024

Weizenbaum, Joseph 1966: ELIZA – a computer program for the study of natural language communication between man and machine. Communications of the ACM 9/1: 36–45. Online: https://doi.org/10.1145/365153.365168

Források

A Büntető Törvénykönyvről szóló 2012. évi C. törvény (Röviden: Btk.)

Európai Parlament 2018: Az Európai Parlament 2018. szeptember 11-i állásfoglalása a nyelvek közötti egyenlőségről a digitális korban (2018/2028(INI)) (2019/C 433/07)

Európai Parlament 2021: Mesterséges intelligencia az oktatásban, a kulturális és az audiovizuális ágazatban – Az Európai Parlament 2021. május 19-i állásfoglalása a mesterséges intelligencia alkalmazásáról az oktatásban, a kulturális és az audiovizuális ágazatban (2020/2017(INI)) (2022/C 15/04)

Gazdasági Versenyhivatal 2024: A mesterséges intelligencia piaci versenyre és fogyasztókra gyakorolt hatásainak vizsgálata – A Gazdasági Versenyhivatal AL/234/2024. számú piacelemzésének eredményeként készült tanulmány. Gazdasági Versenyhivatal, 2024. október 21. Online: https://gvh.hu/pfile/file?path=/dontesek/agazati_vizsgalatok_piacelemzesek/piacelemzesek/mesterseges-intelligencia_piacelemzes_tanulmany_2024_10_21&inline=true

Gazdasági Versenyhivatal 2025: Magyarul tanítja a Microsoft a mesterséges intelligenciát a GVH-nak köszönhetően. Gazdasági Versenyhivatal, 2025. május 30. Online: https://gvh.hu/sajtoszoba/sajtokozlemenyek/2025-os-sajtokozlemenyek/magyarul-tanitja-a-microsoft-a-mesterseges-intelligenciat-a-gvh-nak-koszonhetoen

Magyar Hírlap 2024: A mesterséges intelligencia és a nyelvhasználat közötti kapcsolat. Magyar Hírlap, 2024. március 27. Online: https://www.magyarhirlap.hu/tech/20240327-a-mesterseges-intelligencia-es-a-nyelvhasznalat-kozotti-kapcsolat

Microsoft [é. n.]: Mik a nagy nyelvi modellek (LLM-ek)? Microsoft Azure, [é. n.]. Online: https://azure.microsoft.com/hu-hu/resources/cloud-computing-dictionary/what-are-large-language-models-llms

OpenAI [é. n.]: How ChatGPT and our foundation models are developed. OpenAI, [é. n.]. Online: https://help.openai.com/en/articles/7842364-how-chatgpt-and-our-foundation-models-are-developed

Prószéky Gábor 2024: Veszélyben van-e a magyar nyelv? Mit hoz a digitális világ? InfoRádió, Aréna (Interjú). 2024. augusztus 22. Online: https://www.youtube.com/watch?v=WzNTXn0gZ3Y

Speer, Robyn 2017: ConceptNet Numberbatch 17.04: better, less-stereotyped word vectors. ConceptNet Blog, 2017. április 17. Online: https://blog.conceptnet.io/posts/2017/conceptnet-numberbatch-17-04-better-less-stereotyped-word-vectors/