Kort om seminariet på engelska

Natural language models are a technology associated with Artificial Intelligence (AI) that are increasingly being used within society to perform various tasks. While traditional tasks include spelling auto-correct, audio-to-text conversion, speech recognition and machine translation, the models are becoming increasingly powerful and their sphere of operation increasingly wider. These models are able to identify patterns and hidden insights in data sets too large for humans to manage. While these models can be put to good uses, such as extracting insights from health data, in the wrong hands or used for unintended purposes, they potentially also pose a danger to society.

Natural language models can be described as a field of study within Applied Language Technology, which concerns how computers and other digital devices analyse, produce, modify and respond to human texts and speech. At the heart of these models lie advanced algorithms that, having learned the rules associated with a specific natural language, are then able to apply them not only to predict text but even produce new text.

A language model that is gaining attention is called ‘GPT3’ (Generative Pre-trained Transformer 3). Developed by a private company this language model uses 175 billion machine learning parameters in its operations. The unique aspect of GPT3, besides extracting knowledge from texts, is its ability to produce texts of such a high quality that it is impossible to identify if written by human or machine. This can pose multiple challenges in many sectors. In the university context for example, how do we know that the texts students produce have not been written by a natural language model? Would plagiarism systems be able to detect this? To what extent could language models trick AI analytical tools being used in higher education to gauge the performance of students?

Helping us to answer these questions and many more are our distinguished speakers Jussi Karlgren and Magnus Sahlgren. They will help us to understand more about what natural language models are, how they work, what advantages they hold but also what potential risks they bring with their increased use within society.

Jussi Karlgren researches linguistic use and stylistic variation in language and how it can be represented and used as support to find what one wants to read or listen to. He is an associate professor of language technology at the University of Helsinki and a principal research scientist at Spotify.
Magnus Sahlgren is a computational linguist whose research is centered around questions about what it means to understand language, and how we can build machines with such capacity. Sahlgren has worked on computational models of meaning for the last 20 years, and he currently leads the research on natural language understanding and language models at RISE and at AI Sweden.


Observera att seminariet genomförs i hybridformat. Det finns möjlighet att delta via Zoom, men det finns också ett begränsat antal fysiska platser. Om du vill delta på plats, mejla Stanley Greenstein senast klockan 12.00 onsdag 27 oktober. Seminariet genomförs hos Institutionen för data- och systemvetenskap, adressen är Borgarfjordsgatan 12, Kista (Nodhuset). Ta hiss E till tredje våningen och vänta där på att bli insläppt.

Seminariet arrangeras av DHV-hubb, en samlingsplats för forskare vid Stockholms universitet som är intresserade av digital humanvetenskap. DHV-seminarierna är tvärvetenskapliga och öppna för alla forskare som intresserar sig för digitala artefakter och miljöer, samt deras betydelse för samhället och mänskligheten. Dela gärna denna inbjudan med kollegor som kan vara intresserade.