Speaker(s): Konrad Staniszewski
Affiliation: Uniwersytet Warszawski
Date: June 1, 2023, 12:15 p.m.
Information about the event: https://uw-edu-pl.zoom.us/j/92107522918
Seminar: Seminarium "Machine Learning"

Large language models store their knowledge in their parameters, so updating that knowledge requires costly fine-tuning. An interesting alternative is to provide new knowledge in the model's context. However, typical models have relatively short context lengths.
In this presentation, I will discuss one potential solution to this problem: retrieval-augmented transformer models. These models use a large external database to store information about already processed parts of the text and retrieve the best-matching entries to improve performance during inference.
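
To make the store-and-retrieve idea concrete, here is a minimal sketch in Python. It is not the method presented in the talk: the names `ChunkMemory`, `embed`, and `cosine` are hypothetical, and the bag-of-words embedding stands in for the learned key and query representations that a real retrieval-augmented transformer would use. The retrieved chunks would then be made available to the model, for example by attending over them.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding (assumption): a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class ChunkMemory:
    """Hypothetical external database of already processed text chunks."""

    def __init__(self):
        self.keys = []    # one embedding per stored chunk
        self.chunks = []  # the stored text itself

    def add(self, chunk):
        """Store a processed chunk together with its key embedding."""
        self.keys.append(embed(chunk))
        self.chunks.append(chunk)

    def retrieve(self, query, k=2):
        """Return the k stored chunks most similar to the query."""
        q = embed(query)
        scores = [cosine(q, key) for key in self.keys]
        ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
        return [self.chunks[i] for i in ranked[:k]]


# Example: fill the memory with earlier text, then query it.
memory = ChunkMemory()
memory.add("the cat sat on the mat")
memory.add("transformers use attention layers")
memory.add("the dog chased the cat")
top = memory.retrieve("attention in transformers", k=1)
print(top)
```

The exhaustive scan over all keys is acceptable only for a toy example; at the database sizes mentioned above, an approximate nearest-neighbour index would typically replace it.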

Retrieval Augmented Language Models
