Retrieval Augmented Language Models
- Speaker(s)
- Konrad Staniszewski
- Affiliation
- Uniwersytet Warszawski
- Date
- June 1, 2023, 12:15 p.m.
- Information about the event
- https://uw-edu-pl.zoom.us/j/92107522918
- Seminar
- Seminarium "Machine Learning"
Large language models store their knowledge in parameters and require costly fine-tuning to update. An interesting alternative is to provide new knowledge in the model's context. However, typical models have relatively short context lengths.
In this presentation, I will discuss one of the potential solutions to this problem - retrieval augmented transformer models. Those models utilize a large external database to store information about already processed parts of the text and retrieve most matching entries to improve the performance during the inference.