Retrieval Augmented Language Models
- Speaker(s)
- Konrad Staniszewski
- Affiliation
- University of Warsaw
- Date
- 1 June 2023, 12:15
- Event information
- https://uw-edu-pl.zoom.us/j/92107522918
- Seminar
- "Machine Learning" seminar
Large language models store their knowledge in parameters and require costly fine-tuning to update. An interesting alternative is to provide new knowledge in the model's context. However, typical models have relatively short context lengths.
In this presentation, I will discuss one potential solution to this problem: retrieval-augmented transformer models. These models use a large external database to store information about parts of the text that have already been processed, and retrieve the best-matching entries to improve performance during inference.
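
The store-and-retrieve idea described above can be illustrated with a minimal sketch. It is not taken from the talk: the `ChunkMemory` class, the hand-written three-dimensional vectors, and the use of cosine similarity are all assumptions made for illustration. Real systems typically use learned embeddings and approximate nearest-neighbor search over a much larger database.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class ChunkMemory:
    """Hypothetical external memory: stores (key vector, text chunk) pairs."""

    def __init__(self):
        self.entries = []

    def add(self, key, chunk):
        # Store a representation of an already processed piece of text.
        self.entries.append((key, chunk))

    def retrieve(self, query, k=2):
        # Rank all stored entries by similarity to the query, keep the top k.
        ranked = sorted(self.entries, key=lambda e: cosine(query, e[0]), reverse=True)
        return [chunk for _, chunk in ranked[:k]]


memory = ChunkMemory()
# Toy key vectors written by hand (assumption); a model would compute these.
memory.add([1.0, 0.0, 0.0], "chunk about topic A")
memory.add([0.0, 1.0, 0.0], "chunk about topic B")
memory.add([0.9, 0.1, 0.0], "another chunk about topic A")

# At inference time, the current context is turned into a query vector and the
# closest stored chunks are fetched to be placed in (or attended to by) the model.
retrieved = memory.retrieve([1.0, 0.05, 0.0], k=2)
print(retrieved)
```

In this sketch the exhaustive sort stands in for the database lookup; it only shows the interface (add entries, then query for the closest ones), not how any particular retrieval-augmented model integrates the results.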