Nie jesteś zalogowany | Zaloguj się

Prelegent(ci): Konrad Staniszewski
Afiliacja: Uniwersytet Warszawski
Termin: 1 czerwca 2023 12:15
Informacje na temat wydarzenia: https://uw-edu-pl.zoom.us/j/92107522918
Seminarium: Seminarium "Uczenie maszynowe"

Large language models store their knowledge in parameters and require costly fine-tuning to update. An interesting alternative is to provide new knowledge in the model's context. However, typical models have relatively short context lengths.
In this presentation, I will discuss one of the potential solutions to this problem - retrieval augmented transformer models. Those models utilize a large external database to store information about already processed parts of the text and retrieve most matching entries to improve the performance during the inference.

Retrieval Augmented Language Models

Kariera

Strona internetowa