Querying the Deep Web: old and new perspectives
- Speaker(s)
- Andrea Cali
- Affiliation
- Birkbeck College, University of London
- Date
- Nov. 4, 2015, 2:15 p.m.
- Room
- room 5870
- Seminar
- Seminar Automata Theory
The Deep Web is constituted by data that are accessible on the
web, typically through HTML forms, but are not indexable by
search engines due to their static nature. Processing queries on
Deep Web data poses significant challenges as data sources cannot
be normally accessed with arbitrary queries. In this talk we
illustrate techniques for processing queries on the Deep Web and
we survey some of the core problems underlying this task. We
then propose a novel framework for identifying relevant sources
in this context. This work has been carried out with Igor Razgon
and Umberto Straccia.