Vai al contenuto| Home page|

   Ti trovi in: HOME »Programmi, progetti e risultati »I progetti »PRIN - Programmi di ricerca di Rilevante Interesse Nazionale»Programma di ricerca
INIZIO_TESTO_DA_INDICIZZARE

RESEARCH PROGRAM

italiano - inglese
Similar research programs:
Scientific and education field classification
International Patent Classification
Geographical classification
Bibliografia
[A98]B.Adelberg.NoDoSE.A tool for semi-automatically extracting structured and semistructured data from text documents.SIGMOD98
[ABS04]S.Amer-Yahia,C.Botev,J.Shanmugasundaram.TeXQuery: A Full-Text Search Extension to XQuery.WWW04
[AG03]A.Arasu,H.Garcia-Molina.Extracting Structured Data from Web Pages.SIGMOD03
[AVF+98]S.Abiteboul,V.Vianu,B.Fordham,Y.Yesha.Relational Transducers for Electronic Commerce.PODS98
[BBC+98]P.Bernstein et. al. The asilomar report on database research,1998. [BCD+03]D.Berardi,D.Calvanese,G.De Giacomo,M.Lenzerini,M.Mecella.Automatic Composition of E-Services that Export their Behavior.ICSOC03
[BCG+05]D.Berardi,D.Calvanese,G.De Giacomo,R.Hull,M.Mecella.Automatic composition of transition based semantic web services with messaging.VLDB05
[BFG01]R.Baumgartner,S.Flesca,G.Gottlob.Supervised Wrapper Generation with Lixto. VLDB01
[BFHS03]T.Bultan,X.Fu,R.Hull,J.Su.Conversation specification: a new approach to design and analysis of e-service composition.WWW03
[BLR97]C.Beeri,A.Y.Levy,M.C.Rousset.Rewriting queries using views in description logics.PODS97
[C03]A.Calì.Reasoning in data integration systems: Why LAVand GAV are siblings.ISMIS03
[CC02]A.Calì,D.Calvanese.Optimized querying of integrated data over the Web.EISIC02
[CCDL01]A.Calì,D.Calvanese,G.De Giacomo,M. Lenzerini.Accessing data integration systems through conceptual schemas.ER01
[CDL01a]D.Calvanese,G.De Giacomo,M.Lenzerini.A framework for ontology integration.SWWS01
[CDL01b] D. Calvanese, G. De Giacomo, M. Lenzerini. Ontology of integration and integration of ontologies. Description Logic Workshop 2001
[CDLV00]D.Calvanese,G.De Giacomo,M.Lenzerini,M.Y.Vardi.View-based query processing and constraint satisfaction.LICS00
[CGL+98]D.Calvanese,G.De Giacomo,M.Lenzerini,D.Nardi,R.Rosati.Information integration: Conceptual modeling and reasoning support.CoopIS98
[CHK01]V.Christophides,R.Hull,A.Kumar.Querying and Splicing of XML Workflows.
[CM01]V.Crescenzi,G.Mecca,P.Merialdo.RoadRunner: Towards Automatic Data Extraction from Large Web Sites.VLDB01
[CM04]V.Crescenzi,G.Mecca.Automatic information extraction from large websites. J. of the ACM,51(5),2004
[CS01]F.Casati,M.Shan.Dynamic and adaptive composition of e-services. In Information Systems 26(3),2001
[DCFS04]S.Das,E.I.Chong,G.Eadon,J.Srinivasan.Supporting Ontology-Based Semantic matching in RDBMS.VLDB04
[DDH01]A.Doan,P.Domingos,A.Y.Halevy.Reconciling Schemas of Disparate Data Sources: A Machine Learning Approach.SIGMOD01
[DHM04]X.Dong,A.Y.Halevy,J.Madhavan,E.Nemes,J.Zhang.Similarity Search for Web Services. VLDB04
[DL97]O.M.Duschka,A.Y.Levy.Recursive plans for information gathering. IJCAI97
[DLN05]A.Deutsch,B.Ludascher,A.Nash.Rewriting queries using views with access patterns under integrity constraints.ICDT05
[DR02]H.H.Do,E.Rahm.COMA-A System for Flexible Combination of Schema Matching Approaches.VLDB02.
[DSV04]A.Deutsch,L.Sui,V.Vianu.Specification and verification of data-driven web services.PODS04
[E02]C.M.Eastman.30,000 hits may be better than 300: Precision anomalies in Internet
searches. J. ASIST 53,11,2002
[ECJL+99] D.W.Embley, M.D.Campbell, Y.S.Jiang, S.W.Liddle, Y.K.Ng, D.Quass, R.D.Smith. Conceptual-model-based data extraction from multiple-record Web pages. Data Knowl.Eng.99
[F98] D.Freitag. Information extraction from html: Application of a general learning approach. AAAI98
[FBS04]X.Fu,T.Bultan,J.Su.Analysis of interacting BPEL web services.2004.
[FGK02]D.Florescu,A.Gruenhagen,D.Kossmann.XL: A Programming Language for Web Service Specification and Composition.WWW02
[FKMP03]R.Fagin,P.G.Kolaitis,R.J.Miller,L.Popa.Data exchange: Semantics and query answering.ICDT03
[FLM99]M.Friedman,A.Y.Levy,T.Millstein. Navigational plans for data integration.AAAI99
[FLMS99]D,Florescu,A.Y.Levy,I.Manolescu,D.Suciu.Query optimization in the presence of limited access patterns.SIGMOD99
[GLR00]F.Goasdoue,V.Lattes,M.C.Rousset.The use of CARIN language and algorithms for information integration: the picsel system.Int. J. on Cooperative Information Systems,2000
[GRGK97]V.N.Gudivada,V.V.Raghavan,W.I.Grosky,R.Kasanagottu.Information retrieval on the World Wide Web. IEEE Internet Comput.Sept-Oct,1997
[GWG96]S.Gauch,G.Wang,M.Gomez.Profusion: Intelligent fusion from multiple, different search engines. J. Univ. Comput.Sci.2,9,Sept,997
[H04]Y.Halevy.Structures, Semantics and Statistics.VLDB04
[HBCS03]R.Hull,M.Benedikt,V.Christophides,J.Su.E-Services: A Look Behind the Curtain.PODS03
[HDIM03]A.Halevy,O.E.A.Doan,Z.Ives,J.Madhavan.Crossing the structure chasm.CIDR03
[J00]B.J.Jansen.The effect of query complexity on web searching results.Inf. Res.,6(1),2000.
[JLVV99]M.Jarke,M.Lenzerini,Y.Vassiliou,P.Passiliadis,editors.Fundamentals of Data Warehouses.Springer1999.
[JQC+00]M.Jarke,V.Quix,D.Calvanese,M.Lenzerini,E.Franconi,S.Ligoudistiano,P.Vassiliadis,Y.Vassiliou.Concept based design of data warehouses: The DWQ demonstra-tors.SIGMOD00
[L02]M.Lenzerini.Data integration: A theoretical perspective.PODS02
[LC00]C.Li,E.Chang.Query planning with limited source capabilities.ICDE00
[LC01]C.Li,E.Chang.On answering queries in the presence of limited access patterns.ICDT01.
[LGMK04]K.Lerman,C.Gazen,S.Minton,C.A.Knoblock.Populating the semantic web.AAAI04 Workshop on Advances in Text Extraction and Mining
[LSK95]A.Y.Levy,D.Srivastava,T.Kirk.Data model and query evaluation in global information systems. J. of Intelligent Information Systems,5,1995.
[LT02]W.Lucas,H.Topi.Form And Function: The Impact Of Query Term And Operator Usage On Web Search Results. J. Asist 53,2,2002
[MBR01]J.Madhavan,P.A.Bernstein,E.Rahm.Generic Schema Matching with Cupid.VLDB01.
[MBR05]J.Madhavan,P.A.Bernstein,A.H.Doan,A.H.Halevy.Corpus-based Schema Matching.ICDE05.
[MGR02]S.Melnik,H.Garcia-Molina,E.Rahm.Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching.ICDE02
[MM03]S.A.McIlraith,D.L.Martin.Bringing semantics to Web Services.IEEE Intelligent Systems,18(1):90-93,2003
[MMK99]I.Muslea,S.Minton,C.A.Knoblock.A hierarchical approach to wrapper induction.Conference on Autonomous Agents 1999.
[MS02]S.A.McIlraith,T.Cao Son.Adapting Golog for composition of Semantic Web services.KR02
[MZ98]T.Milo,S.Zohar.Using Schema Matching to Simplify Heterogeneous Data Translation. VLDB98
[N04]N.F.Noy.Semantic Integration.A Survey Of Ontology-Based Approaches.SIGMOD Record33(4),2004
[NL04]A.Nash,B.Ludascher.Processing first-order queries under limited access patterns.PODS04
[RB01]E.Rahm,P.A.Bernstein.A survey of approaches to automatic schema matching.VLDB J. 10(4),2001
[RSU95]A.Rajaraman,Y.Sagiv,J.D.Ullman.Answering queries using templates with binding patterns.PODS95
[S99]S.Soderrland.Learning information extraction rules for semistructured and free text. Mach. Learn.99.
[SDWG95]M.A.Sheldon,A.Duda,R.Weiss,D.K.Gifford.Discover: A resource discovery system based on content routing.WWW95.
[SO00]W.Sadiq,M.Orlowska.Analyzing Process Models Using Graph Reduction.In Information Systems 25(2):2000
[SPW+04]E.Sirin,B.Parsia,D.Wu,J.A.Hendler,D.S.Nau.Htn planning for web service composition using shop2. Journal of Web Semantics,1(4),2004
[SS04]S.Staab,R.Studer(Editors).Handbook on Ontologies, Springer 2004
[TP04]P.Traverso,M.Pistore.Automated composition of semantic web services into executable processes. Semantic Web Conference04
[U97]J.D.Ullman.Information integration using logical views.ICDT97
[UG04]M.Uschold,M.Grunninger. Ontologies and Semantics for Seamless Connectivity. SIGMOD Record 33(4),2004
[VMP04]Y.Velegrakis,R.J.Miller,L.Popa.Preserving mapping consistency under schema changes. VLDB J.13(3),2004
[WGST04]G.Weikum,J.Graupmann,R.Schenkel,M.Theobald.Towards a Statistically Semantic Web.ER2004
[WMB94]I.H.Witten,A.Moffat,T.C.Bell.Managing Gigabytes: Compressing and Indexing Documents and Images.Von Nostrand Reinhold,New York,1994.
[WYDM04]W.Wu,C.T.Yu,A.Doan,W.Meng.An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web.SIGMOD04
Keywords
WEB SERVICE, WEB SEARCH, JOIN, ONTOLOGIES, WRAPPER

New technologies and tools for the integration of Web search services

Politecnico di Milano
Abstract
The current evolution of the Web is characterized by an increasing number of search engines and query interfaces, ranging from generic ones (Google) to domain-specific ones (geo-localization services or on-line catalogs). Meanwhile, wrapping technology is evolving so as to enable the development of specialized services extracting content from data-intensive Web sites (wrappers of sites delivering bond quotes), and exposing them as Web Services.

While an increasing amount of search services on the Web becomes available, they still work in isolation; their intrinsic limit is the inability to support complex queries ranging over multiple domains. Queries such as “search all vegetarian restaurants close to Milan” require combining search engines specialized over different domains, such as geographic locations and restaurants. The focus of this research proposal is to contribute to the development of a new generation search engine (NGS) which integrates known services and provides the user with a single interface.

The focus of the project is on technology integration and in the development of new algorithms for matching the search requests to independent services. This proposal is not concerned with search engine methods per se, but rather in improving the overall power of search engines by means of the combination of techniques from various fields of research (specifically: keyword-driven concept matching, user-driven query optimization, wrapping >>>

Principal Investigator
Stefano Ceri Politecnico di MILANO
Research Objectives
The current evolution of the Web is characterized by an increasing number of search engines and query interfaces, ranging from generic ones (Google) to domain-specific
ones (geo-localization services or on-line catalogs). Meanwhile, wrapping technology is evolving so as to enable the development of specialized services extracting content from data-intensive Web sites (wrappers of sites delivering bond quotes), and exposing them as Web Services. While each search engine or wrapper interface can be separately used to issue focused queries, their intrinsic limit is the inability to support complex queries, ranging over multiple domains. Such queries can be only answered, at the current state of art, by a deep involvement of a knowledgeable user, who inspects services one at a time to determine which are relevant for the given request, and then possibly feeds the results of one search as input to the next. However users do not want to be bothered by distinctions between many heterogeneous data sources, and desire to have one system available for querying such sources; moreover, while they can accept to interact multiple times when their query is rather complex, they certainly do not want to “cut-and-paste” query results into query inputs, as such approach is time-consuming and error-prone.

The focus of this research proposal is to contribute to the development of a new generation search engine (NGS) which integrates known services and provides the user with >>>

Timescale
24 months
National and international background
The general problem of searching the Web with more powerful tools than current search engines is described in details in [WGST04]. This is just one further formulation of a problem which has been posed many times, and addressed each time with respect to the technological state of the art. As an example, eight years ago the database community considered the issue in the Asilomar report [BBC+98].

Search Engines and Information Retrieval

Search Engines, among the most sophisticated and useful resources available on the Internet, assist the user in the task of rapidly and effectively navigating the Web. To some extent, the problem of finding information on the Web can be rephrased as the problem of knowing where search engines are, what they are designed to retrieve, and how to use them. Two different types of engines have been developed so far: large-scale and specific search engines. Large-scale engines exemplify the trade-off between breadth and quality, while the specific ones are more likely to quickly focus a search in one particular area.
Information Retrieval systems are software tools which help users in the task of finding documents contained in a specific corpus or database. Such systems are also widely used on the Web for finding scholarly information as well as for many other recreational activities. The most popular information retrieval technique involves combining the full text of all documents within a >>>