Contenuto
Ti trovi in: HOME »Programmi, progetti e risultati »I progetti »PRIN - Programmi di ricerca di Rilevante Interesse Nazionale»Programma di ricercaINIZIO_TESTO_DA_INDICIZZARE
RESEARCH PROGRAM
italiano - inglese
Research Units
Similar research programs:
- 1 - Cryptographic databases
- 2 - Web Ram: Web Retrieval and Mining
- 3 - Peer to peeR beyOnd FILE Sharing (PROFILES)
- 4 - Advanced multivariate statistical methods for quality assessment in public utility services: effectiveness-efficiency, risk of the provider, customer satisfaction
- 5 - Learning Hierarchical, Abstract Models from Temporal or Spatial Data
- 6 - The geomatics in support of the actions of Government of the territory
- 7 - New multivariate statistical methods of classification and dimensionality reduction for quality assessment and customer satisfaction in public utility services
- 8 - Similarity-based Methods for Computer Vision and Pattern Recognition: Theory, Algorithms, Applications
- 9 - ESTEEM: Emergent Semantics and cooperaTion in multi-knowledgE EnvironMents - Advanced methods and tools for semantic cooperation in Web virtual communities
- 10 - New method for the analysis of biodiversity: application of pyrosequencing to the study of soil organisms
Scientific and education field classification
International Patent Classification
- PHYSICS
- COMPUTING; CALCULATING; COUNTING (score computers for games A63; combinations of writing applicances with computing devices B43K29/08)
- ELECTRICAL DIGITAL DATA PROCESSING (computers in which a part of the computation is effected hydraulically or pneumatically G06D; optically G06E; self-contained input or output peripheral equipment G06K; impedance networks using digital techniques H03H) [C9603]
- COMPUTING; CALCULATING; COUNTING (score computers for games A63; combinations of writing applicances with computing devices B43K29/08)
Geographical classification
- Region: Lombardia
Bibliografia
[A98]B.Adelberg.NoDoSE.A tool for semi-automatically extracting structured and semistructured data from text documents.SIGMOD98[ABS04]S.Amer-Yahia,C.Botev,J.Shanmugasundaram.TeXQuery: A Full-Text Search Extension to XQuery.WWW04
[AG03]A.Arasu,H.Garcia-Molina.Extracting Structured Data from Web Pages.SIGMOD03
[AVF+98]S.Abiteboul,V.Vianu,B.Fordham,Y.Yesha.Relational Transducers for Electronic Commerce.PODS98
[BBC+98]P.Bernstein et. al. The asilomar report on database research,1998. [BCD+03]D.Berardi,D.Calvanese,G.De Giacomo,M.Lenzerini,M.Mecella.Automatic Composition of E-Services that Export their Behavior.ICSOC03
[BCG+05]D.Berardi,D.Calvanese,G.De Giacomo,R.Hull,M.Mecella.Automatic composition of transition based semantic web services with messaging.VLDB05
[BFG01]R.Baumgartner,S.Flesca,G.Gottlob.Supervised Wrapper Generation with Lixto. VLDB01
[BFHS03]T.Bultan,X.Fu,R.Hull,J.Su.Conversation specification: a new approach to design and analysis of e-service composition.WWW03
[BLR97]C.Beeri,A.Y.Levy,M.C.Rousset.Rewriting queries using views in description logics.PODS97
[C03]A.Calì.Reasoning in data integration systems: Why LAVand GAV are siblings.ISMIS03
[CC02]A.Calì,D.Calvanese.Optimized querying of integrated data over the Web.EISIC02
[CCDL01]A.Calì,D.Calvanese,G.De Giacomo,M. Lenzerini.Accessing data integration systems through conceptual schemas.ER01
[CDL01a]D.Calvanese,G.De Giacomo,M.Lenzerini.A framework for ontology integration.SWWS01
[CDL01b] D. Calvanese, G. De Giacomo, M. Lenzerini. Ontology of integration and integration of ontologies. Description Logic Workshop 2001
[CDLV00]D.Calvanese,G.De Giacomo,M.Lenzerini,M.Y.Vardi.View-based query processing and constraint satisfaction.LICS00
[CGL+98]D.Calvanese,G.De Giacomo,M.Lenzerini,D.Nardi,R.Rosati.Information integration: Conceptual modeling and reasoning support.CoopIS98
[CHK01]V.Christophides,R.Hull,A.Kumar.Querying and Splicing of XML Workflows.
[CM01]V.Crescenzi,G.Mecca,P.Merialdo.RoadRunner: Towards Automatic Data Extraction from Large Web Sites.VLDB01
[CM04]V.Crescenzi,G.Mecca.Automatic information extraction from large websites. J. of the ACM,51(5),2004
[CS01]F.Casati,M.Shan.Dynamic and adaptive composition of e-services. In Information Systems 26(3),2001
[DCFS04]S.Das,E.I.Chong,G.Eadon,J.Srinivasan.Supporting Ontology-Based Semantic matching in RDBMS.VLDB04
[DDH01]A.Doan,P.Domingos,A.Y.Halevy.Reconciling Schemas of Disparate Data Sources: A Machine Learning Approach.SIGMOD01
[DHM04]X.Dong,A.Y.Halevy,J.Madhavan,E.Nemes,J.Zhang.Similarity Search for Web Services. VLDB04
[DL97]O.M.Duschka,A.Y.Levy.Recursive plans for information gathering. IJCAI97
[DLN05]A.Deutsch,B.Ludascher,A.Nash.Rewriting queries using views with access patterns under integrity constraints.ICDT05
[DR02]H.H.Do,E.Rahm.COMA-A System for Flexible Combination of Schema Matching Approaches.VLDB02.
[DSV04]A.Deutsch,L.Sui,V.Vianu.Specification and verification of data-driven web services.PODS04
[E02]C.M.Eastman.30,000 hits may be better than 300: Precision anomalies in Internet
searches. J. ASIST 53,11,2002
[ECJL+99] D.W.Embley, M.D.Campbell, Y.S.Jiang, S.W.Liddle, Y.K.Ng, D.Quass, R.D.Smith. Conceptual-model-based data extraction from multiple-record Web pages. Data Knowl.Eng.99
[F98] D.Freitag. Information extraction from html: Application of a general learning approach. AAAI98
[FBS04]X.Fu,T.Bultan,J.Su.Analysis of interacting BPEL web services.2004.
[FGK02]D.Florescu,A.Gruenhagen,D.Kossmann.XL: A Programming Language for Web Service Specification and Composition.WWW02
[FKMP03]R.Fagin,P.G.Kolaitis,R.J.Miller,L.Popa.Data exchange: Semantics and query answering.ICDT03
[FLM99]M.Friedman,A.Y.Levy,T.Millstein. Navigational plans for data integration.AAAI99
[FLMS99]D,Florescu,A.Y.Levy,I.Manolescu,D.Suciu.Query optimization in the presence of limited access patterns.SIGMOD99
[GLR00]F.Goasdoue,V.Lattes,M.C.Rousset.The use of CARIN language and algorithms for information integration: the picsel system.Int. J. on Cooperative Information Systems,2000
[GRGK97]V.N.Gudivada,V.V.Raghavan,W.I.Grosky,R.Kasanagottu.Information retrieval on the World Wide Web. IEEE Internet Comput.Sept-Oct,1997
[GWG96]S.Gauch,G.Wang,M.Gomez.Profusion: Intelligent fusion from multiple, different search engines. J. Univ. Comput.Sci.2,9,Sept,997
[H04]Y.Halevy.Structures, Semantics and Statistics.VLDB04
[HBCS03]R.Hull,M.Benedikt,V.Christophides,J.Su.E-Services: A Look Behind the Curtain.PODS03
[HDIM03]A.Halevy,O.E.A.Doan,Z.Ives,J.Madhavan.Crossing the structure chasm.CIDR03
[J00]B.J.Jansen.The effect of query complexity on web searching results.Inf. Res.,6(1),2000.
[JLVV99]M.Jarke,M.Lenzerini,Y.Vassiliou,P.Passiliadis,editors.Fundamentals of Data Warehouses.Springer1999.
[JQC+00]M.Jarke,V.Quix,D.Calvanese,M.Lenzerini,E.Franconi,S.Ligoudistiano,P.Vassiliadis,Y.Vassiliou.Concept based design of data warehouses: The DWQ demonstra-tors.SIGMOD00
[L02]M.Lenzerini.Data integration: A theoretical perspective.PODS02
[LC00]C.Li,E.Chang.Query planning with limited source capabilities.ICDE00
[LC01]C.Li,E.Chang.On answering queries in the presence of limited access patterns.ICDT01.
[LGMK04]K.Lerman,C.Gazen,S.Minton,C.A.Knoblock.Populating the semantic web.AAAI04 Workshop on Advances in Text Extraction and Mining
[LSK95]A.Y.Levy,D.Srivastava,T.Kirk.Data model and query evaluation in global information systems. J. of Intelligent Information Systems,5,1995.
[LT02]W.Lucas,H.Topi.Form And Function: The Impact Of Query Term And Operator Usage On Web Search Results. J. Asist 53,2,2002
[MBR01]J.Madhavan,P.A.Bernstein,E.Rahm.Generic Schema Matching with Cupid.VLDB01.
[MBR05]J.Madhavan,P.A.Bernstein,A.H.Doan,A.H.Halevy.Corpus-based Schema Matching.ICDE05.
[MGR02]S.Melnik,H.Garcia-Molina,E.Rahm.Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching.ICDE02
[MM03]S.A.McIlraith,D.L.Martin.Bringing semantics to Web Services.IEEE Intelligent Systems,18(1):90-93,2003
[MMK99]I.Muslea,S.Minton,C.A.Knoblock.A hierarchical approach to wrapper induction.Conference on Autonomous Agents 1999.
[MS02]S.A.McIlraith,T.Cao Son.Adapting Golog for composition of Semantic Web services.KR02
[MZ98]T.Milo,S.Zohar.Using Schema Matching to Simplify Heterogeneous Data Translation. VLDB98
[N04]N.F.Noy.Semantic Integration.A Survey Of Ontology-Based Approaches.SIGMOD Record33(4),2004
[NL04]A.Nash,B.Ludascher.Processing first-order queries under limited access patterns.PODS04
[RB01]E.Rahm,P.A.Bernstein.A survey of approaches to automatic schema matching.VLDB J. 10(4),2001
[RSU95]A.Rajaraman,Y.Sagiv,J.D.Ullman.Answering queries using templates with binding patterns.PODS95
[S99]S.Soderrland.Learning information extraction rules for semistructured and free text. Mach. Learn.99.
[SDWG95]M.A.Sheldon,A.Duda,R.Weiss,D.K.Gifford.Discover: A resource discovery system based on content routing.WWW95.
[SO00]W.Sadiq,M.Orlowska.Analyzing Process Models Using Graph Reduction.In Information Systems 25(2):2000
[SPW+04]E.Sirin,B.Parsia,D.Wu,J.A.Hendler,D.S.Nau.Htn planning for web service composition using shop2. Journal of Web Semantics,1(4),2004
[SS04]S.Staab,R.Studer(Editors).Handbook on Ontologies, Springer 2004
[TP04]P.Traverso,M.Pistore.Automated composition of semantic web services into executable processes. Semantic Web Conference04
[U97]J.D.Ullman.Information integration using logical views.ICDT97
[UG04]M.Uschold,M.Grunninger. Ontologies and Semantics for Seamless Connectivity. SIGMOD Record 33(4),2004
[VMP04]Y.Velegrakis,R.J.Miller,L.Popa.Preserving mapping consistency under schema changes. VLDB J.13(3),2004
[WGST04]G.Weikum,J.Graupmann,R.Schenkel,M.Theobald.Towards a Statistically Semantic Web.ER2004
[WMB94]I.H.Witten,A.Moffat,T.C.Bell.Managing Gigabytes: Compressing and Indexing Documents and Images.Von Nostrand Reinhold,New York,1994.
[WYDM04]W.Wu,C.T.Yu,A.Doan,W.Meng.An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web.SIGMOD04
Keywords
WEB SERVICE, WEB SEARCH, JOIN, ONTOLOGIES, WRAPPERNew technologies and tools for the integration of Web search services
Politecnico di MilanoAbstract
The current evolution of the Web is characterized by an increasing number of search engines and query interfaces, ranging from generic ones (Google) to domain-specific ones (geo-localization services or on-line catalogs). Meanwhile, wrapping technology is evolving so as to enable the development of specialized services extracting content from data-intensive Web sites (wrappers of sites delivering bond quotes), and exposing them as Web Services.While an increasing amount of search services on the Web becomes available, they still work in isolation; their intrinsic limit is the inability to support complex queries ranging over multiple domains. Queries such as “search all vegetarian restaurants close to Milan” require combining search engines specialized over different domains, such as geographic locations and restaurants. The focus of this research proposal is to contribute to the development of a new generation search engine (NGS) which integrates known services and provides the user with a single interface.
The focus of the project is on technology integration and in the development of new algorithms for matching the search requests to independent services. This proposal is not concerned with search engine methods per se, but rather in improving the overall power of search engines by means of the combination of techniques from various fields of research (specifically: keyword-driven concept matching, user-driven query optimization, wrapping >>>
Principal Investigator
Stefano Ceri Politecnico di MILANOResearch Objectives
The current evolution of the Web is characterized by an increasing number of search engines and query interfaces, ranging from generic ones (Google) to domain-specificones (geo-localization services or on-line catalogs). Meanwhile, wrapping technology is evolving so as to enable the development of specialized services extracting content from data-intensive Web sites (wrappers of sites delivering bond quotes), and exposing them as Web Services. While each search engine or wrapper interface can be separately used to issue focused queries, their intrinsic limit is the inability to support complex queries, ranging over multiple domains. Such queries can be only answered, at the current state of art, by a deep involvement of a knowledgeable user, who inspects services one at a time to determine which are relevant for the given request, and then possibly feeds the results of one search as input to the next. However users do not want to be bothered by distinctions between many heterogeneous data sources, and desire to have one system available for querying such sources; moreover, while they can accept to interact multiple times when their query is rather complex, they certainly do not want to “cut-and-paste” query results into query inputs, as such approach is time-consuming and error-prone.
The focus of this research proposal is to contribute to the development of a new generation search engine (NGS) which integrates known services and provides the user with >>>
Timescale
24 monthsNational and international background
The general problem of searching the Web with more powerful tools than current search engines is described in details in [WGST04]. This is just one further formulation of a problem which has been posed many times, and addressed each time with respect to the technological state of the art. As an example, eight years ago the database community considered the issue in the Asilomar report [BBC+98].Search Engines and Information Retrieval
Search Engines, among the most sophisticated and useful resources available on the Internet, assist the user in the task of rapidly and effectively navigating the Web. To some extent, the problem of finding information on the Web can be rephrased as the problem of knowing where search engines are, what they are designed to retrieve, and how to use them. Two different types of engines have been developed so far: large-scale and specific search engines. Large-scale engines exemplify the trade-off between breadth and quality, while the specific ones are more likely to quickly focus a search in one particular area.
Information Retrieval systems are software tools which help users in the task of finding documents contained in a specific corpus or database. Such systems are also widely used on the Web for finding scholarly information as well as for many other recreational activities. The most popular information retrieval technique involves combining the full text of all documents within a >>>



