CC Web of Data
Administrative Information, Second Semester 2011:
-
Schedule: Mondays and Wednesdays, 4th block
-
Course Syllabus
Required Readings
- First Week: [12], [20], [33], [50] (Part 3).
- Second Week:
The Fourth Paradigm: Jim Gray on eScience: a transformed scientific method;
T. Hannay: From Web 2.0 to the Global Database.
Ch. Anderson: The End of Theory: The Data Deluge Makes the Scientific
Method Obsolete. Wired Magazine, 16.07
Evans et al., Machine Science, Science, Vol. 329, 2010.
D. Lazer et al., Computational Social Science, Science, Vol. 323.
- Third Week:
Newman, The structure and function of complex networks, Sects. I, II, and III.
Halevy et al., The unreasonable effectiveness of Data, IEEE Intelligent Systems, 2009.
Jeff Hammerbacher, Information Platforms and the rise of the Data Scientist. (In Beautiful Data, eds. T. Segaran and J. Hammerbacher, O'Reilly, 2009.)
- Fourth Week:
T. Heath, Ch. Bizer,
Linked Data: Evolving the Web into a Global Data Space
- Fifth Week:
J. Hoem,
Openness in communication, First Monday, 2006.
Preliminary (and Unordered) Bibliography
Metadata
Introduction to Metadata
Tony Gill, Anne J. Gilliland, Maureen Whalen, and Mary S. Woodley
Edited by Murtha Baca
Data Deluge
[11] G. Bell, J. Gray, A. Szalay, Petascale Computational Systems: Balanced CyberInfrastructure in a Data-Centric World, Computer, Volume 39, Issue 1 (January 2006), 110 - 112.
[12] G. Bell, T. Hey, A. Szalay, Beyond the Data Deluge, Science, Vol. 323, March 2009, pp. 1297-1298.
[20] A. Szalay, J. Gray, Science in an Exponential World, Nature, Vol. 440,
[21] T. Hey, S. Tansley, and K. Tolle (eds.),
The Fourth Paradigm: Data-Intensive Scientific Discovery, 2010
[33] Mike Loukides,
What is data science?
[50] Tim O'Reilly, What Is Web 2.0, 2005
Data Models
[1] R. Angles, C. Gutierrez, Survey of Graph Database Models, ACM Computing Surveys, Vol. 40, No. 1, February 2008.
[2] R. Agrawal et al.,
The Claremont Report on Database Research, 2008
[3] S. Abiteboul, Querying semi-structured data, International Conf. on Database Theory-ICDT'97, 1997.
[4] S. Abiteboul, V. Vianu, Queries and Computation on the Web, ICDT 1997: pp. 262-275.
[5] S. Abiteboul, V. Vianu, Queries and Computation on the Web, Theor. Comput. Sci. 239(2): 231-255 (2000).
[6] S. Abiteboul, P. Buneman, D. Suciu, Data on the Web. From Relations to Semistructured Data and XML, Morgan Kaufmann Publ. California, 2000.
[7] L. G. Alex Sung, N. Ahmed, R. Blanco, H. Li, M. Ali Soliman, D. Hadaller, A Survey of Data Management in Peer-to-Peer Systems.
Web Data
[13] Ch. Bizer, T. Heath, T. Berners-Lee,
Linked Data - The Story So Far,
International Journal on Semantic Web and Information Systems, 3 (2009), pp. 1-22.
[14] P. Buneman, Semistructured data, ACM PODS, 1997.
[15] T. Bray, J. Paoli, C. M. Sperberg-McQueen,
Extensible Markup Language (XML) 1.0, W3C Recommendation, 10 February 1998.
[26] Alon Y. Halevy, M. J. Franklin, D. Maier, Principles of dataspace systems, PODS 2006: 1-9.
[37] A. O. Mendelzon, The Web is not a Database, Workshop on Web Information and Data Management 1998.
[28] M. Hausenblas, M. Karnstedt, Understanding Linked Open Data as a Web-Scale Database, 1st Internat. Conf. on Advances in Databases, pp. 56-61, 2010.
[44] S. Raghavan, H. Garcia-Molina, Complex Queries over Web Repositories, VLDB 2003, pp. 33-44.
[45] Y. Papakonstantinou, H. Garcia-Molina, J. Widom, Object exchange across heterogeneous information sources, 11th International Conference on Data Engineering (ICDE), 1995, pp. 251-260.
[8] Tim Berners-Lee,
Design Issues/Linked Data
[9] Tim Berners-Lee,
Linked Open Data. What is the idea?
[22] T. Green, V. Tannen, Models for Incomplete and Probabilistic Information, EDBT Workshops, Munich, Germany, March 2006.
[46] D. Suciu, Probabilistic Databases, Database Theory Column, SIGMOD Record, 2008.
[47] M. Spielmann, J. Tyszkiewicz, J. Van den Bussche, Distributed computation of web queries using automata, PODS 2002, pp. 97-108.
Data Access
[10] P. A. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, I. Zaihrayeu, Data Management for Peer-to-Peer Computing: A Vision, WebDB, Workshop on Databases and the Web, 2002.
[16] S. Brin, L. Page, The anatomy of a large-scale hypertextual Web search engine, Computer Networks and ISDN Systems, 1998, pp. 107-117.
[17] M. Cai, M. Frank, RDFPeers: a scalable distributed RDF repository based on a structured peer-to-peer network, WWW'04, 2004, pp. 650-657.
[18] DATA.gov project
[19] O. Erling, I. Mikhailov, Towards Web Scale RDF, 4th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS2008), 2008.
[24] T. Guan, L. Saxton, A complexity model for web queries, Fundamentals of Information Systems, Ch. 1, Kluwer, 1999.
[25] S. Gribble, A. Halevy, Z. Ives, M. Rodrig, D. Suciu, What Can Databases Do for Peer-to-Peer?, WebDB, Workshop on Databases and the Web, 2001.
[27] O. Hartig, C. Bizer, J.-C. Freytag, Executing SPARQL queries over the web of linked data, ISWC '09, 2009, pp. 293-309.
[34] R. Himmeröder, G. Lausen, B. Ludäscher, Ch. Schlepphorst, On a Declarative Semantics for Web Queries, DOOD'97, 1997, LNCS 1341.
[35] W. Loke, A. Davison, LogicWeb: Enhancing the Web with Logic Programming, The Journal of Logic Programming, Volume 36, Issue 3, September 1998, pp. 195-240.
[38] A. O. Mendelzon, T. Milo, Formal Models of Web Queries, PODS 1997: 134-143.
[39] A. O. Mendelzon, T. Milo, Formal Models of Web Queries, Inf. Syst. 23(8): 615-637 (1998).
[52] J.T. Horng, Y.Y. Tai, Pattern-based approach to structural queries on the World Wide Web, Proc. Natl. Sci. Counc. ROC(A), Vol. 24, No. 1, 2000, pp. 31-43.
Data Processing
[23] L. Wookey, J. Geller, Semantic Hierarchical Abstraction of Web Site Structures, Journal of Research and Practice in Information Technology, vol. 36, 2004, pp. 71-82.
[41] D. Konopnicki, O. Shmueli, Bringing Database Functionalities to the WWW, The World Wide Web and Databases, LNCS 1590/1999, pp. 63-77.
[30] R. Kumar, P. Raghavan, S. Rajagopalan, D. Sivakumar, A. Tompkins, E. Upfal, The Web as a Graph, PODS 2000, pp. 1-10.
[48] M. Stonebraker, S. Madden, D. J. Abadi, S. Harizopoulos, N. Hachem, and P. Helland, The end of an architectural era: (it's time for a complete rewrite), VLDB '07, 2007, pp. 1150-1160.
[49] Y. Cao, E-P. Lim, Data model for warehousing historical Web information, Information and Software Technology, Volume 45, Issue 6, 15 April 2003, pp. 315-334.
[51] Evimaria Terzi, Mohand-Saïd Hacid, Athena Vakali, Modeling and Querying Web Data: A Constraint-Based Logic Approach, Information Modeling for Internet Applications, 2003, pp. 1-21.
[53] P. Valduriez, E. Pacitti, Data Management in Large-scale P2P Systems, VECPAR 2004: 104-118.
Principles, Best Practices, etc.
[31] Th. Lee, Attribution Principles for Data Integration: Technology and Policy Perspectives, Ph.D. Thesis, Massachusetts Institute of Technology, Engineering Systems Division, Technology, Management, and Policy Program, 2002.
[32]
Linked Data - Connect Distributed Data across the Web
[40] A. Rubinstein, Modeling Bounded Rationality, MIT Press, 1998.
Standards and References
[42] No SQL, http://nosql-database.org/
[36] D.L. McGuinness, F. van Harmelen,
OWL Web Ontology Language Overview, W3C Recommendation, 10 February 2004
[29] G. Klyne, J. Carroll,
Resource Description Framework (RDF) Concepts and Abstract Syntax, W3C Recommendation, 2004
Some Supplementary Links for the Course
Presentations
- Javier Fernandez,
Stonebraker et al., The end of an architectural era: (it's time for a complete rewrite)
- Maira Marques,
Halevy et al., Dataspaces
- Felipe Bravo,
Mendelzon & Milo, Formal Models of Web Queries
- Andres Abeliuk,
Kumar et al., The Web as a Graph
- Juan Enrique Muñoz,
Abiteboul & Vianu, Queries and Computation on the Web
- Gustavo Pabon:
Social Aware Cloud Storage,
T. Berners-Lee
- Guillermo Cabrera:
Astronomical Data
- Claremont Report