Courses

Algorithms for Data Science (M2 Data Science, U. Paris-Saclay)

Language: English Last version: 2021–2022

Course materials and homework/project submission are on eCampus. Temporary anonymous access password: AlgoDS2021!

Structure:

  • Week 1 (10/09/2021) - Intro, Frequent Itemset Mining
  • Week 2 (17/09/2021) - Mining Similar Items
  • Weeks 3,4,5 (tentative) - Data Stream Algorithms
  • Week 6 (tentative) - Advertising on the Web

References:

  1. J. Leskovec, A. Rajaraman, J. Ullman. “Mining of Massive Datasets”. [site]

Social and Graph Data Management (M2 Data Science, U. Paris-Saclay)

Language: English Last version: 2021–2022

Details to follow

References:

  1. A.-L. Barabási. “Network Science.” Cambridge University Press [site]
  2. M. Newman. “Networks: An Introduction.” Oxford University Press
  3. D. Easley, J. Kleinberg. “Networks, Crowds, and Markets.” Cambridge University Press [site]

Databases (Polytech APP3, U. Paris-Saclay)

Language: French Last version: 2020–2021

This course is based on the BD2 course (L3 Info, Paris-Saclay) by Emmanuel Waller.

Lectures:

  • 15/10/2020: Introduction [pdf], Model [pdf], Updates [pdf], Persistence [pdf], Querying [pdf]
  • 15/10/2020: Constraints [pdf]
  • 19/10/2020: PL/SQL - Intro [pdf], Basics [pdf]
  • 19/10/2020: PL/SQL - Cursors [pdf]
  • 07/12/2020: JDBC [pdf1] [pdf2]
  • 11/12/2020: JDBC [pdf1] [pdf2]

Tutorials and labs:

Specifications for the tutorials and labs [pdf]; instructions for connecting to the Oracle database [pdf]

  • 15/10/2020: Updates [pdf]; solution [sql]
  • 15/10/2020: Constraints [pdf]; solution [sql]
  • 19/10/2020: PL/SQL Basics [pdf]; solution [sql]
  • 22/10/2020: PL/SQL Cursors [pdf]; solution [sql]
  • 07/12/2020: JDBC 1 [pdf] [Menu.java]; solution [java]
  • 07/12/2020: JDBC 2 [pdf]; solution [java]
  • 11/12/2020: JDBC 3 [pdf]; solution [java]
  • 17/12/2020: JDBC 4 [pdf]

Examples:

Old Courses

Data Science Project (M2 Data Science, U. Paris-Saclay)

Language: English Last version: 2020–2021

Online via [eCampus]

Schedule:

  • 08/01/2021: Project Presentation – Collaborative Filtering-Based Systems [pdf]
  • 15/01/2021: Team composition [list]
  • 22/01/2021: First presentation [guidelines]

Datasets:

  • GroupLens - MovieLens ratings of movies, also contains tags of movies

Bibliography:

  1. R. Chen, Q. Hua, Y.-S. Chang, B. Wang, L. Zhang, X. Kong. “A Survey of Collaborative Filtering-Based Recommender Systems: From Traditional Methods to Hybrid Methods Based on Social Networks”. IEEE Access, 2018 [pdf]
  2. J. Leskovec, A. Rajaraman, J. Ullman. “Mining of Massive Datasets”. (chapters 9, 3, 11) [site]

Web Data Models (M2 Data&Knowledge, U. Paris-Saclay)

Language: English Last version: 2018–2019

Course dates and slides:

Practical labs and project:

References:

  1. Makoto Murata, Dongwon Lee, Murali Mani, and Kohsuke Kawaguchi. 2005. “Taxonomy of XML schema languages using formal language theory”. ACM Trans. Internet Technol. 5, 4, 660-704. [paper]
  2. Georg Gottlob, Christoph Koch, and Reinhard Pichler. 2005. “Efficient algorithms for processing XPath queries”. ACM Trans. Database Syst. 30, 2, 444-491. [paper]
  3. Todd J. Green, Ashish Gupta, Gerome Miklau, Makoto Onizuka, and Dan Suciu. 2004. “Processing XML streams with deterministic automata and stream indexes”. ACM Trans. Database Syst. 29, 4, 752-788. [paper]
  4. Michael Benedikt and Christoph Koch. 2009. “XPath leashed”. ACM Comput. Surv. 41, 1, Article 3, 54 pages. [paper]
  5. Thomas Schwentick. 2004. “XPath query containment”. SIGMOD Rec. 33, 1, 101-109. [paper]
  6. Gerome Miklau and Dan Suciu. 2004. “Containment and equivalence for a fragment of XPath”. J. ACM 51, 1, 2-45. [paper]
  7. Felipe Pezoa, Juan L. Reutter, Fernando Suarez, Martín Ugarte, and Domagoj Vrgoc. 2016. “Foundations of JSON Schema”. ACM WWW. [paper]

Useful reading:

  • C. Maneth’s course “XML and Databases” [page]
  • S. Abiteboul et al. “Web Data Management”. 2011. Cambridge University Press [page]
  • H. Comon et al. “Tree Automata Techniques and Applications”. 2007 [page]
  • W3Schools tutorials [site]

Previous exams: 2015–2016 [pdf], 2017–2018 [pdf]

Architectures for Massive Data Management (M2 Data&Knowledge, U. Paris-Saclay)

Language: English Last version: 2018–2019

Courses:

  • 02/10/2018: JSON Stores [slides]
  • 23/10/2018: Graph Stores [slides]

Practical labs: