How Database Theory Helps Teach Relational Queries in Database Education (Invited Talk)

How Database Theory Helps Teach Relational Queries in Database Education (Invited Talk)

Authors Sudeepa Roy , Amir Gilad , Yihao Hu , Hanze Meng , Zhengjie Miao , Kristin Stephens-Martinez , Jun Yang



PDF
Thumbnail PDF

File

LIPIcs.ICDT.2024.2.pdf
  • Filesize: 0.64 MB
  • 9 pages

Document Identifiers

Author Details

Sudeepa Roy
  • Duke University, Durham, NC, USA
  • RelationalAI, Berkeley, CA, USA (Visiting Scientist)
Amir Gilad
  • Hebrew University of Jerusalem, Israel
Yihao Hu
  • Duke University, Durham, NC, USA
Hanze Meng
  • Duke University, Durham, NC, USA
Zhengjie Miao
  • Simon Fraser University, Burnaby, Canada
Kristin Stephens-Martinez
  • Duke University, Durham NC, USA
Jun Yang
  • Duke University, Durham, NC, USA

Acknowledgements

The authors are grateful to graduate and undergraduate students Alexander Bendeck, Stephen Chen, Tiangang Chen, Jeremy Cohen, Kevin Day, James Leong, Qiulin Li, James Lim, Jeffrey Luo, Allen Pan, Aanya Sanghavi, Sharan Sokhi, Aparimeya Taneja, and Zachary Zheng who contributed to building the tracing tool for SQL queries. This work was done when Zhengjie Miao (part of his PhD dissertation research) and Amir Gilad were at Duke University.

Cite As Get BibTex

Sudeepa Roy, Amir Gilad, Yihao Hu, Hanze Meng, Zhengjie Miao, Kristin Stephens-Martinez, and Jun Yang. How Database Theory Helps Teach Relational Queries in Database Education (Invited Talk). In 27th International Conference on Database Theory (ICDT 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 290, pp. 2:1-2:9, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024) https://doi.org/10.4230/LIPIcs.ICDT.2024.2

Abstract

Data analytics skills have become an indispensable part of any education that seeks to prepare its students for the modern workforce. Essential in this skill set is the ability to work with structured relational data. Relational queries are based on logic and may be declarative in nature, posing new challenges to novices and students. Manual teaching resources being limited and enrollment growing rapidly, automated tools that help students debug queries and explain errors are potential game-changers in database education. We present a suite of tools built on the foundations of database theory that has been used by over 1600 students in database classes at Duke University, showcasing a high-impact application of database theory in database education.

Subject Classification

ACM Subject Classification
  • Theory of computation → Database theory
  • Information systems → Data management systems
  • Information systems → Structured Query Language
Keywords
  • Query Debugging
  • SQL
  • Relational Algebra
  • Relational Calculus
  • Database Education
  • Boolean Provenance

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Serge Abiteboul, Richard Hull, and Victor Vianu. Foundations of Databases. Addison-Wesley, 1995. URL: http://webdam.inria.fr/Alice/.
  2. Alireza Ahadi, Julia Prior, Vahid Behbood, and Raymond Lister. A quantitative study of the relative difficulty for novices of writing seven different types of sql queries. In Proceedings of the 2015 ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE '15, pages 201-206, New York, NY, USA, 2015. ACM. URL: https://doi.org/10.1145/2729094.2742620.
  3. Paul Ammann and Jeff Offutt. Introduction to software testing. Cambridge University Press, 2016. Google Scholar
  4. Yael Amsterdamer, Daniel Deutch, and Val Tannen. Provenance for aggregate queries. In PODS, pages 153-164, 2011. URL: https://doi.org/10.1145/1989284.1989302.
  5. Marcelo Arenas, Pablo Barceló, Leonid Libkin, Wim Martens, and Andreas Pieris. Database Theory. Open source at https://github.com/pdm-book/community, 2022.
  6. Ashok K. Chandra and Philip M. Merlin. Optimal implementation of conjunctive queries in relational data bases. In Proceedings of the Ninth Annual ACM Symposium on Theory of Computing, STOC '77, pages 77-90, 1977. URL: https://doi.org/10.1145/800105.803397.
  7. Data Analytics Market Share, Market Research Future. URL: https://www.marketresearchfuture.com/reports/data-analytics-market-1689.
  8. Ronald Fagin, Phokion G. Kolaitis, Renée J. Miller, and Lucian Popa. Data exchange: Semantics and query answering. In ICDT, pages 207-224, 2003. URL: https://doi.org/10.1007/3-540-36285-1_14.
  9. Ronald Fagin, Phokion G. Kolaitis, and Lucian Popa. Data exchange: getting to the core. In PODS, pages 90-101, 2003. URL: https://doi.org/10.1145/773153.773163.
  10. Amir Gilad, Zhengjie Miao, Sudeepa Roy, and Jun Yang. Understanding queries by conditional instances. In Zachary G. Ives, Angela Bonifati, and Amr El Abbadi, editors, SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12-17, 2022, pages 355-368. ACM, 2022. URL: https://doi.org/10.1145/3514221.3517898.
  11. Gradescope. URL: https://www.gradescope.com/.
  12. Todd J. Green, Gregory Karvounarakis, and Val Tannen. Provenance semirings. In PODS, pages 31-40, 2007. URL: https://doi.org/10.1145/1265530.1265535.
  13. Todd J Green, Grigoris Karvounarakis, and Val Tannen. Provenance semirings. In PODS, pages 31-40, 2007. URL: https://doi.org/10.1145/1265530.1265535.
  14. Tomasz Imielinski and Witold Lipski Jr. Incomplete information in relational databases. J. ACM, 31(4):761-791, 1984. URL: https://doi.org/10.1145/1634.1886.
  15. Hanze Meng, Zhengjie Miao, Amir Gilad, Sudeepa Roy, and Jun Yang. Characterizing and verifying queries via CINSGEN. In Sudipto Das, Ippokratis Pandis, K. Selçuk Candan, and Sihem Amer-Yahia, editors, Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023, Seattle, WA, USA, June 18-23, 2023, pages 143-146. ACM, 2023. URL: https://doi.org/10.1145/3555041.3589721.
  16. Zhengjie Miao, Tiangang Chen, Alexander Bendeck, Kevin Day, Sudeepa Roy, and Jun Yang. I-rex: An interactive relational query explainer for SQL. Proc. VLDB Endow., 13(12):2997-3000, 2020. URL: https://doi.org/10.14778/3415478.3415528.
  17. Zhengjie Miao, Sudeepa Roy, and Jun Yang. Explaining wrong queries using small examples. In SIGMOD, pages 503-520, 2019. URL: https://doi.org/10.1145/3299869.3319866.
  18. Zhengjie Miao, Sudeepa Roy, and Jun Yang. Ratest: Explaining wrong relational queries using small examples. In Peter A. Boncz, Stefan Manegold, Anastasia Ailamaki, Amol Deshpande, and Tim Kraska, editors, Proceedings of the 2019 International Conference on Management of Data, SIGMOD Conference 2019, Amsterdam, The Netherlands, June 30 - July 5, 2019, pages 1961-1964. ACM, 2019. URL: https://doi.org/10.1145/3299869.3320236.
  19. Joan C. Miller and Clifford J. Maloney. Systematic mistake analysis of digital computer programs. Commun. ACM, 6(2):58-63, 1963. URL: https://doi.org/10.1145/366246.366248.
  20. Glenford J Myers, Tom Badgett, Todd M Thomas, and Corey Sandler. The art of software testing, volume 2. Wiley, 2004. Google Scholar
  21. International Workshop on Data Systems Education (DataEd) 2023. Google Scholar
  22. Sudeepa Roy and Jun Yang. Letter from the special issue editor. IEEE Data Eng. Bull., 45(3):2-3, 2022. URL: http://sites.computer.org/debull/A22sept/p2.pdf.
  23. Yehoshua Sagiv and Mihalis Yannakakis. Equivalences among relational expressions with the union and difference operators. J. ACM, 27(4):633-655, oct 1980. URL: https://doi.org/10.1145/322217.322221.
  24. Oded Shmueli. Equivalence of datalog queries is undecidable. J. Log. Program., 15(3):231-241, feb 1993. URL: https://doi.org/10.1016/0743-1066(93)90040-N.
  25. Boris Trakhtenbrot. The impossibility of an algorithm for the decidability problem on finite classes. Proceedings of the USSR Academy of Sciences, 70(4):569-572, 1950. Google Scholar
  26. Moshe Y. Vardi. The complexity of relational query languages (extended abstract). In Proceedings of the Fourteenth Annual ACM Symposium on Theory of Computing, pages 137-146, 1982. URL: https://doi.org/10.1145/800070.802186.
  27. Project website:. URL: https://dukedb-hnrq.github.io/.
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail