Abstract
We focus on human-in-the-loop, information-integration settings where users gather and evaluate data from a broad variety of sources and where the levels of trust in sources and users change dynamically. In such settings, users must use their judgment as they collect and modify data. As an example, a battlefield information officer preparing a report to inform his or her superiors about the current state of affairs must gather and integrate data from many (including non-computerized) sources. By tracking multiple sources for individual values, the officer may eliminate a value from the current state whenever all of the sources where this value was found are no longer trusted. We define a conceptual model for a curated database with provenance for such settings, the Multi-granularity, Multi-provenance Model (MMP), which supports multiple insertions and multiple (copy-and-)paste operations for a single database element, captures the external source for all operations, and includes a Data Confidence Language that allows users to confirm or doubt values to record their atomic judgments about the data. In this paper, we briefly summarize the MMP model and show how it can be extended to support potentially complex operations including compound judgment operators (such as merging tuples to achieve entity resolution), while capturing a complete record of data provenance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, P., Benjelloun, O., Das Sarma, A., Hayworth, C., Nabar, S., Sugihara, T., Widom, J.: Trio: a system for data, uncertainty, and lineage. In: Proceedings of the 32nd International Conference on Very Large Data Bases, VLDB 2006. VLDB Endowment (2006)
Archer, D.W., Delcambre, L.M.L.: Definition and Formalization of Entity Resolution Functions for Everyday Information Integration. In: Schewe, K.-D., Thalheim, B. (eds.) SDKB 2008. LNCS, vol. 4925, pp. 126–142. Springer, Heidelberg (2008)
Archer, D., Delcambre, L.: A Conceptual Model and Predicate Language for Data Selection and Projection Based on Provenance. In: Proceedings of the Second Workshop on the Theory and Practiceof Provenance (TaPP 2010), San Jose, CA (February 2010)
Archer, D.: Conceptual Modeling of Data with Provenance. PhD dissertation. Portland State University (2011)
Bhagwat, D., Chiticariu, L., Tan, W., Vijayvargiya, G.: An annotation management system for relational databases.In Proceedings of the 30thInternational Conference on Very Large Data Bases, VLDB 2004. VLDB Endowment (2004)
Buneman, P., Chapman, A., Cheney, J., Vansummeren, S.: A provenance model for manually curated data. In: Moreau, L., Foster, I. (eds.) IPAW 2006. LNCS, vol. 4145, pp. 162–170. Springer, Heidelberg (2006)
Buneman, P., Cheney, J., Vansummeren, S.: On the expressivenesss of implicit provenance in query and update languages. ACM Transactions on Database Systems 33(4) (2008)
Cui, Y., Widom, J., Wiener, J.: Tracing the lineage of view data in a warehousing environment. ACM Transactions on Database Systems 25(2) (2000)
Green, T., Karvounarakis, G., Taylor, N., Biton, O., Ives, Z., Tannen, V.: Orchestra: facilitating collaborative data sharing. In: SIGMOD 2007: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. ACM, New York (2007)
Green, T., Karvounarakis, G., Tannen, V.: Provenance semirings. In: PODS 2007: Proceedings of the Twenty-Sixth ACM SIGMOD-SIGACTSIGART Symposium on Principles of Database Systems, ACM, New York (2007)
Levitin, A.: How to measure size, and how not to. In: Proceedings of the Tenth COMPSAC Conference. IEEE Computer Society Press, Washington DC (1986)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Archer, D.W., Delcambre, L.M.L., Maier, D. (2013). User Trust and Judgments in a Curated Database with Explicit Provenance. In: Tannen, V., Wong, L., Libkin, L., Fan, W., Tan, WC., Fourman, M. (eds) In Search of Elegance in the Theory and Practice of Computation. Lecture Notes in Computer Science, vol 8000. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41660-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-41660-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41659-0
Online ISBN: 978-3-642-41660-6
eBook Packages: Computer ScienceComputer Science (R0)