Abstract
The provenance of workflows is essential, both for the data they derive and for their specification, to allow for the reproducibility, sharing and reuse of information in the scientific community. Although the formal modelling of scientific workflow provenance was of interest and studied, in many fields like semantic web, yet no provenance model has existed, we are aware of, to model control-flow driven scientific workflows. The provenance models proposed by the semantic web community for data-driven scientific workflows may capture the provenance of control-flow driven workflows execution traces (i.e., retrospective provenance) but underspecify the workflow structure (i.e., workflow provenance). An underspecified or incomplete structure of a workflow results in the misinterpretation of a scientific experiment and precludes conformance checking of the workflow, thereby restricting the gains of provenance. To overcome the limitation, we present a formal, lightweight and general-purpose specification model for the control-flows involved scientific workflows. The proposed model can be combined with the existing provenance models and easy to extend to specify the common control-flow patterns. In this article, we inspire the need for control-flow driven scientific workflow provenance model, provide an overview of its key classes and properties, and briefly discuss its integration with the ProvONE provenance model as well as its compatibility to PROV-DM. We will also focus on the sample modelling using the proposed model and present a comprehensive implementation scenario from the agricultural domain for validating the model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
ProvONE+ Model: https://github.com/anilabutt/ProvONEplus.
- 4.
- 5.
- 6.
- 7.
- 8.
References
Altintas, I., Berkley, C., Jaeger, E., Jones, M., Ludascher, B., Mock, S.: Kepler: an extensible system for design and execution of scientific workflows. In: Proceedings of the 16th International Conference on Scientific and Statistical Database Management, pp. 423–424 (2004)
Belhajjame, K., et al.: Using a suite of ontologies for preserving workflow-centric research objects. J. Web Semant. 32, 16–42 (2015)
Cabral, L., Norton, B., Domingue, J.: The business process modelling ontology. In: Proceedings of the 4th International Workshop on Semantic Business Process Management, pp. 9–16. ACM (2009)
Cuevas-Vicenttín, V., et al.: ProvONE: a PROV extension data model for scientific workflow provenance. https://purl.dataone.org/provone-v1-dev (2016). Accessed 23 July 2020
Curcin, V., Ghanem, M., Wendel, P., Guo, Y.: Heterogeneous workflows in scientific workflow systems. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007. LNCS, vol. 4489, pp. 204–211. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-72588-6_36
Deelman, E., Gannon, D., Shields, M., Taylor, I.: Workflows and e-science: an overview of workflow system features and capabilities. Future Gener. Comput. Syst. 25(5), 528–540 (2009)
Garijo, D., Gil, Y.: A new approach for publishing workflows: abstractions, standards, and linked data. In: Proceedings of the 6th Workshop on Workflows in Support of Large-scale Science, pp. 47–56 (2011)
Herschel, M., Diestelkàmper, R., Ben Lahmar, H.: A survey on provenance: what for? What form? What from? VLDB J. Int. J. Very Large Data Bases 26(6), 881–906 (2017)
Hull, D., et al.: Taverna: a tool for building and running workflows of services. Nucleic Acids Res. 34(suppl\_2), W729–W732 (2006)
Moreau, M.: World Wide Web consortium PROV-DM: the PROV data model W3C recommendation. https://www.w3.org/TR/prov-dm/ (2013). Accessed 23 July 2020
Moreau, L.: The foundations for provenance on the web. Found. Trends Web Sci. 2, 99–241 (2010)
Oliveira, W., Oliveira, D.D., Braganholo, V.: Provenance analytics for workflow-based computational experiments: a survey. ACM Comput. Surv. 51(3), 53:1–53:25 (2018). https://doi.org/10.1145/3184900
Russell, N., Ter Hofstede, A.H., Van Der Aalst, W.M., Mulyar, N.: Workflow control-flow patterns: a revised view. BPM Center Report BPM-06-22, BPMcenter. org, pp. 06–22 (2006)
Shiroor, A.R.: Scientific workflow management systems and workflow patterns. Ph.D. thesis, Purdue University (2009)
Taylor, I., Shields, M., Wang, I., Harrison, A.: The triana workflow environment: architecture and applications. In: Workflows for e-Science, pp. 320–339. Springer (2007). https://doi.org/10.1007/978-1-84628-757-2_20
Van Der Aalst, W.M.: Workflow patterns. In: Encyclopedia of Database Systems, pp. 3557–3558 (2009). https://doi.org/10.1007/978-0-387-39940-9_826
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Butt, A.S., Fitch, P. (2020). ProvONE+: A Provenance Model for Scientific Workflows. In: Huang, Z., Beek, W., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2020. WISE 2020. Lecture Notes in Computer Science(), vol 12343. Springer, Cham. https://doi.org/10.1007/978-3-030-62008-0_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-62008-0_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62007-3
Online ISBN: 978-3-030-62008-0
eBook Packages: Computer ScienceComputer Science (R0)