ProvONE+: A Provenance Model for Scientific Workflows | SpringerLink
Skip to main content

ProvONE+: A Provenance Model for Scientific Workflows

  • Conference paper
  • First Online:
Web Information Systems Engineering – WISE 2020 (WISE 2020)

Abstract

The provenance of workflows is essential, both for the data they derive and for their specification, to allow for the reproducibility, sharing and reuse of information in the scientific community. Although the formal modelling of scientific workflow provenance was of interest and studied, in many fields like semantic web, yet no provenance model has existed, we are aware of, to model control-flow driven scientific workflows. The provenance models proposed by the semantic web community for data-driven scientific workflows may capture the provenance of control-flow driven workflows execution traces (i.e., retrospective provenance) but underspecify the workflow structure (i.e., workflow provenance). An underspecified or incomplete structure of a workflow results in the misinterpretation of a scientific experiment and precludes conformance checking of the workflow, thereby restricting the gains of provenance. To overcome the limitation, we present a formal, lightweight and general-purpose specification model for the control-flows involved scientific workflows. The proposed model can be combined with the existing provenance models and easy to extend to specify the common control-flow patterns. In this article, we inspire the need for control-flow driven scientific workflow provenance model, provide an overview of its key classes and properties, and briefly discuss its integration with the ProvONE provenance model as well as its compatibility to PROV-DM. We will also focus on the sample modelling using the proposed model and present a comprehensive implementation scenario from the agricultural domain for validating the model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 5719
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 7149
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://www.opmw.org/model/p-plan/.

  2. 2.

    https://www.w3.org/TR/prov-dm/.

  3. 3.

    ProvONE+ Model: https://github.com/anilabutt/ProvONEplus.

  4. 4.

    http://www.workflowpatterns.com/patterns/control/index.php.

  5. 5.

    http://www.ontologydesignpatterns.org/cp/owl/controlflow.owl.

  6. 6.

    https://www.w3.org/OWL/.

  7. 7.

    https://research.csiro.au/digiscape/digiscapes-projects/digital-services-for-carbon-farming-markets/.

  8. 8.

    https://www.csiro.au/.

References

  1. Altintas, I., Berkley, C., Jaeger, E., Jones, M., Ludascher, B., Mock, S.: Kepler: an extensible system for design and execution of scientific workflows. In: Proceedings of the 16th International Conference on Scientific and Statistical Database Management, pp. 423–424 (2004)

    Google Scholar 

  2. Belhajjame, K., et al.: Using a suite of ontologies for preserving workflow-centric research objects. J. Web Semant. 32, 16–42 (2015)

    Article  Google Scholar 

  3. Cabral, L., Norton, B., Domingue, J.: The business process modelling ontology. In: Proceedings of the 4th International Workshop on Semantic Business Process Management, pp. 9–16. ACM (2009)

    Google Scholar 

  4. Cuevas-Vicenttín, V., et al.: ProvONE: a PROV extension data model for scientific workflow provenance. https://purl.dataone.org/provone-v1-dev (2016). Accessed 23 July 2020

  5. Curcin, V., Ghanem, M., Wendel, P., Guo, Y.: Heterogeneous workflows in scientific workflow systems. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007. LNCS, vol. 4489, pp. 204–211. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-72588-6_36

    Chapter  Google Scholar 

  6. Deelman, E., Gannon, D., Shields, M., Taylor, I.: Workflows and e-science: an overview of workflow system features and capabilities. Future Gener. Comput. Syst. 25(5), 528–540 (2009)

    Article  Google Scholar 

  7. Garijo, D., Gil, Y.: A new approach for publishing workflows: abstractions, standards, and linked data. In: Proceedings of the 6th Workshop on Workflows in Support of Large-scale Science, pp. 47–56 (2011)

    Google Scholar 

  8. Herschel, M., Diestelkàmper, R., Ben Lahmar, H.: A survey on provenance: what for? What form? What from? VLDB J. Int. J. Very Large Data Bases 26(6), 881–906 (2017)

    Article  Google Scholar 

  9. Hull, D., et al.: Taverna: a tool for building and running workflows of services. Nucleic Acids Res. 34(suppl\_2), W729–W732 (2006)

    Google Scholar 

  10. Moreau, M.: World Wide Web consortium PROV-DM: the PROV data model W3C recommendation. https://www.w3.org/TR/prov-dm/ (2013). Accessed 23 July 2020

  11. Moreau, L.: The foundations for provenance on the web. Found. Trends Web Sci. 2, 99–241 (2010)

    Article  MathSciNet  Google Scholar 

  12. Oliveira, W., Oliveira, D.D., Braganholo, V.: Provenance analytics for workflow-based computational experiments: a survey. ACM Comput. Surv. 51(3), 53:1–53:25 (2018). https://doi.org/10.1145/3184900

  13. Russell, N., Ter Hofstede, A.H., Van Der Aalst, W.M., Mulyar, N.: Workflow control-flow patterns: a revised view. BPM Center Report BPM-06-22, BPMcenter. org, pp. 06–22 (2006)

    Google Scholar 

  14. Shiroor, A.R.: Scientific workflow management systems and workflow patterns. Ph.D. thesis, Purdue University (2009)

    Google Scholar 

  15. Taylor, I., Shields, M., Wang, I., Harrison, A.: The triana workflow environment: architecture and applications. In: Workflows for e-Science, pp. 320–339. Springer (2007). https://doi.org/10.1007/978-1-84628-757-2_20

  16. Van Der Aalst, W.M.: Workflow patterns. In: Encyclopedia of Database Systems, pp. 3557–3558 (2009). https://doi.org/10.1007/978-0-387-39940-9_826

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anila Sahar Butt .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Butt, A.S., Fitch, P. (2020). ProvONE+: A Provenance Model for Scientific Workflows. In: Huang, Z., Beek, W., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2020. WISE 2020. Lecture Notes in Computer Science(), vol 12343. Springer, Cham. https://doi.org/10.1007/978-3-030-62008-0_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-62008-0_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-62007-3

  • Online ISBN: 978-3-030-62008-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics