Dynamic Workflow Management and Monitoring Using DDS
Author
Abstract

Large scientific computing data-centers require a distributed dependability subsystem that can provide fault isolation and recovery and is capable of learning and predicting failures to improve the reliability of scientific workflows. This paper extends our previous work on the scientific workflow management systems by presenting a hierarchical dynamic workflow management system that tracks the state of job execution using timed state machines.

Year of Publication
2010
Conference Name
7th IEEE International Workshop on Engineering of Autonomic & Autonomous Systems (EASe)
Attachments
Document
Google Scholar | BibTeX | XML