Reliable Job Scheduler using RFOH in Grid Computing
Abstract
Distributed and dynamic nature of Grids causes the probability of failure is great in such systems. So fault tolerance has become a crucial area in computational Grid. In this paper, we propose a new Genetic Algorithm that used RFOH for having reliable job scheduling in computational Grid. This strategy maintains the fault occurrence history of resources in Grid Information Server (GIS). Genetic Algorithm with RFOH finds a near optimal solution for the problem. Furthermore, it increases the percentage of jobs executed within specified deadline. The simulation results demonstrate that proposed strategy decreases the probability of failure and therefore increases reliability. Also it reduces total execution time of jobs. So we will have a combination of reliability and user satisfaction.
References
- I. Foster, C. Kesselman, and S. Tueke, “The anatomy of the grid: Enabling scalable virtual organizations,” Supercomputing Applications, 2001.
- Foster and C. Kesselman, “The Grid Blueprint for a Future Computing Infrastructure,” San Mateo, CA: Morgan Kaufmann, 1999.
- M. Baker, R. Buyya and D. Laforenza , “Grids and Grid Technologies for Wide-area Distributed Computing,” Software-Practice & Experience, Vol. 32, No.15, 2002, pp: 1437-1466.
- A.Y. Zomaya, R.C. Lee, and S. Olariu , “An Introduction to Genetic-Based Scheduling in Parallel-Processor Systems,” Solutions to Parallel and Distributed Computing Problems: Lessons from Biological Science, A.Y. Zomaya, F. Ercal, and S. Olariu, eds., New York: Wiley, 2001, chapter 5, pp. 111-133.
- HwaMin Lee, KwangSik Chung, SungHo Chin, JongHyuk Lee, DaeWon Lee, 2005 ,"A resource management and fault tolerance services in grid computing", Journal of Parallel and Distributed Computing, Vol. 65, pp. 1305-1317.
- Babar Nazir, Taimoor Khan, “Fault Tolerant Job Scheduling in Computational Grid,” 2nd International Conference on Emerging Technologies Peshawar, Pakistan (IEEE—ICET), 2006 .
- S. Hwang and C. Kesselman. “Grid Workflow: A Flexible Failure Handling Framework for the Grid,” In 12th IEEE International Symposium on High Performance Distributed Computing (HPDC’03), Seattle, Washington, USA, IEEE CS Press, Los Alamitos, CA, USA, June 22 - 24, 2003.
- S. Baghavathi Priya, M. Prakash, Dr. K. K. Dhwan, “Fault Tolerance-Genetic Algorithm for Grid Task Scheduling using Check Point,” The Sixth International Conference on Grid and Cooperative Computing (GCC), 2007.
- Leyli Mohammad Khanli, Maryam Etminan Far, and Amir Masoud Rahmani, “RFOH: a New Fault Tolerant Job Scheduler in Grid Computing”, The 2nd International Conference on Computer Engineering and Applications (ICCEA), Bali Island, Indonesia, March 19-21, 2010.
Full Text: PDF
Refbacks
- There are currently no refbacks.