Publications
Publications
   

Showing records 1 - 10 of 51

Herault, T., Bouteiller, A., Bosilca, G., Gamell, M., Teranishi, K., Parashar, M., Dongarra, J. "Practical Scalable Consensus for Pseudo-Synchronous Distributed Systems," Supercomputing, Austin, TX, November, 2015.

PDF
Herault, Thomas and Bouteiller, Aurelien and Bosilca, George and Gamell, Marc and Teranishi, Keita and Parashar, Manish and Dongarra, Jack "Practical Scalable Consensus for Pseudo-synchronous Distributed Systems," Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, ACM, Austin, Texas, pp. 31:1--31:12, Nov, 2015.

Herault, T., Bouteiller, A., Bosilca, G., Gamell, M., Teranishi, K., Parashar, M., Dongarra, J. "Practical Scalable Consensus for Pseudo-Synchronous Distributed Systems: Formal Proof," University of Tennessee Computer Science Technical Report, ICL-UT-15-01, April, 2015.

PDF
George Bosilca, Aurelien Bouteiller, Thomas Herault, Yves Robert and Jack Dongarra "Composing resilience techniques: ABFT, periodic and incremental checkpointing," International Journal of Networking and Computing (IJNC), Computer Science Journals, 501-525, January, 2015.

PDF
Benoit A., Robert, Y., Raina S.K. "Efficient checkpoint/verification patterns for silent error detection," University of Tennessee Computer Science Technical Report, ICL-UT-14-03, May, 2014.

PDF
Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., Dongarra, J.J. "An evaluation of User-Level Failure Mitigation support in MPI," Computing, Springer, Vienna, DOI 10.1007/s00607-013-0331-3, 1-14, May, 2013.

PDF
Wesley Bland and Aurelien Bouteiller and Thomas Herault and Joshua Hursey and George Bosilca and Jack J. Dongarra "An evaluation of User-Level Failure Mitigation support in MPI," Computing, Vol. 95, No. 12, 1171--1184, 2013.

PDF
Bland, W. "User Level Failure Mitigation in MPI," Euro-Par 2012: Parallel Processing Workshops, Caragiannis, I., Alexander, M., Badia, R., Cannataro, M., Costan, A., Danelutto, M., Desprez, F., Krammer, B., Sahuquillo, J., Scott, S., and Weidendorfer, J. eds. Springer Berlin Heidelberg, Rhodes Island, Greece, 7640, 499-504, August, 2012.

PDF
Du, P., Bouteiller, A., Bosilca, G., Herault, T., Dongarra, J. "Algorithm-Based Fault Tolerance for Dense Matrix Factorization," Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, J. Ramanujam, P. Sadayappan eds. ACM, New Orleans, LA, USA, 225-234, February 25-29, 2012.

PDF
Bland, W., Bosilca, G., Bouteiller, A., Herault, T., Dongarra, J. "A Proposal for User-Level Failure Mitigation in the MPI-3 Standard," University of Tennessee Electrical Engineering and Computer Science Technical Report, ut-cs-12-693, February 24, 2012.

PDF

Showing records 1 - 10 of 51

Jun 29 2022 Admin Login