Publications
Showing records 1 - 10 of 51
Herault, T., Bouteiller, A., Bosilca, G., Gamell, M., Teranishi, K., Parashar, M., Dongarra, J. "Practical Scalable Consensus for Pseudo-Synchronous Distributed Systems,"
Supercomputing,
Austin, TX, November, 2015.
|
|
Thomas Herault, Aurelien Bouteiller, George Bosilca, Marc Gamell, Keita Teranishi, Manish Parashar and Jack Dongarra "Practical Scalable Consensus for Pseudo-synchronous Distributed Systems,"
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis,
ACM,
Austin, Texas, pp. 31:1-31:12,
November, 2015.
|
Herault, T., Bouteiller, A., Bosilca, G., Gamell, M., Teranishi, K., Parashar, M., Dongarra, J. "Practical Scalable Consensus for Pseudo-Synchronous Distributed Systems: Formal Proof,"
University of Tennessee Computer Science Technical Report,
ICL-UT-15-01,
April, 2015.
|
|
George Bosilca, Aurelien Bouteiller, Thomas Herault, Yves Robert and Jack Dongarra "Composing resilience techniques: ABFT, periodic and incremental checkpointing,"
International Journal of Networking and Computing (IJNC),
Computer Science Journals,
501-525,
January, 2015.
|
|
Benoit, A., Robert, Y., Raina, S.K. "Efficient checkpoint/verification patterns for silent error detection,"
University of Tennessee Computer Science Technical Report,
ICL-UT-14-03,
May, 2014.
|
|
Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., Dongarra, J.J. "An evaluation of User-Level Failure Mitigation support in MPI,"
Computing,
Springer,
Vienna, DOI 10.1007/s00607-013-0331-3,
1-14,
May, 2013.
|
|
Wesley Bland, Aurelien Bouteiller, Thomas Herault, Joshua Hursey, George Bosilca and Jack J. Dongarra "An evaluation of User-Level Failure Mitigation support in MPI,"
Computing,
Vol. 95, No. 12,
1171-1184,
2013.
|
|
Bland, W. "User Level Failure Mitigation in MPI,"
Euro-Par 2012: Parallel Processing Workshops,
Caragiannis, I., Alexander, M., Badia, R., Cannataro, M., Costan, A., Danelutto, M., Desprez, F., Krammer, B., Sahuquillo, J., Scott, S., and Weidendorfer, J. eds.
Springer Berlin Heidelberg,
Rhodes Island, Greece, 7640,
499-504,
August, 2012.
|
|
Du, P., Bouteiller, A., Bosilca, G., Herault, T., Dongarra, J. "Algorithm-Based Fault Tolerance for Dense Matrix Factorization,"
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012,
J. Ramanujam, P. Sadayappan eds.
ACM,
New Orleans, LA, USA, 225-234,
February 25-29, 2012.
|
|
Bland, W., Bosilca, G., Bouteiller, A., Herault, T., Dongarra, J. "A Proposal for User-Level Failure Mitigation in the MPI-3 Standard,"
University of Tennessee Electrical Engineering and Computer Science Technical Report,
ut-cs-12-693,
February 24, 2012.
|
|
|