2022
Joachim Protze, Marc-André Hermanns, Matthias S. Müller, Van-man Nguyen, Julien Jaeger, Emmanuelle Saillard, Patrick Carribault and Denis Barthou, MPI detach – Towards automatic asynchronous local completion, In Journal on Parallel Computing vol.109. (pdf|bib|doi)
Manuel Ferat, Romain Pereira, Adrien Roussel, Patrick Carribault, Luiz-Angelo Steffenel, et al.. Enhancing MPI+OpenMP task based applications for heterogenous architectures with GPU support. IWOMP 2022 – 18th International Workshop on OpenMP, Sep 2022, Chattanooga, TH, United States. pp.3-16, (pdf|bib|doi) — Associated MPC Version (tarball).
Jean-Baptiste Besnard, Sameer Shende, Allen Malony, Julien Jaeger, and Marc Perache. 2022. Enabling Global MPI Process Addressing in MPI Applications. In Proceedings of the 29th European MPI Users’ Group Meeting (EuroMPI/USA’22). Association for Computing Machinery, New York, NY, USA, 27–36. (pdf|bib|doi).
Stéphane Bouhrour Thibaut Pépin and Julien Jaeger, Towards leveraging collective performance with the support of MPI 4.0 features in MPC, In Journal on Parallel Computing vol.109. (pdf|bib|doi)
2021
Thomas Dionisi, Stéphane Bouhrour, Julien Jaeger, Patrick Carribault and Marc Pérache, Enhancing Load-balancing of MPI Applications with Workshare, In Euro-Par 2021 . (pdf|bib|doi)
Daniel J. Holmes, Anthony Skjellum, Julien Jaeger, Ryan E. Grant, Purushotham V. Bangalore, Matthew G. F. Dosanjh, Amanda Bienz and Derek Schafer, Partitioned Collective Communication, In International Workshop on Exascale MPI (ExaMPI) 2021 . (pdf|bib|doi)
Romain pereira, Adrien Roussel, patrick Carribault and Thierry Gauthier, Communication-Aware Task scheduling Strategy in Hybrid MPI+OpenMP Applications, In International Workshop on OpenMP (IWOMP) 2021 . (pdf|bib|doi)
Jean-Baptiste Besnard, Julien Adam, Allen D. Malony, Sameer Shende, Julien Jaeger, Patrick Carribault and Marc Pérache, Exploring Space-Time Trade-Off in Backtraces, In Proceedings of the 12th and 13th Workshop on Parallel Tools for High performance Computing. (pdf|bib|doi)
2020
Edgar A. León, Balazs Gerofi, Julien Jaeger, Guillaume Mercier, Rolf Riesen, Masamichi Takagi, Brice Goglin, Application-Driven Requirements for Node Resource Management in Next-Generation Systems, in Proceedings of the 2020 International Workshop on Runtime and Operating Systems for Supercomputers – ROSS 2020. (pdf|bib|doi)
Van Man Nguyen, Emmanuelle Saillard, Julien Jaeger, Denis Barthou, Patrick Carribault, PARCOACH Extension for Static MPI Nonblocking and Persistent Communication Validation, In Fourth International Workshop on Software Correctness for HPC Applications – CORRECTNESS 2020. (pdf|bib|doi)
Bouhrour S. and Jaeger J., Implementation and performance evaluation of MPI persistent collectives in MPC: a case study, in Proceedings of the 27th European MPI Users’ Group Meeting – EuroMPI/USA 2020. (pdf|bib|doi|final version)
Roussel A., Carribault P., and Jaeger J., Preliminary Experience with OpenMP Memory Management Implementation, In International Workshop on OpenMP – IWOMP’20. (pdf|bib|doi|final version)
Nguyen V.-M., Saillard E., Jaeger J., Barthou D., and Carribault P., Automatic Code Motion to Extend MPI Nonblocking Overlap Window, in 1st Workshop on Compiler-assisted Correctness Checking and Performance Optimization for HPC – C3PO’20. (pdf|bib|doi|final version)
Reynier F., Utilisation de cœurs dédiés pour la progression des communications non bloquantes, in Conférence francophone d’informatique en Parallélisme, Architecture et Système – Compas 2020 (in french). (pdf|bib)
2019
A. Denis, J. Jaeger, E. Jeannot, M. Pérache and H. Taboada, Study on progress threads placement and dedicated cores for overlapping MPI nonblocking collectives on manycore processor, in International Journal on High Performance Computing and Applications. (pdf|bib)
Besnard, J. B., Jaeger, J., Malony, A. D., Shende, S., Taboada, H., Pérache, M., & Carribault, P. (2019, September). Mixing ranks, tasks, progress and nonblocking collectives. In Proceedings of the 26th European MPI Users’ Group Meeting. (pdf|bib)
Purushotham V. Bangalore P., Rabenseifner R., Holmes D., Jaeger J., Mercier G., Blaas{-}Schenner C., and Skjellum A. Exposition, clarification, and expansion of {MPI} semantic terms and conventions: is a nonblocking {MPI} function permitted to block?. In Proceedings of the 26th European MPI Users’ Group Meeting (pp. 1-10). (pdf|bib)
Vieira, R. B., Capra, A., Carribault, P., Jaeger, J., Pérache, M., & Roussel, A. (2019, September). Detecting Non-Sibling Dependencies in OpenMP Task-Based Applications. In International Workshop on OpenMP (pp. 231-245). Springer, Cham. (pdf|bib)
Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Pérache, Patrick Carribault, Julien Jaeger, Allen D. Malony, Sameer Shende, Checkpoint/Restart Approaches for a Thread-Based MPI Runtime, in Parallel Computing (2019), doi: https://doi.org/10.1016/j.parco.2019.02.006. (pdf|bib)
2018
A. Denis, J. Jaeger, E. Jeannot, M. Pérache and H. Taboada, Progress Thread Placement for Overlapping MPI Non-Blocking Collectives using Simultaneous Multi-Threading, in EuroPar 2018 Workshops : COLOC (pdf|bib)
H. Brunie, J. Jaeger, P. Carribault and D. Barthou, Profile-Guided Scope-Based Dynamic Data Allocation Method, in MEMSYS 2018. (pdf|bib)
H. Taboada, Recouvrement des Collectives MPI Non-bloquantes sur Processeur Manycore, in Compas 2018 (in french). (pdf|bib)
M. Sergent, M. Dagrada, P. Carribault, J. Jaeger, M. Pérache and G. Papauré, Efficient Communication/Computation Overlap with MPI+OpenMP Runtimes Collaboration, in EuroPar 2018 (pdf|bib)
A. Denis, J. Jaeger, E. Jeannot, M. Pérache and H. Taboada, Dynamic Placement of Progress Thread for Overlapping MPI Non-Blocking Collectives on Manycore Processor, in EuroPar 2018 (pdf|bib)
J. Adam, J.-B. Besnard, A.. Malony, S. Shende, M. Pérache, P. Carribault and J. Jaeger, Transparent High-Speed Network Checkpoint/Restart in MPI, in EuroMPI 2018 (pdf|bib)
2017
A. Loussert, B. Welterlen, P. Carribault, J. Jaeger, M. Pérache and R. Namyst, Resource-Management Study in HPC Runtime-Stacking Context, in 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2017), IEEE, Pages 177-184. (pdf|bib)
A. Capra, P. Carribault, J.-B. Besnard, A. Mallony, M. Pérache, and J. Jaeger, User Co-scheduling for MPI+OpenMP Applications Using OpenMP Semantics, in International Workshop on OpenMP 2017 (IWOMP 2017), Springer, Pages 203-216. (pdf|bib)
The final authenticated version is available on https://doi.org/10.1007/978-3-319-65578-9_14
J.-B. Besnard, A. Mallony, S. Shende, M. Pérache, P. Carribault and J. Jaeger, Towards a Better Expressiveness of the Speedup Metric in MPI Context, in 46th International Conference on Parallel Processing Workshops (ICPPW) 2017, PSTI 2017, IEEE (pdf|bib)
2016
J. Adam, M. Pérache, A Parallel and Resilient Frontend for High Performance Validation Suites, in VECPAR 2016, ACM (pdf|bib)
J.-B. Besnard, J. Adam, S. Shende, M. Pérache, P. Carribault, J. Jaeger and A. Mallony, Introducing Task-Containers as an Alternative to Runtime-Stacking, in EuroMPI 2016, ACM (pdf|bib)
H. Taboada, Impact du placement des threads de progression pour les collectives MPI non-bloquantes, in Compas 2016 (in french). (pdf|bib)
2015
J.-B. Besnard, A. Mallony, S. Shende, M. Pérache, P. Carribault and J. Jaeger, An MPI Halo-Cell Implementation for Zero-Copy Abstraction, in EuroMPI 2015, ACM. (pdf|bib)
J. Jaeger, E. Saillard, P. Carribault and D. Barthou, Correctness Analysis of MPI-3 Non-Blocking Communications in PARCOACH, in EuroMPI 2015, ACM (short paper). (pdf|bib|poster)
J. Jaeger, A. Mahéo, P. Carribault, M. Pérache and W. Jalby, Optimisation des opérations collectives dans les applications MPI+OpenMP, in Chocs Avancees 2014, pages 44-45 (2015) (in french). (bib)
MPI Thread-level Checking for MPI+OpenMP Applications, in EuroPar 2015: Parallel Processing, pages 31-42 (2015), Springer. (pdf|bib)
Static/Dynamic validation of MPI collective communications in multi-threaded context, in Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Pages 279-280 (2015), ACM. (pdf|bib)
J. Jaeger, P. Carribault, and M. Pérache, Fine‐grain data management directory for OpenMP 4.0 and OpenACC, in Concurrency and Computation: Practice and Experience (2014), Pages 1528-1539 (2015), ACM. (pdf|bib)
2014
E. Saillard, P. Carribault, and D. Barthou. PARCOACH: Combining static and dynamic validation of MPI collective communications, in International Journal of High Performance Computing Applications (2014): 1094342014552204, ACM, 2014. (pdf|bib)
J. Clet-Ortega, P, Carribault, and M, Pérache, Evaluation of openmp task scheduling algorithms for large numa architectures, in Euro-Par’14, volume 8632 of Lecture Notes in Computer Science, pages 596–607. Springer, 2014. (pdf|bib)
A. Mahéo, P. Carribault, M. Pérache, and W. Jalby, Optimizing collective operations in hybrid applications, in EuroMPI ’14, 2014, ACM. (pdf|bib)
E. Saillard, P. Carribault, and D. Barthou, Static validation of barriers and worksharing constructs in openmp applications, in 10th International Workshop on OpenMP (IWOMP 2014), Springer, 2014. (pdf|bib)
J. Jaeger, P. Carribault, and M. Pérache, Data-Management Directory for OpenMP 4.0 and OpenACC, in Euro-Par 2013: Parallel Processing Workshops. Springer Berlin Heidelberg, 2014. (pdf|bib)
S. Didelot , P. Carribault, M. Pérache and W. Jalby, Improving MPI communication overlap with collaborative polling, in Computing 2014, 96(4) : 263 – 278, Springer-Verlag New York, 2014. (pdf|bib)
2013
J.-B. Besnard, M. Pérache and W. Jalby, Event streaming for online performance measurements reduction, In 42nd International Conference on Parallel Processing (ICPP) 2013, pages 985–994, Oct 2013. (pdf|bib)
S. Valat, M. Pérache and W. Jalby, Introducing kernel-level page reuse for high performance computing, In Proceedings of the ACM SIGPLAN Workshop on Memory Systems Performance and Correctness, MSPC ’13, New York, NY, USA, 2013. ACM. (bib)
Z. Menyhart and M. Pérache, Method, computer program and device for managing memory access in a multiprocessor architecture of NUMA type, US Patent App. 13/993,665, octobre 3 2013. (pdf|bib)
E. Saillard, P, Carribault, and D, Barthou, Combining static and dynamic validation of MPI collective communications, in Proceedings of the 20th European MPI Users’ Group Meeting (EUROMPI’13). ACM, 2013. (pdf|bib)
2012
S. Didelot, P. Carribault, M. Pérache, and W. Jalby, Improving MPI Communication Overlap With Collaborative Polling, in European MPI Users Group Meeting (EUROMPI’12), Springer, 2012. (pdf|bib)
A. Mahéo, S. Koliaï, P. Carribault, M. Pérache, and W. Jalby. Adaptive OpenMP for Large NUMA Nodes, in International Workshop on OpenMP (IWOMP’12), 2012. (pdf|bib)
M. Tchiboukdjian, P. Carribault, and M. Pérache, Hierarchical Local Storage: Exploiting Flexible User-Data Sharing Between MPI Tasks, in IEEE International Parallel and Distributed Processing (IPDPS’12), 2012. (pdf|bib)
J.-Y. Vet, P. Carribault, and A. Cohen, Multigrain Affinity for Heterogeneous Work Stealing, in Programmability Issues for Heterogeneous Multicores. 2012. (pdf|bib)
P. Carribault, F. Diakhate, H. Jourdren, and M. Pérache. MPC: A unified parallel framework for hpc, in Chocs 41, 2012 (in french). (bib)
P. Ballereau, P. Carribault, F. Duboc, D. Dureau, C. Enaux, H. Jourdren, and M. Pérache, Adaptive mesh refinement methods and advanced programming models for high performance computing, in Chocs 41, 2012 (in french). (bib)
P. Carribault, M. Pérache, and H. Jourdren, Hiérarchie des données en parallélisme hybride, in Chocs Avancées 2011, pages 42-43, 2012 (in french). (pdf|bib)
2011
P. Carribault, M. Pérache, and H. Jourdren, Thread-Local Storage Extension to Support Thread-Based MPI/OpenMP Applications, in International Workshop on OpenMP (IWOMP’11), Springer, 2011. (pdf|bib)
2010
K. Pouget, M. Pérache, P. Carribault, and H. Jourdren, User Level DB: a Debugging API for User-Level Thread Libraries, in Workshop on Multithreaded Architectures and Applications (MTAAP’10), IEEE, 2010. (pdf|bib)
P. Carribault, M. Pérache, and H. Jourdren, Enabling low-overhead hybrid MPI/OpenMP parallelism with MPC, in International workshop on OpenMP (IWOMP’10), 2010. (pdf|bib)
V. Arslan, P. Carribault, C. Enaux, H. Jourdren, and M. Pérache, Calcul haute performance en transfert radiatif, in Chocs Avancées 2009, pages 38-39, 2010 (in french). (pdf|bib)
2009
M. Pérache, P. Carribault, and H. Jourdren, MPC-MPI: An MPI Implementation Reducing the Overall Memory Consumption , in 16th european PVM/MPI users group meeting (EuroPVM/MPI 2009), 2009. (pdf|bib)
F. Diakhaté, M. Pérache, R. Namyst and H. Jourdren, Efficient shared memory message passing for inter-vm communications, In Euro-Par 2013: Parallel Processing Workshops, pages 53–62. Springer-Verlag, Berlin, Heidelberg, 2009. (pdf|bib)
2008
S. Zuckerman, M. Pérache et W. Jalby, Fine tuning matrix multiplications on multicore, In High Performance Computing – HiPC 2008, volume 5374 de Lecture Notes in Computer Science, pages 30–41. Springer Berlin Heidelberg, 2008. (pdf|bib)
M. Pérache, H. Jourdren, and R. Namyst, MPC: a unified parallel runtime for clusters of NUMA machines, in Proceedings of the 14th international EURO-PAR conference (EURO-PAR 2008), 2008. (pdf|bib)
2005
M. Pérache, Nouveaux mécanismes au sein des ordonnanceurs de threads pour une implantation efficace des communications sur machines multiprocesseurs, in RenPar’16 (in french), 2005.