
Begin work on propagating checkpoint-readiness information. The nodes that hold the previous checkpoint and the nodes that hold the new one must be notified that the CPID and CPINST carried in the message are now the up-to-date values.

Romain LION, 5 years ago
Parent
Current commit
e7fb5c27d8
1 file changed, 4 insertions, 0 deletions

+ 4 - 0
mpi/src/mpi_failure_tolerance/starpu_mpi_checkpoint_template.c

@@ -21,6 +21,7 @@
 #include <starpu_mpi_private.h>
 #include <starpu_mpi_cache.h>
 #include <mpi_failure_tolerance/starpu_mpi_checkpoint_template.h>
+#include <mpi_failure_tolerance/starpu_mpi_checkpoint.h>
 
 
 #define MAX_CP_TEMPLATE_NUMBER 32 // Arbitrary limit
@@ -30,6 +31,7 @@ starpu_mpi_checkpoint_template_t cp_template_array[MAX_CP_TEMPLATE_NUMBER];
 int                              my_rank;
 int                              size;
 int cp_template_number = 0;
+struct _starpu_mpi_cp_ack_msg last_valid_checkpoint;
 
 typedef int (*backup_of_fn)(int);
 
@@ -37,6 +39,8 @@ void checkpoint_template_lib_init(void) {
 	starpu_pthread_mutex_init(&cp_template_mutex, NULL);
 	starpu_mpi_comm_rank(MPI_COMM_WORLD, &my_rank);
 	starpu_mpi_comm_size(MPI_COMM_WORLD, &size);
+	last_valid_checkpoint.checkpoint_id = -1;
+	last_valid_checkpoint.checkpoint_instance = -1;
 }
 
 void checkpoint_template_lib_quit(void) {
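
The commit only introduces the `last_valid_checkpoint` record and initialises both fields to -1; the propagation logic described in the commit message is not part of this diff. As a rough illustration of how such a record might be updated when an acknowledgement arrives, here is a minimal sketch. Everything in it is hypothetical: `struct cp_ack_msg` is a stand-in that mirrors only the two fields visible in the diff, `record_valid_checkpoint` is an invented helper, and the rule that a larger `checkpoint_instance` means a newer checkpoint is an assumption, not something the source states.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for struct _starpu_mpi_cp_ack_msg: only the two
 * fields that appear in the diff are reproduced here. */
struct cp_ack_msg
{
	int checkpoint_id;       /* CPID: which checkpoint template */
	int checkpoint_instance; /* CPINST: which occurrence of it */
};

/* Same "no valid checkpoint yet" state as the one set in
 * checkpoint_template_lib_init(): both fields are -1. */
static struct cp_ack_msg last_valid = { -1, -1 };

/* Hypothetical helper: remember msg as the last valid checkpoint.
 * ASSUMPTION: instances grow monotonically, so a message whose instance
 * is not strictly larger than the recorded one is treated as stale.
 * Returns 1 if the record was updated, 0 if the message was ignored. */
static int record_valid_checkpoint(const struct cp_ack_msg *msg)
{
	assert(msg != NULL);
	if (msg->checkpoint_instance <= last_valid.checkpoint_instance)
		return 0;
	last_valid = *msg;
	return 1;
}
```

Starting from the -1 sentinel means the first acknowledgement with a non-negative instance is always accepted, which is one plausible reason for the initialisation added in this commit.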