Opened 8 years ago

Closed 5 years ago

#1145 closed bug (wontfix)

Add FTB integration to better detect failed processes

Reported by: buntinas
Owned by: buntinas
Priority: major
Milestone: future
Component: mpich
Keywords:
Cc:

Description

Currently, nemesis detects and handles communication errors. However, communication errors are not a reliable way to detect process failures. This ticket tracks integration with the Fault Tolerance Backplane (FTB) so that the MPI library can detect process failures directly.
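To make the distinction concrete, here is a minimal sketch in plain C. It is a toy model, not MPICH, nemesis, or FTB code: every name in it (`job_state`, `job_kill`, `try_send`, `on_failure_event`, `is_known_failed`, `NUM_RANKS`) is hypothetical. It illustrates one possible reason indirect detection can lag, assuming a failure is only noticed when a send to the dead peer is attempted, whereas an explicit failure notification marks the rank as failed right away.

```c
#include <assert.h>
#include <stdbool.h>

#define NUM_RANKS 4 /* hypothetical job size */

/* Hypothetical model: `alive` is the ground truth, `known_failed` is
 * what the library has learned so far. */
typedef struct {
    bool alive[NUM_RANKS];
    bool known_failed[NUM_RANKS];
} job_state;

/* Start with every rank alive and no known failures. */
void job_init(job_state *js)
{
    for (int r = 0; r < NUM_RANKS; r++) {
        js->alive[r] = true;
        js->known_failed[r] = false;
    }
}

/* Simulate a process dying. The library is not told about it. */
void job_kill(job_state *js, int rank)
{
    assert(rank >= 0 && rank < NUM_RANKS);
    js->alive[rank] = false;
}

/* Indirect detection: the failure is learned only when a send to the
 * dead peer fails. Returns 0 on success, -1 on a communication error. */
int try_send(job_state *js, int peer)
{
    assert(peer >= 0 && peer < NUM_RANKS);
    if (!js->alive[peer]) {
        js->known_failed[peer] = true;
        return -1;
    }
    return 0;
}

/* Direct detection: an external notification names the failed rank,
 * so no communication attempt is needed. */
void on_failure_event(job_state *js, int failed_rank)
{
    assert(failed_rank >= 0 && failed_rank < NUM_RANKS);
    js->known_failed[failed_rank] = true;
}

/* Query what the library currently believes about a rank. */
bool is_known_failed(const job_state *js, int rank)
{
    assert(rank >= 0 && rank < NUM_RANKS);
    return js->known_failed[rank];
}
```

In this model, after `job_kill(&js, 2)` the rank stays unknown as failed until either `try_send(&js, 2)` returns an error or `on_failure_event(&js, 2)` is delivered. The second path is the kind of direct notification this ticket asks for.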

Change History (3)

comment:1 Changed 7 years ago by balaji

  • Milestone changed from mpich2-1.4 to mpich2-1.5

Hasn't this been fixed in our recent work with Hydra?

comment:2 Changed 7 years ago by buntinas

  • Milestone changed from mpich2-1.5 to future

comment:3 Changed 5 years ago by balaji

  • Resolution set to wontfix
  • Status changed from new to closed