Hello,
I am trying to use Flower in real computer cluster. I realise that with the setting of strategy for example fedAvg, the server will waiting until get the feedback from at least a number of clients min_clients (fit, eva, avai,…).
It means the server still waiting even all clients was crashed (no responde).
Right now I am using manual kill to stop the server. I wrote a script to kill server/clients in failure by frequency check the output of log file but it is a bit annoying to use.
I tried to see the topic about “drop_out” but seem it does not solve my issue. Do we have any “auto dectect and kill task” in this case? For example set a time limit for server waiting?