Gradient Based Subsampling - Federated XGBoost

user1 · March 5, 2024, 1:16pm

Hi everyone,

What happens (in practice and theoretically) when enabling gradient_based subsampling (see parameters for more information)?

From my understanding, gradient based subsampling is based on Mimimal Variance Sampling (MVS). My questions are:

Is the subsampling based on gradient information from each client or global information?
Can it be used for adaptation to local client data?
Can we prove convergence using MVS in a federated setting?

Hope to start a great discussion!

danielnata · March 5, 2024, 4:11pm

Hi,
great question and welcome to the community! XGBoost with sampling in federated setting is an open research question and we are only aware of this paper.

Topic		Replies	Views
Federated Baseline Contributions! General	6	190	May 9, 2025
FedDF (Baseline Suggestion) Flower Baselines baseline	0	40	January 17, 2025
Paper on bagging aggregation strategy used for XGBoost? Research tree	0	164	February 29, 2024
Does Flower support XGBoost training? What’s the bagging strategy based on? Flower Framework faq , tree	4	146	March 1, 2024
Hierarchical Federated Learning Contributions flower	3	150	June 2, 2025

Gradient Based Subsampling - Federated XGBoost

Related topics