Neural Autoregressive Flows for Data-driven background estimation as demonstrated in a search for four-top quark production in the all-hadronic final state with CMS at 13 TeV
POSTER
Abstract
A novel machine-learning based background estimation technique using normalizing flows in order to estimate background distributions from data in control regions is described in detail. This is demonstrated in four-top quark production in the all hadronic channel in Run II (TOP-21-005), which was facilitated for the first time through the ability of this method to estimate dominant QCD multijets and tt ̄ backgrounds. The flow is able to reliably estimate the distribution of complex variables in signal regions by learning the transformation from input simulated distributions to target data distributions in control regions inspired by the "ABCD" method. This "ABCDnn" method as applied to the four-top all hadronic method is described, including discussion of selections and choice of control regions, validation of the method, estimation of uncertainties, and discussion of closure tests. A public git repository for training the network is also included. This background estimation method is applicable to other hadronic-dominated analyses where QCD multijets simulation is unreliable or statistically limited. This work is part of an ongoing paper (MLG-23-004) and is expected to be included in an anticipated Run 3 iteration of the all hadronic four-top analysis.
Publication: MLG-23-004
Presenters
-
Jet Z Yue
University of California San Diego
Authors
-
Jet Z Yue
University of California San Diego
-
Javier M Duarte
University of California, San Diego
-
Melissa K Quinnan
University of California, San Diego