Annual Report
Report Period: April 2016 - April 2017
PhD Year: 1 2 X 3 4 ≥ 4
PhD Supervisor / Working Group: Prof. Arnulf Quadt, Prof. Vincenzo Cavasinni Thesis Committee: 1. Prof. Arnulf Quadt
2. Prof. Stan Lai
3. Prof. Vincenzo Cavasinni
by
Antonio De Maria
ademari@phys.uni-goettingen.de
CONTENTS i
Contents
1 Measurement of the Higgs boson production cross section 1
1.1 Motivation and Outline . . . . 1
1.2 Events Preselection and Background estimation . . . . 1
1.3 Fake-Factor method to estimate jet faking hadronic taus . . . . 3
1.4 Signal Regions definition . . . . 4
1.5 Missing Mass Calculator retuning for Run2 analysis . . . . 5
1.6 Fit Model . . . . 6
1.7 Outlook and Plans for the Next 12 Months . . . . 6
2 Code development for xTauFramework and FitBox 6 3 Teaching 7 4 Lectures 7 5 Given Talks 8 6 Attended Schools and Conferences 8 7 OTP Shifts 8 7.1 Future OTP activities . . . . 9
8 Further Activities 11
1 PhD Topic: Measurement of the Higgs boson production cross section at 13 TeV in H → τ τ →lep-had decay with the ATLAS detector
1.1 Motivation and Outline
The ATLAS and CMS experiments discovered a Higgs boson in 2012, consistent with the last missing elementary particle in the Standard Model (SM) of electroweak and strong interactions.
Several properties of this boson were measured with 7 and 8 TeV centre-of-mass energy ( √ s) proton-proton (pp) collision data delivered by the Large Hadron Collider (LHC) in 2011 and 2012, respectively. These measurements have not shown significant deviations from the SM expectations. In particular, the coupling of the Higgs boson to the fermion sector has also been established with the observation of the H → τ τ decay mode with a signal significance of 5.5 σ from the the combined ATLAS and CMS 7 and 8 TeV datasets.
Using the collision data available now at √
s = 13 TeV, the detailed program of the Higgs boson property measurements will be extended to reach a higher precision compared to 7 and 8 TeV analyses due to the expected increase of data statistics and the expected increase in the Higgs boson production cross section. The H → τ τ channel will continue to play an important role in terms of measurements of the Higgs boson couplings to τ leptons as well as measurements of the other properties of the Higgs boson, such as its charge-parity (CP) quantum numbers. In the following section, the H → τ τ → lep-had final state will be presented, where lep is referred to muon or electron produced in the leptonic τ decay.
1.2 Events Preselection and Background estimation
Events are selected in order to have exactly one light lepton (electron or muon) and at least one hadronic taus. Among the selected taus, the tau having higher Pt (referred as leading τ ) is selected to be the τ -candidate. The requirement of having exactly on light lepton reduces the contamination of Z → ll events. At this stage of the selection, there are no requirements about the reconstruction quality of the τ (ID ) due to Fake-Factor background estimation method, which is described in the next section.
After these selections, a trigger selection is applied depending on the flavour of the selected light lepton. Moreover, a matching between the trigger and the selected lepton is performed. Actually the analysis is mainly selecting events using the lowest not prescaled Single-Lepton Trigger (SLT ) for both 2015 and 2016 datasets. Beside SLT triggers, an effort is ongoing to insert also a Tau + Lepton Trigger (TLT), which is mainly used to selected events in the low lepton pt region.
In this case, both the lepton and the τ -candidate are required to match the different legs of the combined trigger. The two trigger regions, SLT and TLT, are selected according the lepton transverse momentum and are mutually exclusive in order to avoid events double counting, as shown in fig. 1 .
In addition to these lepton and trigger selection criteria, the following preselection requirements are then imposed:
• lepton requirements:
– gradient isolation, this is required in order to reduce QCD contamination in signal regions;
– medium quality reconstruction;
– transverse momentum greater than a threshold specified according to the used triggers;
1 MEASUREMENT OF THE HIGGS BOSON PRODUCTION CROSS SECTION 2
Figure 1: trigger scheme SLT+TLT
• τ requirements:
– medium quality reconstruction;
– |η| < 2.4 (with an exclusion in the crack region, 1.37-1.52), |q| = 1, p
T> 20 GeV;
– for MC samples, the truth PDG ID of the tau candidate (τ
id), must satisfy the con- ditions |τ
id| > 6 and τ
id6= 21. These conditions remove jets faking taus, which are modelled by the fake-factor method;
• the charges of the lepton and hadronic τ must have opposite sign;
• application of the b-jet veto. This veto removes events where there is at least one b-tagged jet, in order to reduce t¯ t events in the signal region;
• the trasverse mass (m
T) between the lepton and the missing transverse energy (MET ) must be less than 70 GeV. This cut is required in order to reduce the W+jets events in the signal region.
All the backgrounds can be classified into three major categories:
• events with true lepton and τ
hadsignatures – Z/γ
∗, Dibosons, Top
• events where a jet fakes a τ
hadsignature – QCD jets, W+jets, Z → ll+jets, Dibosons
• Events where a light charged lepton fakes a τ
hadsignature – Z → ll + jets
Different control regions are built to select a chosen background inverting the requirement used
to remove these backgrounds from the signal region. A summary of the different control regions
is reported in the tab 1 .The control regions allow the validation of the selections performed to
separate signal and backgrounds.
Control region cuts
Z → ll Two leptons with same flavour Top enriched invert b-veto, m
T> 40 GeV
W enriched m
T> 70 GeV QCD enriched invert lepton isolation
Table 1: CR selections
1.3 Fake-Factor method to estimate jet faking hadronic taus
The background from jets faking taus is a dominant background for the lep-had channel. Con- sisting mostly of W+jets events, fake taus also come from QCD, t¯ t and Z+jets events. The jet faking a tau is different if the jet is quark-initiated or gluon-initiated. The quark-gluon frac- tion is different for different samples, but also dependent on the selection made. Therefore, it’s important to get fake estimate from regions as close as possible to the signal regions. In order to estimate jet faking tau background, the Fake-Factor is actually adopted as default method;
keeping all other selections the same, including the trigger selections and the requirement of the tau-candidate, the medium ID requirement is inverted, giving the definition for an anti-τ . It should be noted that a BDT score cut of 0.35 is applied to ensure that the anti-τ s are sufficiently signal-like for this method since at low BDT score the quark-gluon fraction changes significantly.
Events which contain a real-tau and a fake-lepton are not considered by this method. Further- more, events where taus are faked by electrons are also not considered by this method, as they are accounted for using a dedicated Z → ee tag-and-probe analysis performed by the TauWG. As result of this analysis, two strategies to reject electrons faking taus are provided, one based on a likelihood rejection and the other one based on BDT discriminant; scale factors and systematics are also available for both methods.
The fake estimate is determined in the following way: the number of anti-τ s are taken from data and events from Monte Carlo backgrounds such as Z → τ τ where the anti-tau is not matched to a truth jet are subtracted from this. This is then multiplied by a fake-factor, which is binned in τ transverse momentum and the number of tracks. From this, the estimate for each signal region can be obtained:
N
f akesSR= (N
Dataanti−τ− N
M C,not→tauanti−τ) × F F (1) The combined fake-factor for each signal region is constructed as the sum of the individual fake- factors (F F
i) for each relevant process i, weighted by it’s expected fraction of evetns in the anti-τ region (R
i):
F F = R
WF F
W+ R
ZF F
Z+ R
T opF F
T op+ R
QCDF F
QCD(2) The individual fake-factors (F F
i) are determined in dedicated control-regions for each process, separately for each analysis category. The definitions of the control regions is presented in table 1. It may be noted that these control regions are all defined to be very close to the corresponding signal region, inverting only some cuts to preserve the orthogonality respect to the signal region.
Each control region is then further split into a pass and fail region, depending on if the τ -
jets passed or failed the medium requirement. The signal contribution in the control regions is
assumed to be negligible, as is the QCD-multijet contamination in all CRs expect the one for
QCD. The individual fake-factor F F
iis then obtained in the corresponding CR as the ratio of
data events that pass the tau ID requirement over those that fail it. Contributions from other
1 MEASUREMENT OF THE HIGGS BOSON PRODUCTION CROSS SECTION 4
processes that the one in question (denoted as not-i as well as events where the τ is not faked by a jet (denoted as notj → τ ) are each case subtracted from the data yield:
F F
i= N
datapass,CR−i− N
M C,not−ipass,CR−i− N
M C,notj→τpass,CR−iN
dataf ail,CR−i− N
M C,not−if ail,CR−i− N
M C,notj→τf ail,CR−i(3) The expected fraction of events in the anti-τ region (R
i) is determined from MC (except for R
QCDwhich is obtained as R
QCD= 1 − sumR
i:
R
i= N
i,M C,j→τf ail,SRN
Dataf ail,SR− N
,M C,notj→τf ail,SR(4)
1.4 Signal Regions definition
To exploit signal-sensitive event topologies, two inclusive analysis categories are defined in an exclusive way. The VBF category targets events with a Higgs boson produced via vector boson fusion and it’s characterised by the presence of two high-p
Tjets with a large pseudorapidity separation. Although this category is dominated by VBF events, is also includes smaller contri- butions from ggF and VH production. The boosted category targets events with a boosted Higgs boson produced by the ggF mechanism. Higgs boson candidates are therefore required to have a large transverse momentum. These inclusive categories are further split into multiple signal regions to improve the sensitivity to Higgs boson prediction.
The VBF inclusive category is defined by the following requirements:
• leading jet p
T> 40 GeV
• sub-leading jet p
T30 GeV
• leading and sub-leading jets must be well separated; this is achieved requiring:
– jets ∆η > 3
– jets must be in opposite hemispheres
• visible mass of the jets m
jj> 300 GeV
• the minimum jet η must be less than the light lepton or hadronic τ η, and the maximum jet η must be greater than the light lepton or hadronic τ η; this requirement is usually referred as centrality
• the MET must be greater than 20 GeV
• the |∆η| and ∆R| between the lepton and the τ are required to be less than 1.5 and 3.0, respectively
The VBF category is subsequently split into two further categories called VBF Tight and VBF Loose. The VBF Tight is defined applying the following cuts on top of the VBF region definition:
• the mass of the two jets must be greater than 500 GeV.
• the p
Tof the sum of the four-momenta of the lepton, τ and MET (usually referred as Higgs
p
T, must be greater than 100 GeV)
• the p
Tof the τ must be greater than 30 GeV
For the VBF Loose categorisation, the event must pass the preselection and the VBF inclusive category, but fail the VBF Tight selections.
The ggH Boosted inclusive category is defined by the following requirements:
• events should fail VBF inclusive requirements
• the p
Tof the Higgs must be greater than 100 GeV
• the MET should be greater than 20 GeV
• the p
Tof the τ should be greater than 30 GeV
• the |∆η| and ∆R| between the lepton and the τ are required to be less than 1.5 and 2.5, respectively
The Boosted category is subsequently split into two further categories called Boosted High and Boosted Low. The Boosted High is defined applying the following cuts on top of the Boosted region definition:
• the p
Tof the Higgs must be greater than 140 GeV
• the ∆R between the lepton and the τ is required to be less than 1.5
For the Boost Low categorisation, the event must pass the preselection and the Boosted Inclusive category, but fail the Boost High selection.
1.5 Missing Mass Calculator retuning for Run2 analysis
A key variable of this analysis is the invariant mass of the taus arising from the Higgs decay. An accurate reconstruction of a resonance mass decaying into a pair of tau leptons is a difficult task because of the presence of multiple undetected neutrinos from the tau decays. The Missing Mass Calculator (MMC) is a sophisticated method to optimise the di-τ invariant mass reconstruction.
It is based on the requirement that mutual orientations of the neutrinos and other decay products
are consistent with the mass and decay kinematics of a tau lepton. This is achieved by minimizing
a likelihood function defined in the cinematically allowed phase space region. MMC was already
one of the most powerful tools used in SM-Higgs to tau tau searches in Run1 at LHC. During
Run2, many efforts need to be done to optimise the analysis tools to the new experimental
conditions. Among these tools, MMC requires to be retuned in order to continue play a key role
again in the searches of the Higgs boson in di-tau final states. During this year, I was involved
into the MMC retuning for the H → τ τ → lep − had channel. I have already performed such
a type of studies last year for H → τ τ → had − had channel and my previous results have
been fully confirmed by new similar studies. Results for the lep-had channel were presented in
the Mass Task Force bi-weekly meetings. The retuning procedure has been tested and the final
results are comparable with Run1 results. As final results of these studies, a new retuned version
of the MMC has been released and it’s currently used in the analysis.
2 CODE DEVELOPMENT FOR XTAUFRAMEWORK AND FITBOX 6
1.6 Fit Model
A maximum likelihood fit is performed to extract the parameter of interest (POI):
µ = σ
H× BR(H → τ τ → lep − had) σ
H× BR(H → τ τ → lep − had)
SM