Particle Physics 2018 Final Exam (Answers with Words Only) - Inside Mines

Page created by Ruby Garner

Science

English

Like
Share
Embed
Fullscreen
Slides
Download HTML
Download PDF
Abuse

←

→

Page content transcription

If your browser does not render page correctly, please read the page content below

Particle Physics 2018 Final Exam (Answers with Words Only)
This was a hard course that likely covered a lot of new and complex ideas. If you are feeling as if
you could not possibly recount all of the details of what we have done, please don't worry. The
truth is that no one usually does. Rather what happens is that we study something like this, learn
it a bit, then feel like we forgot it all. That is, until we go to learn or use it again later, in which case
we realize that it is all a bit easier the second time through. I do not expect that any of you will
leave this course knowing how to calculate a Feynman amplitude "off the couch", nor should you
be able to recite the intricate form of a Higgs'ed Lagrangian. But what I do want you to leave this
class with is an ability to talk about the big picture of what you saw and worked with. So to that
end, I want you to answer each of the questions below qualitatively. There is no need to pull out
fancy equations. Imagine you are trying to explain to an undergraduate physics major what types
of things they could learn about in particle physics. If you are not enjoying this exercise, then you
are missing the point.

1) (10 pts) What is a group? What is a representation of a group? Why are either of these
important for particle physics?

A group is set of objects together with a composition rule for combing two of the objects that
satisfies the following: Closure (the composition of any two elements of the group should return
an element of the group), Identity (the group should contain one element such that when it is
composed with any other element of the group it returns that same element), Inverse (for any
element of the group there is an element which composes with it to form the identity),
Associativity (for a specified order of three group elements, the composition of all three elements
is independent of the order in which the composition is performed).

For a given group we can associate each element to an object to form a representation. If each
group element is mapped to a distinct object then the representation is called "faithful". If more
than one group element is mapped to the same object, the representation is called "unfaithful".
For groups whose elements are transformations themselves, it is often useful to construct linear
(or matrix) representations of the groups for ease of calculation. In this case the objects are
represented by column matrices and the group elements (transformations) are represented by
square matrices.

In particle physics we employ the notion of symmetry to a) simplify calculations, b) enforce the
structure of special relativity and c) deduce the interaction terms in the Standard Model
Lagrangian. To formally describe symmetries, we first need a mathematical means of describing
transformations, and that is where groups become important, i.e. we consider sets of
transformations that form groups. However in physics we typically start with the objects that
these transformations act on, so we usually start with the representations themselves. Another

way to say this is that degrees of freedom of a physical theory must be in some representation of
the group of symmetries being used.

2) (10 pts) What is special relativity?

Special relativity is a replacement of Galilean relativity that is applicable at any relative speed (the
speed connecting two frames of reference), but reduces to Galilean relativity for relative speeds
small compared to the speed of light. It's mathematical structure can be described by starting
with a four‐dimensional space‐time where the distance between two space‐time events is
determined in terms of the rectangular space‐time coordinates of those events (where the time
coordinate is corrected by a constant factor to have spatial dimensions) and the space‐time
metric which takes the form of (almost) the identity with a relative sign difference between its
time and spatial terms. Then one simply posits that the forms of the laws of physics are invariant
under any coordinate change which preserves the form of this metric, e.g. the form of the
Lagrangian is unchanged. Teasing out the explicit form of these transformations reveals invariance
under space‐time translations, spatial rotations and space‐time boosts (altogether forming the
Poincaré group). From this starting point, one obtains all of the love of special relativity including
the constancy of the speed of light (or any massless degree of freedom) to all inertial observers,
the relativity of time, length contraction, the speed of light as an ultimate limit, relation of
energy/momentum/mass, a meaningful description of the behavior of massless degrees of
freedom, and a refinement of the notion of causality

3) (10 pts) What is a spinor?

When considering the case of spatial rotations in three dimensions, we typically start by looking at
how intuitive quantities with a (space‐related) magnitude and possibly direction are transformed.
These include scalars, vectors and other tensors, all of which are linear representations of the
rotation group. While the scalar is not very useful in determining the structure of the group (since
it is extremely degenerate), we also find that higher rank tensors can be understood in terms of
their vector indices. So in the end we can just focus on the vector representation itself. Then we
can deduce that the rotation group is most simply described by three‐by‐three real orthogonal
matrices of unit determinant which act on three real component column vectors. To generalize
this, we abstract the definition of the group in question (rotations in three dimensions) by taking
three "independent" matrices (corresponding to rotations in any three mutually orthogonal
planes) and considering their infinitesimal form. At this stage we are "undoing" the exponential
map used for Lie groups to build finite transformations from the generators of infinitesimal
transformations. This will give us matrix expressions for the generators, from which we can then
determine the Lie algebra by considering the commutators of all pairs of the generators.

The next step is to use the Lie algebra itself as a means of "defining" rotations in three dimensions.
With this in mind, we are now free to seek any other set of generators that satisfy this same Lie
algebra, and infer that they must then somehow be connected to objects which can be "rotated"
in three dimensions. Among the possible solutions (most are three‐by‐three or higher), there turns
out to be a set of generators described by complex two‐by‐two matrices. While at this point it is
often pointed out that the set of generators is given by the Pauli matrices (which also satisfy a
Clifford algebra related to the spatial metric), it is more appropriate (and important later) to
identify that they are expressible as commutators of Pauli matrices. If we take this set of
generators and then apply the exponential map to build up the set of finite transformations, we
find that these take the form of complex two‐by‐two unitary matrices with unit determinant which
then act on complex two component objects (represented by a column matrix) which we call
spinors. Note that I never said anything about "vectors" here. They are just column matrices to be
acted on by the "spinor" version of rotation operators. Altogether, this forms the spinor
representation of rotations in three dimensions. Interestingly, we find that even though the vector
and spinor representations share the same Lie algebra, they are not the same (isomorphic) as
groups. Recall that the Lie algebra of a group describes the behavior of transformations which are
infinitesimal. If we take a large enough transformation, e.g. rotation through 2 , we find that
vectors return to their original state while spinors pick up a minus sign. Another way to say this is
that for any finite rotation by an angle , there is only one result for vectors, while for spinors
there are two possible outcomes: one if we rotate by , and another (with a relative minus sign) if
we rotate by 2 . So we can see that as representations of rotations in three dimensions, the
spinor representation is more faithful than the vector.

All of this can be extended to the case of Lorentz transformations in four‐dimensional space‐time.
In this case we find that for vectors, the corresponding transformations are enacted by four‐by‐
four real matrices which have unit determinant and are orthogonal with respect to the metric of
space‐time (which means that you have a metric factor on each side of the usual orthogonality
condition). Again, we can take the infinitesimal form of an independent set of six of these
matrices (one for each mutually orthogonal plane in four dimensions) and deduce the Lie algebra.
In this case the resulting Lie algebra is complicated because it takes different forms for different
pairings of generators. We can simplify it by using certain linear combinations of the generators to
arrive at a Lie algebra that "factorizes" into two independent algebras, each of which takes the
form of a three dimensional rotation algebra. Then we can immediately carry over the earlier
results and form two‐component spinor representations of each, which must be tensored
together to form the spinor representation of the entire Lorentz group. Tensoring together two‐
component representations results in a 2 dimensional representation, which in this case means
our spinor has four components. But one should note that these are not in any simple way related
to the four components of the vector representations. The generators in the spinor representation
are given by commutators of the Dirac matrices which themselves satisfy a Clifford algebra which

is related to the metric of space‐time. From these generators we can then use the exponential
map to build up the finite form of Lorentz transformations as they act on spinors.

4) (10 pts) How are the Dirac, Proca and Klein‐Gordon equations related?

The Klein‐Gordon equation is a relativistic field equation that is appropriate for scalar fields where
the only degree of freedom is the energy‐momentum of the field configuration, and the KG
equation simply imposes that these degrees of freedom satisfy the relativistic mass‐energy
relation. The Dirac equation on the other hand is relevant for spinor fields where we have
additional degrees of freedom coming from the intrinsic angular momentum (or spin) of the field.
Since Dirac fields also carry energy‐momentum, they must also satisfy the mass‐energy relation
and in fact we find that each component of the Dirac spinor field separately satisfies the KG
equation. So in a sense, the Dirac equation contains the information in the KG equation, but also
provides additional constraints between the various components of the spinor field. Similarly, the
Proca equation, valid for vector fields, contains the information in the KG equation, but also
imposes constraints between the components (or polarizations states) of the vector field.

5) (10 pts) What is a gauge theory?

A gauge theory is a physical theory where the form of the interaction terms is determined by a
principle of local (or gauge) invariance under some continuous symmetry group. To build a gauge
theory, one usually starts with a non‐interacting theory, i.e. only (Dirac or Klein‐Gordon) kinetic
terms for the matter fields, that possesses a global symmetry (where the same transformation is
enacted at each point in space‐time) and then promotes this to a local symmetry (where the
transformation can differ at different points in space‐time). To maintain invariance, one modifies
the derivative (space‐time gradient) in the matter kinetic terms to a new covariant form with the
addition of (dual) vector gauge fields. One can determine the required transformation rule for the
new gauge fields such that the new matter field kinetic term is invariant. Expanding out this new
matter kinetic term, one observes terms that involve products of the original matter field and the
new gauge fields which implies that there is an interaction between them. The final step is to
allow the gauge field to propagate by providing it with its own (gauge invariant) kinetic term. Since
the gauge field is a (dual) vector one uses the Proca Lagrangian. The Proca term itself will be gauge
invariant only if the gauge field is massless. The field strength that appears in the Proca term can
be determined by considering the action of the commutator of the covariant derivative with itself
(with two different space‐time indices). In the case of a non‐abelian gauge group, owing to the
non‐vanishing structure constants, the gauge field strength (and hence the kinetic term) will
involve products between the various gauge fields and thus leads to interactions of the gauge
fields with each other. Such gauge‐gauge interactions are absent in the abelian case, e.g. QED.

6) (10 pts) Explain the Higgs mechanism. Why do we need it? What does it do?

The unified electroweak symmetry group acts differently on the left and right handed chiral states
of matter in the Standard Model. However, to form matter mass terms we must take bilinear
combinations of left and right chiral states. So we find that the matter mass terms are not
invariant under the combined electroweak symmetry group. This despite the fact that we see the
fundamental fields as having mass. Additionally, we observe that the weak interaction bosons
have nonzero mass, which (as stated in the answer to the previous question) spoils invariance of
the gauge kinetic term under the gauge symmetry group.

To address both of these issues, the idea is that the masses we see in the low‐energy form of the
Standard Model Lagrangian are not true fundamental mass terms, but rather are the residual
manifestation of an interaction between the matter and gauge fields with an additional scalar field
called the Higgs. The way this works is that at some higher energy (early time) state of the
universe, the Higgs potential, which itself had a (hard to draw) invariance under the electroweak
symmetry group, afforded a stable solution for a symmetric field configuration. Studying the
fluctuations of all fields around this solution results in massless matter and gauge fields interacting
with the Higgs, e.g. matter‐matter‐Higgs terms, and manifest invariance (linearly realized) under
the electroweak symmetry. At some lower energy (later time) the potential deformed and the
symmetric solution became unstable, so the Higgs field transitioned to a new but non‐symmetric
stable solution. Studying the fluctuations of all fields around this new solution results in nonzero
mass terms for the matter and (some) gauge fields due to their coupling to the Higgs which now
has a nonzero constant term as part of its background configuration, i.e. leading to matter‐matter‐
(constant+Higgs) which includes matter‐matter‐constant mass terms. Parts of the originally
manifest electroweak symmetry group are now obscured (realized nonlinearly) and we call the
gauge symmetry spontaneously broken. Fluctuations of the Higgs field along the original
symmetry directions are referred to as Goldstone modes and are eaten by the gauge fields which
acquire mass in order to provide the extra polarization state they need compared to their massless
version.

7) (10 pts) Explain what you are doing when you draw and evaluate a Feynman diagram.

A single Feynman diagram represents one realization of how to connect the incoming and
outgoing states in a given physical process. The initial incoming and final outgoing states are "free"
and the diagrams connecting them represent their interactions. The collection of all such diagrams
for a given process is used to calculate the amplitude which encodes the quantum description
of the dynamics (interactions and propagation of virtual states) involved. As per the "path‐
integral" approach to quantum mechanics, we must sum over all possible ways of connecting the
initial and final states. For coupling values less than one, we can get away with considering less
complicated diagrams (with a small number of vertices) and calculate to some order (including

all diagrams up to and at that order) which can a provide an estimate for . The amplitude can
then be used in Fermi's golden rule to evaluate decay rates and scattering cross‐sections.

When evaluating a given diagram, we are essentially using a set of rules to assign a (complex)
number to the diagram. The rules include writing down factors which describe the spin states of
incoming and outgoing particles, vertex factors and momentum conserving delta‐functions for
each interaction vertex, and propagators for internal virtual particles. The order of elements is
written down in a manner such that overall expression is a Lorentz and gauge scalar, e.g.
constructing spin and color sandwiches. Finally one performs integrals over all internal momenta.
We can use all but one of the delta functions to evaluate some of the internal momentum
integrals. The final delta function will involve only external momenta and should be eliminated to
avoid redundancy with the delta function that appears in the golden rule.

When adding up contributions from different diagrams, we must anti‐symmetrize the sum of any
diagrams that are related by the exchange of incoming (outgoing) identical particles or anti‐
particles, or incoming (outgoing) particles with their outgoing (incoming) anti‐particle partners.

8) (10 pts) What are the similarities/differences between QED, QCD and the weak interactions.

Common to all three: The forms of all three interactions are (more or less) based on principles of
local gauge invariance (see the answer to 5 above). Thus all three interactions have a basic
interaction vertex that includes two matter fields and one gauge field.

Common to QED and QCD and different for the weak interactions: Both QCD and QED are based
on manifest symmetries observed at low energies and hence have massless gauge bosons. The
weak interactions are the broken part of the electroweak interaction (see the answer to 6 above)
and hence have massive gauge bosons. For the basic QED and QCD vertices, the flavor of the two
matter fields is the same, while for the weak interactions mediate by the the flavor of the two
matter fields is different. The latter is why the charged weak interactions are responsible for
particle decay. One could be more technical and argue that two quarks of the same flavor but
different color are in some sense different "flavors" and then conclude that some of the QCD
vertices change "flavor". But this misses the important point that fields of different flavor tend to
have different masses, while two quarks that only differ by color would have the same mass.

Common to QCD and the weak interactions and different for QED: Both the weak interactions and
QCD are based on non‐abelian symmetry groups and hence include interactions between the
gauge bosons themselves. QED on the other hand is based on an abelian group which is why
photons do not directly interact with each other. Both QCD and the weak interactions are also
effectively short range interactions compared to QED, but for different reasons. The weak
interactions are short ranged due to the large mass of the mediating gauge bosons. QCD on the
other hand is short ranged due to confinement which dictates that quarks only appear in color

singlet bound states, and hence the interactions we see between these hadrons are indirect at
best.

Common to QED and the weak interactions and different for QCD: At low energies the couplings of
QED and the weak interactions are small enough to make perturbation theory (see the answer to
7 above) a useful approach, while the QCD has a coupling greater than one and hence must be
treated non‐perturbatively. All of these couplings change with energy scale (see answer to 9
below) and for QCD in particular, the coupling decreases with increasing energy leading to
asymptotic freedom and the ability to use perturbation theory effectively for QCD at high
energies.

Differences between all three: QED only involves the electrically charged fields of the Standard
Model, i.e. all quarks and the charged leptons, QCD only involves the quarks, and last but not least
the weak interactions involve every single matter field. Oh, and they all have different names.

9) (10 pts) WTF is renormalization, and why does it matter?

In a nutshell, the program of renormalization involves taking a divergent Feynman amplitude,
regularizing it in some way to split it into a finite piece and a piece which diverges when the
regulator is taken away, then explaining away the divergent piece in terms of shifts to inaccessible
"bare" parameters.

At the heart of renormalization is the idea that we often define quantum field theories in terms of
fundamental parameters, e.g. masses and couplings of fields, whose true values we do not
actually know and which must be "corrected" in order to obtain effective field theories whose
content can be evaluated by leading order diagrams in the Feynman expansion. Often these
corrections can come in the way of infinities cancelling infinities which was bothersome at first but
has since become widely accepted.

There ends up being three values of any parameter that are of interest:

 The bare (or fundamental) value is what we could probe if we could do experiments at
arbitrarily high energy, which of course we can't do and so we do not have experimental
access to these values. In some situations it seems that these values may need to be
infinite to provide appropriate cancellations.
 The zero momentum transfer value is what we experimentally measure with low‐
energy/large distance experiments. These are the values which are cataloged in reference
tables and are always finite.
 The parameter value at a particular momentum transfer scale is what we use to simplify
calculations.

When doing calculations we use these in different ways. If we are doing a calculation for a process
where a finite nonzero momentum is exchanged then:

       Working in terms of the bare parameters often leads to divergences from diagrams with
        internal loops which must be cancelled by redefinitions of the parameters which involve
        cancellations between infinities.
       Working in terms of the zero momentum transfer values will require that we include higher
        order diagrams (including loop contributions), but now these will be finite since we are
        working between two experimentally accessible regimes.
       Working in terms of the parameters at the momentum scale of the process at hand means
        that we can evaluate only the leading order Feynman diagram in order to get the correct
        (not approximate) answer.

10) (10 pts) For those who begged….draw a turtle.

There were a lot of great turtles, all of which will make it on to my Particle Physics '18 turtle collage. But the
winner is from Madi Clark:

You can also read