On the Predictive Power of Chemical Concepts - Ingenta ...

Page created by Rachel Solis
 
CONTINUE READING
On the Predictive Power of Chemical Concepts - Ingenta ...
Laureates: Junior Prizes of the SCS Fall Meeting 2020                                                                              CHIMIA 2021, 75, No. 4    311
                          doi:10.2533/chimia.2021.311                                                           Chimia 75 (2021) 311–318 © S. A. Grimmel, M. Reiher

                          On the Predictive Power of Chemical
                          Concepts
                          Stephanie A. Grimmel§ and Markus Reiher*

                          §
                           SCS-DSM award for best virtual poster in Computational Chemistry

                          Abstract: Many chemical concepts can be well defined in the context of quantum chemical theories. Examples
                          are the electronegativity scale of Mulliken and Jaffé and the hard and soft acids and bases concept of Pearson.
                          The sound theoretical basis allows for a systematic definition of such concepts. However, while they are often
                          used to describe and compare chemical processes in terms of reactivity, their predictive power remains unclear.
                          In this work, we elaborate on the predictive potential of chemical reactivity concepts, which can be crucial
                          for autonomous reaction exploration protocols to guide them by first-principles heuristics that exploit these
                          concepts.
                          Keywords: Automated mechanism exploration · Chemical concepts · Conceptual DFT · Reactivity prediction

                      Stephanie A. Grimmel completed her BSc                       stable intermediates, and products that are connected by transition
                      in Chemistry in Karlsruhe (Germany) in                       states. In the most basic formulation of transition state theory, it is
                      2016. Afterwards, she studied Physics                        necessary to identify the stationary points, i.e. local minima and
                      and Chemistry in Amsterdam (The                              first-order saddle points, on a potential energy surface to arrive
                      Netherlands), Rome (Italy) and Lyon                          at such a reaction network at full conformational resolution.
                      (France). For her master’s thesis she worked                 The mathematical definition of the potential energy surface
                      on the optimization of kinetic energy                        may be taken from the Born-Oppenheimer approximation which
                      functionals for subsystem DFT under the                      freezes the nuclear motion compared to that of the electrons and
                      supervision of Prof. Dr. Lucas Visscher at                   introduces the electronic energy. The electronic energy depends
                      Vrije Universiteit Amsterdam. Since 2018                     on the positions of the nuclei frozen in space. It can serve as the
she has been a PhD student in the Theoretical Chemistry group of                   hypersurface for the description of reactive as well as diffusive
Prof. Dr. Markus Reiher at ETH Zurich. Her work revolves around                    processes. Quantum chemistry has provided routine tools to
the development of first-principles heuristics for automated                       calculate and explore such surfaces,[1–3] up to the point where
reaction space exploration.                                                        this is possible in an automated manner[4–6] in order to provide
                                                                                   the full detail of a chemical process in terms of thousands of
                                                                                   interconnected elementary steps. With subsequent inclusion
                       Markus Reiher received his PhD in                           of nuclear motion to arrive at free-energy differences for the
                       theoretical chemistry from the University of                elementary reaction steps, we have all information, obtained in a
                       Bielefeld in 1998, working in the group of                  first-principles way with no experimental information required,
                       Juergen Hinze. After his habilitation in the                to study the kinetics in the network.[7–15]
                       group of Bernd Artur Hess at the University                      While this procedure provides us with a quantitative
                       of Erlangen, he worked as a private docent                  description of chemical reactions in terms of relevant species
                       in Erlangen in 2003 and at the University                   involved, barrier heights, and concentration fluxes, it does
                       of Bonn in 2004. In 2005, he accepted                       not necessarily allow us to describe and understand what is
                       an offer for a professorship in physical                    actually going on. In particular, common reactivity patterns
                       chemistry at the University of Jena, where                  need to be identified from the quantitative data that is available
he worked until he moved to ETH Zurich in 2006. His research                       from experiment and from detailed calculations. In order to
covers many different areas in theoretical chemistry which range                   accomplish this, chemical concepts have been developed, many
from relativistic quantum chemistry to the development of new                      of them introduced in a rather ad hoc fashion to describe changes
electron-correlation theories and smart algorithms for the efficient               in electronic structure upon a chemical reaction. Examples are
exploration of complex chemical reaction networks.                                 the electronegativity concept as defined by Pauling[16,17] and the
                                                                                   hard and soft acids and bases (HSAB) concept by Pearson.[18–20]
                                                                                   In the past decades, it has become possible to root these concepts
1. Introduction                                                                    in quantum chemical theories. While this is natural for electronic
    Understanding chemical processes often starts with structure                   structure-related concepts such as partial charge and bond order,
elucidation of the relevant reactive species and their connection                  it is also possible for electronegativity and HSAB.
in terms of a reaction network. Quantum chemical calculations                           These concepts allow us to relate and discuss data obtained
allow us to detail the reaction mechanism by mapping it to a                       for different molecules and reactive systems. As such, they
network of elementary reaction steps characterized by reactants,                   have a categorizing value that can hardly be over-estimated.

*Correspondence: Prof. M. Reiher, E-mail: markus.reiher@phys.chem.ethz.ch
Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, CH-8093 Zurich, Switzerland
312                          
        CHIMIA 2021, 75, No. 4                                                          Laureates: Junior Prizes of the SCS Fall Meeting 2020

However, their predictive power remains obscure. Even though 2. Chemical Concepts
rigorous mathematical definitions for many concepts exist, most                Electronegativity, hard and soft acids and bases, Fukui
concepts are not unique in a sense that they can be introduced functions, and the dual descriptor, all refer to the electronic
in different ways (e.g. there exist various ways to define atomic structure evaluated for a frozen molecular structure, typically
and hence partial charge as an atom is not rigorously defined in the equilibrium structure of a reactant. Hence, we limit
a molecule without ad hoc assumptions – even Bader’s elegant ourselves to the standard electronic-energy based discussion
atoms in molecules theory[21] is difficult to develop consistently for specific individual structures on the Born-Oppenheimer
in a fundamental relativistic framework without additional surface, typically those that mark local minima. Accordingly,
assumptions;[22,23] loosely speaking, due to the lack of second thermal and entropic effects are left aside so that one can expect
derivatives in the Dirac equation[24]). Moreover, the concepts are predictions for enthalpic but not for entropic contributions.
typically evaluated for the unreactive, isolated reactants, i.e. in Naturally, these can only be sensible if the elementary step
their equilibrium structures. It is not at all clear how they could under consideration is governed by electronic effects. We note,
possibly point the way toward the minimum-energy reaction path however, that there exist generalizations of quantum theory as
and hence to the transition state and barrier height. Therefore, we well as chemical concepts towards thermodynamic ensembles
may anticipate that it will be difficult to define a predictive concept (see, e.g. refs. [36,37]).
that offers a rationale or better a quantitative relation between the
(molecular and/or electronic) structure of reactants and reactivity 2.1 Electronegativity
measures such as activation barriers (that eventually relate to rate
                                                                                   On the Predictive Power of
                                                                               Electronegativity is among the oldest and most widely used
constants, and hence, also to equilibrium constants).                     chemical concepts. The term dates back to Berzelius’ work in the
                                                                                      Chemical Concepts
     Reactivity predictions should allow one to answer questions early 19th century[38] and the concept even further.[39] It denotes
such as these: Will certain reactants react with one another under the tendency of atoms to attract electrons towards themselves.­[16]
                                                                                                                 †                                     ⋆
given reaction conditions? Which reaction paths and/or products Among      Stephanie  the many     quantitative
                                                                                             A. Grimmel              definitions
                                                                                                                    and        Markus   that Reiher
                                                                                                                                               have been   put
are to be expected, i.e. how do the reactants react with each other? forward, the first one by Pauling remains to be the most popular
                                                                                                                   [16]

From which direction is a given reactant attacked preferably?             one,of which
                                                                   Laboratory     PhysicalisChemistry,
                                                                                             why we briefly      consider
                                                                                                       ETH Zurich,              it here. Pauling2, proposed
                                                                                                                         Vladimir-Prelog-Weg         8093 Zurich,
We focus on the prediction of elementary steps as opposed to a thermochemical recipe Switzerland          driven by the availability of such data at
more complex multistep reactions to which our discussion         †  can ofthe
                                                                   Winner   thetime.
                                                                                  Best His  definition
                                                                                       Poster           relates
                                                                                              Presentation   Awarddifferences       in the electronegativity
                                                                                                                       in Computational       Chemistry at the SCS
be generalized in a straightforward manner.                               χ of two elements A Fall    and Meeting
                                                                                                            B to the  2020deviation of the dissociation
     For planning chemical synthesis, it is important to know energy            ⋆    of the heteroatomic diatomic molecule, Ed(AB), from
                                                                                   Corresponding author: markus.reiher@phys.chem.ethz.ch
whether and which of these concepts can be reliably employed. the average of the dissociation energies of the corresponding
Whereas one might argue that traditional chemical synthesis homoatomic molecules, Ed(AA) and Ed(BB):
planning has rather been based on chemical intuition that is
nourished by the vast amount of experimental knowledge (e.g.                                  
                                                                                                               Ed (AA) + Ed (BB)
                                                                                                                                           · (eV−1/2 ) (1)
                                                                               P       P
                                                                                         
by name reactions) – even when exploited in a data-driven                      χA − χB := Ed (AB) −
                                                                                                                             2
                                                                                                                                                                  (1)
manner,   [25–33] recent developments in automated reaction
exploration schemes[4–6] based on extensive quantum chemical
calculations would benefit from knowledge about the predictive                 With regard to the predictive capabilities of the electronegativity
value of reactivity concepts.                                             it should be pointed out M     that by I+ rearranging
                                                                                                                       A               Eqn. (1) one obtains
                                                                                                       χ :=               ,                                       (2)
     We have put forward the idea of first-principles heuristics, an expression for the heteroatomic                2 dissociation energy in terms
i.e. the idea of exploiting features of the electronic wave function of Pauling electronegativities χ and homoatomic dissociation
                                                                                                                     P

to cut the deadwood from the reaction space,[34,35] which for energies. Considering the reliability of a heteroatomic dissociation
bimolecular reactions alone formally grows with the number energy calculated in thismanner                          already sheds doubt on how
of pairs of atoms in both reactants. This can only be fruitful if reliable the electronegativities            ∂Ecan
                                                                                                                 el potentially be. Hence, examples
                                                                                                  χ := −                 = −µ                                     (3)
reactivity predictions based on concepts are reliable (at least to a of poor predictions can be easily        ∂N vfound, e.g. in the case of alkali
certain degree). Therefore, we elicit on what is known about the fluorides and chlorides.[40]
predictive power of chemical reactivity concepts in this work.                 Indeed, the definition of χP through Eqn. (1) has profound
     For a concept to have predictive power, it should enable us to limitations, some of which are:
judge reliably on the reactivity of a given set of reactants without 1. If the evaluation    χ− ≈ Eel (Nof −bond1) − E energies
                                                                                                                         el (N ) = I
                                                                                                                                      relies on enthalpies (4)
any a priori information about possible reaction products and                    of formation  + (as in Pauling’s work
                                                                                             χ ≈ Eel (N ) − Eel (N + 1) = A
                                                                                                                                      [16]), it depends on
                                                                                                                                                                  (5)
paths. Note, however, that it can be equally important – especially              macroscopic environmental    +
                                                                                                                     parameters        such as temperature.
in automated mechanism exploration – to determine what atoms 2. The definition                      χ    +  χ       I  +   A
                                                                                               χ ≈ marries electronic          = with
                                                                                                                                  χM nuclear dynamics (6)
                                                                                                      −
                                                                                                                 ≈
or functional groups in a molecule are unreactive.                               because experimental    2 data was2taken for the parametrization.
     In the following section, we review mathematical definitions                As a consequence, the data cannot be easily calculated with
of four chemical concepts. As examples we select the                             sufficient accuracy.
electronegativity and the chemical hardness, because they belong 3. It is not obvious whether                       only data on diatomics may be
                                                                                                           ǫHOMO + ǫLUMO
to the core of chemical concepts taught to chemistry students                    used for the definition
                                                                                                χM ≈ − or whether bond            .      breaking processes (7)
                                                                                                                     2
from high school education onwards – however, typically, without                 in any molecule can be considered. That is, the definition
discussing their quantum chemical foundations. Furthermore,                      ignores the fact that atoms in different bonding situations
we consider the Fukui functions and the dual descriptor for the                  may not be comparable. 1In other words, one will be forced
identification of nucleophilic and electrophilic sites of reactants.             to find a categorizing scheme that groups atoms in similar
We outline limitations of specific descriptors as well as overall                binding situations, for example, by introducing different
constraints. Finally, we expand on how to gather further insights                valence states (and we refrain here from a discussion of how
upon the predictive capabilities of chemical concepts. We                        that could possibly be accomplished in a rigorous way).
elaborate on why, besides the value of chemical concepts for 4. Moreover, Pauling’s definition only allows for the calculation
describing and understanding chemical processes, massively                       of relative electronegativity values, which requires an
automated reaction exploration algorithms will be important to                   arbitrary definition of an absolute reference. In his original
assess actual reactive behavior in full depth, shedding also light               work, he chose 0.0 for hydrogen, which was later replaced
on the application range of reactivity concepts.                                 by 2.1 and today is typically set to 2.2.[16,17,41,42]
 P
                                                                             χA − χP
                                                                                         Chemical Concepts
                                                                                     B := Ed (AB) −
                                                                                                            Ed (AA) + Ed (BB)
                                                                                                                                         · (eV−1/2 )               (1)
                                                                                                                            2
 Chemical Concepts                                                           χ − χP  := Ed (AB) − d
                                                                              P       
                                                                                                            †
                                                                                                              E    (AA)     + E d (BB)
                                                                                                                                         · (eV−1/2⋆)               (1)
           Laureates: Junior Prizes of the SCS Fall Meeting 2020            PA PB A. GrimmelE
                                                                      Stephanie                                 and       Markus
                                                                                                                            2Ed (BB) Reiher
                                                                                                                                                                  (1) 313
                                                                            χA − χB  := Ed (AB) −            d (AA)     +                      CHIMIA 2021, 75, No. 4
                                                                                                                                       · (eV−1/2  )
                                                                                                                           2
nie A. Grimmel† and Markus Reiher⋆                            Laboratory of Physical Chemistry, ETHMZurich,     I +Vladimir-Prelog-Weg
                                                                                                                       A                      2, 8093 Zurich,
                                                                                                      χ    :=
                                                                                                    Switzerland2          ,                                        (2)
ical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich,                                                  I  +  A
           5. The     parametrization results in an overdetermined
                  Switzerland                               †
                                                              Winner of thesystem     when
                                                                             Best Poster      treating
                                                                                         Presentation χM   := in Computational
                                                                                                         isolated
                                                                                                        Award             ,
                                                                                                                       atoms   or molecules.
                                                                                                                                        ChemistryThisat issue
                                                                                                                                                        the SCS    (2)
                                                                                                                                                               is discussed
                                                                                                               I +2A
                 of equations.
Poster Presentation Award in Computational Chemistry at the SCS
                                                                                      in great   detail
                                                                                                 Fall  M
                                                                                                         elsewhere.
                                                                                                          := 2020, [47,48,50–54]
                                                                                                    χMeeting                                                      (2)
                                                                                                                   2
           6. Fall
                 Depending    on which element pairs are employed the resulting
                    Meeting 2020                                           ⋆
                                                                              CorrespondingInauthor:
                                                                                               practical   applications,         Eqn. (3) is most often evaluated
                                                                                                       markus.reiher@phys.chem.ethz.ch
                                                                                                            ∂Eel
                 electronegativities may differ.                                      within aχfinite
                                                                                                  := − difference=approach.
                                                                                                           ∂N v
                                                                                                                             −µ          The electronegativity     (3)is then
                From
 esponding author:     this list, it becomes clear that Pauling’s scheme is assumedχto
                    markus.reiher@phys.chem.ethz.ch
                                                                                                  := −
                                                                                                      be  equal
                                                                                                            ∂Eelto the average of the left and right derivatives
                                                                                                                   + [47]= −µ                                     (3)
           ambiguous for a couple of reasons. Most of these issues are not of            Eqn. (3), χ−and ∂E∂Nelχ : v
           present in the absolute electronegativity definition by Mulliken,
                                                                          P [43]P            χ := −                   = −µ                                      (3)
                                                                         χA − χB  := Ed (AB) − E∂N        d (AA)   v
                                                                                                                       + Ed (BB)
        which may be rigorously traced back to quantum chemical                                                                    · (eV−1/2 )                (1)
                                                                                                                        2
χP
   
          Equantities
                        d (AA)
                      Ethat    + be
                             can  Ed (BB)
                                      routinely calculated on modern    computer           χ− ≈ Eel (N − 1) − Eel (N ) = I                                         (4) (4)
  B :=      d (AB) −                       · (eV−1/2 )          (1)
   
                                2
           hardware. The Mulliken       electronegativity χM of some atom is the           χ +
                                                                                             − ≈ Eel (N ) − Eel (N + 1) = A
                                                                                           χ ≈ E (N − 1) − E (N ) = I                                              (5)
                                                                                                                                                                   (4)
                                                                                                              el                          el
            average of its ionization energy I and electron affinity A,                             +    χ−(N + χ) +−      I(N+ (N
                                                                                                                                 A ) ==M
                                                                                                 χχ−χ ≈EE
                                                                                                     ≈≈    el (N−
                                                                                                         elM        I1)+ elE
                                                                                                                       E−A
                                                                                                                       ≈      el + 1)
                                                                                                                                   = χIA                             (5) (5)
                                                                                                                                                                    (4)
                                                                                                                                                                     (6)
                                                                                                   +
                                                                                                         χ − 2
                                                                                                               := +          ,2                                   (2)
                                                                                                 χ ≈ Eχel (N+)χ− E2el (N   I ++A1) = A                              (5)
                         I +A
                                                                                    (2)             χ≈                 ≈           = χM                              (6)
                 χM :=        ,                                       (2)                               χ− +2χ+          I +2A
                           2                                                                       χ≈                 ≈            =χ   M
                                                                                                                                                                    (6) (6)
                                                                                                             2               2
                                                                                                       M
                                                                                                                ǫHOMO + ǫLUMO
                                                                                                                       
           which is a natural      definition in the sense that it connects the                    χχ:= −≈ −∂Eel                       .                            (7)
                                                                                                                            2= −µ                                 (3)
                                                                                                                 ∂N
                                                                                                                 ǫ          +  ǫ
           concept ∂Eto elthe two quantities that measure the energy required                     EvenχMthough
                                                                                                          ≈ − derived differently,
                                                                                                                   HOMO  v       LUMO          this expression is equal
                                                                                                                                                                     (7) to the
                            
           χ := −               = −µ                               (3)                                                                  .
           or liberated
                      ∂N whenv
                                     an electron is removed or added to the                   Mulliken
                                                                                                     χ ≈−
                                                                                                                ǫHOMO +2ǫLUMO
                                                                                                      M electronegativity          χ M
                                                                                                                                      .
                                                                                                                                         given  in Eqn. (2). Therefore,
                                                                                                                                                                    (7)
                                                                                                                                                                         finite
           system. In our setting here, we may identify I and A as the                        differences clearly1 link   2 both approaches.
           vertical electronic energy differences calculated between the                          Rather than calculating the electronic energy of the system
           system’s ground state and the corresponding states at the same                     with                 1
                                                                                               χ− ≈different
                                                                                                     Eel (N −electron     numbers
                                                                                                                1) − Eel (N ) = I (i.e. N, N + 1, and N(4)– 1), one
        − structure with one electron removed and added, respectively                         may
                                                                                               χ+ ≈replace
                                                                                                     Eel (N ) ionization   energy
                                                                                                                              = A and electron affinity(5)
                                                                                                                                                         by orbital
                                                                                                                  1
      χ ≈ Eel (N − 1) − Eel (N ) = I                               (4)                                        − Eel (N + 1)
           (with proper sign conventions). With this definition, the concept                  energies−of the+ highest occupied molecular orbital (HOMO) and
        +                                                                                            χ +χ           I +A
      χ ≈ofEelelectronegativity
                (N ) − Eel (N + 1) is =Ano longer limited to atoms (5) but can be             theχ lowest
                                                                                                   ≈        unoccupied
                                                                                                                 ≈         molecular
                                                                                                                           =  χM      orbital (LUMO),(6) ∈HOMO and
                                                                                                          2           2
           generalized
             χ− + χ+ to     I+entire
                                A molecules.                                                  ∈LUMO, respectively, of the system with N electrons by virtue of
        χ≈              ≈          = χM                            (6)
               An,
                 2 at first sight,
                              2     different approach to define electronegativity            Koopmans’ theorem,[55] to obtain
           is taken in the context of conceptual density functional theory
           (cDFT).[44–46] For an extensive review of the foundations of cDFT                                      ǫHOMO + ǫLUMO
           see ref. [47] and for a recent perspective ref. [48]. cDFT provides                      χM ≈ −                      .                                (7)    (7)
                                                                                                                        2
n the Predictive Power of
          χM  ≈−
                   ǫHOMO +definitions
           mathematical        ǫLUMO
                                      .    of numerous chemical    (7) concepts (see
           also below). Its basis is the standard procedure of physical
                            2
  Chemical Concepts
           modeling and that is to expand a quantity that depends on a set                        Whereas Koopmans’
                                                                                                               1         theorem relies on error cancellation in the
           of parameters in terms of a Taylor series. In this case, it is the                 case of Hartree-Fock calculations[56] (owing to the neglected orbital
           electronic 1energy     Eel[N,ν;{Rk}] that depends
phanie A.      Grimmel† and             Markus Reiher⋆ on the number of
           electrons N and the external potential ν. The latter is typically given
                                                                                              relaxation upon oxidation vs. the missing electron correlation in
                                                                                              the independent particle model), it can be considered exact for
           as the Coulomb
Physical Chemistry, ETH Zurich,  potential  of the underlying
                                   Vladimir-Prelog-Weg           nuclear framework
                                                         2, 8093 Zurich,                      ionization energies in exact Kohn-Sham DFT.[57–59] However, the
           in Born-Oppenheimer
                      Switzerland       theory, and hence, it also encodes the                LUMO Kohn-Sham energy cannot be exploited that easily.[60,61] It
           specific molecular structure given by the set of nuclear Cartesian                 refers to an excited Kohn-Sham electron rather than to an added
Best Poster Presentation Award in Computational Chemistry at the SCS
           coordinates of all nuclei, {Rk}.                                                   Kohn-Sham electron[60,62] so that one might consider the HOMO
                   Fall Meeting 2020
               Although this consideration is typically done in the framework                 energy of the N + 1 Kohn-Sham system and reverse its sign to
           of conceptual
 Corresponding               DFT, we emphasize that such a Taylor expansion of
               author: markus.reiher@phys.chem.ethz.ch                                        obtain the electron affinity of the N Kohn-Sham system.[63]
           the electronic energy may be written for any electronic structure                      Mulliken pointed out that the electronegativity is not only an
           model. However, DFT may offer specific advantages over other                       atom’s ground state property, but instead depends on its valence
           models
                   (e.g. as Koopmans’ theorem becomes exact; however, see                    state.[43] This notion was formalized by Hinze and Jaffé[64–67] by
        our discussion
      P                    Edbelow).
                               (AA) + E The   identification
                                         d (BB)               of reactivity  concepts         introduction of orbital electronegativities. The electronegativity
A − χB :=     Ed (AB) −                          · (eV−1/2 )             (1)
           as responses of Eel towards2      changes in N and ν is in line with the           χi of orbital i with Ni being the number of electrons in that orbital
           understanding that they vary upon interactions between reactants                   is given by
           during chemical reactions.
               In 1961 – and hence even before cDFT evolved as a field –                                      
                                                                                                                   ∂Eel
                                                                                                                          
                              I +A                                                                χi := −                         .                        (8)          (8)
           IczkowskiχMand:= Margrave,     identified the electronegativity
                                                                         (2) with the                              ∂Ni
           negative response2 of the energy towards changes in the number of
                                                                                                                              v

           electrons at constant external potential:[49]
                                                                                           Dramatically different electronegativity values were found,
                         
                             ∂Eel
                                                                                      for example, for χ2 different valence states of carbon.[68]
                 χ := −              = −µ                               (3)     (3)              ω :=                                        (9)
                             ∂N                                                            Although 2η   there is a decent correlation between      Mulliken
                                   v
                                                                                       and Pauling electronegativities,[64,69,70] the theoretical basis of
                                                                                       Mulliken-Jaffé electronegativities χM has the salient feature that it
                 Later on, this expression was recognized as the negative of links directly to quantities of electronic structure theory, in which
            the  chemical potential µ by Parr.[44] The minus sign is introducedAh Bany  s+A (fixed)−−
                                                                                              s Bh ↽  molecular
                                                                                                      ⇀
                                                                                                      −− Ah Bh + A scaffold
                                                                                                                     s Bs   represents a point on the Born-
                ≈ Eela(Nhigh
           χto− have       − 1)(positive)
                                − Eel (N ) electronegativity
                                           =I                           (4) tendency Oppenheimer surface given by the electronic energy, as opposed
                                                             flag a strong
           χto+ ≈
                attract
                   Eel (Nelectrons,
                          ) − Eel (N +and
                                        1) =hence,
                                              A    it should be associated
                                                                        (5)    with a to the profound problems associated with Pauling’s definition.
            decrease
                   χ− + inχenergy
                            +
                                 I+upon
                                      A an M increase in N (note that for sufficiently     Electronegativities of atoms are widely used to identify
              χ ≈ molecular
            large             ≈ systems,= the
                                           χ electronic energy can be   (6)considered reactive
                        2           2                                                     1 ∂ 2 Ecenters    1within
                                                                                                                ∂µ molecules. For example, the fact that
                                                                                                                 
                                                                                                   el
            to decrease upon reduction of the system by one electron). Despite η̃ :=   tetrahedral
                                                                                          2 ∂N 2 v
                                                                                                      carbon
                                                                                                          =    has
                                                                                                            2 ∂N v
                                                                                                                    a lower electronegativity
                                                                                                                                            (10)than chlorine
            appearing mathematically concise, Eqn. (3) suffers from the fact (8.07 vs. 10.05               eV according   to the Mulliken-Jaffé values from
            that the number of electrons N is restricted to integer numbers ref. [68]) allows us to predict that a nucleophilic attack on
                         ǫHOMO + ǫLUMO
               χM ≈ −                  .                                    (7)
                               2                                                                    
                                                                                                        ∂ 2 Eel
                                                                                                                         
                                                                                                                              ∂µ
                                                                                                                                      
                                                                                             η :=                     =                                   (11)
                                                                                                        ∂N 2                  ∂N
2η
                                                                                                  −−
                                                                                    Ah Bs + As Bh ↽ −⇀
                                                                                                     − Ah Bh + As B χs2
                                                                                                              ω :=                                   (9)
         314                          
                 CHIMIA 2021, 75, No. 4                                                          Laureates
                                                                                         Ah Bs + A      −−
                                                                                                   s Bh ↽−− :AJhunior
                                                                                                          ⇀         2ηA
                                                                                                                 Bh +   Psrizes
                                                                                                                           Bs of the SCS Fall Meeting 2020
                                                                   Ah Bs + As Bh ↽    −− −⇀− Ah Bh + As Bs
                                                                                2                       
                                                                             1 ∂ Eel             1 ∂µ
                                                                       η̃ :=                  =                                        (10)
         chloromethane should preferably occur at the carbon center. 2This        ∂N1 means
                                                                                      A
                                                                                      
                                                                                         ∂ 2vEel this
                                                                                       2h Bs +  A    hprefactor
                                                                                                 2 s B∂N −−⇀ AhBemerging
                                                                                                        1↽−−  v∂µ    h + As Bs in front of any second-
         The question whether electronegativity can be used to predict η̃    derivative
                                                                               :=
                                                                                    2 ∂Nterm 2   of= aTaylor series expansion is to (10)  be understood
                                                                            1 ∂2E               1 v ∂µ2 ∂N v
         reaction barriers in a quantitative manner was tackled, e.g.η̃ :=
                                                                         in as included
                                                                                     el
                                                                                             = in the  definition,    implying   a scaling
                                                                                                                                      (10)  of the hardness.
         the context of hydrogen abstraction reactions: Nguyen et al. 2 In      ∂N 2 following, 2 ∂N
                                                                                the
                                                                                   2
                                                                                          v         we
                                                                                                           continue
                                                                                                            v           to apply the scaled   hardness.
         established a correlation between the reaction barriers of theη := ∂InEelanalogy     =
                                                                                                  ∂µ
                                                                                               1  ∂to
                                                                                                     2 electronegativity, the chemical hardness is
                                                                                                      Eel
                                                                                                                   
                                                                                                                   1 ∂µ
                                                                                                                           
                                                                                                                                       (11)
                                                                                  ∂N   2η̃ :=      ∂N           =                                    (10)
         abstraction of a given hydrogen atom from propene and methane often calculated  ∂ vEel2 ∂Nwithin
                                                                                           2                      a2 finite
                                                                                                                       ∂N vdifference approach, which
                                                                                                               
                                                                                                        2 v∂µ
                                                                               η :=  2  =                  v                              (11)
         and the electronegativity of attacking radicals. A larger share eventually
                                                         [71]                 
                                                                                ∂ 2 Eel ∂N
                                                                                              reads   as
                                                                                                  ∂µ ∂N v
                                                                                                  v
         of chemical space was investigated in a study of the relation   η :=        2
                                                                                             =                                        (11)
                                                                                ∂N                ∂N v
         between different cDFT properties and 216 electrophilicity                        v

         values from the Mayr database[72–75] by Lee et al.[76] They               η ≈ I − A. ∂ 2 Eel           
                                                                                                                     ∂µ
                                                                                                                                      (12)           (12)
                                                                                           η :=                 =                                    (11)
         observed only a very poor correlation (R = 0.539) between
                                                     2
                                                                                           η ≈ I −∂N A. 2
                                                                                                              v     ∂N v                     (12)
         Mayr’s electrophilicities and electronegativities obtained with
                                                                           η ≈ I − A.                             (12)
         Eqn. (6).
                  Animproved, but still weak correlation (R2 = 0.737)
                  ∂Eel
                                                                            The application of Koopmans’ theorem then yields
        χwas   found with. the cDFT electrophilicity (8) ω, which is a η ≈ ǫLUMO − ǫHOMO .
          i := −                                    [77,78]
                                                                                                                   (13)
                  ∂Ni v
         joint property calculated from the electronegativity χ and the                  η ≈ I − A.                                                          (12)
         chemical hardness η:                                               η ≈ ǫLUMO − ǫHOMO .                         (13)                                  (13)
                                                                                            η ≈ ǫLUMO − ǫHOMO .                              (13)
                     χ2                                                                         The global
                                                                                                       1
                                                                                                           softness, S, is identified as the inverse of the
              ω :=                                              (9)                (9)       hardness,
                                                                                                  S := .                                (14)
                     2η                                                                     η η ≈ 1ǫLUMO − ǫHOMO .                          (13)
                                                                                            S := .                                   (14)
                                                                                           1      η
             The concept of chemical hardness will be considered in the             S := .
                                                                                           η
                                                                                                                                (14)         (14)
        following
Ah Bs + A s Bh −
               ↽ −
                 −− section.
                  ⇀  Ah Bh + The
                             As Bpoor  correlation coefficients may be taken
                                  s                                   LiI(g) + CsF(g) ↽−−−⇀
                                                                                          − LiF(g) + CsI(g) 1
        as a clear indication that kinetic predictions based on Eqns. (6)                            S := .                                 (14)
        and (9) are unreliable. However, it should be pointed out that withLiI(g) +ACsF(g)
                                                                                       prototypical
                                                                                              −−
                                                                                              ↽ −⇀    example
                                                                                                 − LiF(g)         of Pearson’s HSAB principle is the
                                                                                                            η+ CsI(g)
        Mayr’s electrophilicities being obtained from experimental      data,+ CsF(g)
                                                                     LiI(g)     reaction
                                                                                      −
                                                                                      ↽−⇀
                                                                                        −−between
                                                                                             LiF(g) +LiICsI(g)
                                                                                                          and CsF:[81]
        deviations
                  may    notbe exclusively attributed to shortcomings                 2
      1 ∂ 2 Eel
        
                       1 ∂µ
 η̃ := of the  2
                   concepts’
                     =         definitions: Measuring(10)inaccuracies and                       2
      2 the∂Nprotocol
                   v   2 for
                           ∂N the
                                v calculation of electrophilicities from            LiI(g) + CsF(g) − ↽  −
                                                                                                         −⇀− LiF(g) + CsI(g)
                                                                                        2
        experimental rate constants (see ref. [73]) as well as limitations
        of the applied electronic structure method can also contribute to
        the
             discrepancies.
          ∂ 2 Eel
                      
                         ∂µ
                            This issue is considered in some more detail                        With I– being softer
                                                                                                                    2 than F– and Cs+ softer than Li+, the HSAB
   η := in Section
               2
                    =4.                               (11)                                   principle correctly predicts that the conversion should be exothermic.
            ∂N               ∂N
                                                                                             This is especially notable considering that a prediction based on
                                 
                     v             v∂Eel
                         χi := −           .                                  (8)
         2.2 Hard and Soft∂N     Acids
                                  i  v and Bases                                             Pauling electronegativities and a rearranged form of Eqn. (1) fails
             Based on experimental observations Pearson formulated the                       for this example.[81]
         hard and soft acids and bases (HSAB) principle as follows: “Hard                        Nevertheless, the HSAB principle should by no means be applied
            η ≈ I − A.                                (12)
         acids prefer to bind to2hard bases and soft acids prefer to bind to                 blindly. Most importantly, as Pearson himself pointed out,[82] it only
         soft bases.”           χ
                           ω This
                              := means, that with A denoting an acid,
                                                                    (9) B a                  constitutes one amongst many effects determining the interaction
                      [18–20]
                                2η h and s hard and soft, respectively, the
         base, and the subscripts                                                            between two reactants. Considering the terms in the Taylor series
         exchange reaction                                                                   expansion of the electronic energy, this means that an exact
       η ≈ ǫLUMO − ǫHOMO .                                     (13)                          correspondence between chemical reactivity and predictions based
                                                                                             on the HSAB principle can only be expected if all other derivatives
            Ah Bs + As Bh ↽     −− ⇀
                                   − − Ah Bh + As Bs                                         are negligible – a condition that is hardly ever met in chemical
                                                                                             reactions. For example, it is known that the HSAB principle tends
                                           
                                   ∂Eel
                     χi := −                   .                               (8)
                    1            ∂E∂N    
                                      el i    v
                                                                                             to be dominated by the preference of strong acids to recombine with
             S  := χ   .
        should beηiexergonic.
                       :=   −          In Pearson’s
                                              .                    (14)
                                                         early work, soft acids and bases
                                                                              (8)            strong bases.[83] Applications of the HSAB principle to ambident
                         2 ∂N    i v 
        were defined 1 ∂to    Ebe
                                el those     1 that∂µare easily polarizable, while hard      reactants were heavily criticized due to notable failures occurring
                                                       
                :=                       = [18]                                 (10)
        ones η̃are   2hard∂N to2 polarize.
                                      v2     2 ∂NIt vis rather surprising that such a        even for prototypical systems.[74,84]
        phenomenological     ω := concept that was proposed as an attempt
                                     χ
                                                                                (9)    to        Although its now established roots in quantum chemistry
        categorize
 (g) + CsF(g)  − −⇀
               ↽−− ω :=a  wealth
                       LiF(g)   + χ  of
                                    22η
                                   CsI(g)  experimental     observations  can actually be    allow for explicit calculations, the HSAB principle should not
        based on rigorous                                                     (9)
                         2 mathematical
                                  2η                 definitions in a quantum chemical      be mistaken as a universal tool for reactivity predictions. It may
        framework.       cDFT
                           ∂ Eel provided       ∂µsuch a framework, in which hardness        be regarded as one measure for reactivity in the context of all
                                           
                 η :=                    =                                      (11)
        η appears as ∂N     the2resistance
                                      v         ∂Nof vthe chemical potential µ towards       such concepts evaluated for a reactant.
                  2
        changes
           Ah Bs +inAthe s Bhnumber
                               −
                               ↽−−⇀− Ah of  Bh electrons
                                                  + As Bs N. Accordingly, a derivative
        term
         Ah Bsin+ the
                    As Baforementioned
                             −−
                           h ↽ −⇀− Ah Bh + ATaylor  s Bs
                                                            expansion of the electronic      2.3 Fukui Indices and the Dual Descriptor
         energy is interpreted as the hardness:[79]                                               To obtain information about where a reactant is attacked,
                            η ≈ I − A.                              (12)                     local reactivity descriptors are required. One prominent example
                      1 ∂ 2 Eel
                                      
                                       1 ∂µ
                                                                                            is the Fukui function f(r) with the Cartesian coordinate r denoting
               η̃ :=          2
                                    =                            (10) (10)                  some position in space. It was introduced by Parr and Yang as a
                    1 2 ∂ 2∂N
                           Eel    v 1 2 ∂µ∂N v
            η̃ :=                  =                            (10)                         density-functional generalization of Fukui’s Frontier Molecular
                    2 ∂N 2 v         2 ∂N v
                       η ≈ ǫLUMO − ǫHOMO .                          (13)
                                                                                             Orbital (FMO) theory.[85]
              However, the prefactor 1/2 is frequently omitted:[47,80]                            The Fukui function is the mixed second derivative of the
                                                                                             electronic energy with respect to ν and N at a given position r. It can
                         2              
                          ∂ Eel         ∂µ
                  η :=             =                            (11)
                        ∂ 2∂N
                           Eel
                               2
                                  v     ∂N v
                                       ∂µ                                                    be represented either as the response of the chemical potential to a
                η :=               =                            (11) (11)                    change in the external potential at position r or as the response of the
                         ∂N 2 v 1 ∂N v
                              S := .                                (14)                     electron density ρ(r) towards a change in the number of electrons, i.e.
                                    η
                           η ≈ I − A.                                       (12)
f − (r) ≈ ρHOMO (r)                                   (20)
                                                                                                                                           f + (r) ≈ ρLUMO (r),                                  (21)
           Laureates: Junior Prizes of the SCS Fall Meeting 2020                                                                                                           CHIMIA 2021, 75, No. 4      315
                                                                                                                                  f − (r) ≈ ρHOMO (r)                                     (20)
                                                                                                                                       +
                                                                                                                                 f (r) ≈ ρLUMO (r),                                       (21)
                                                                                                                                                   
                             δ ∂Eel                δµ                  ∂ δEel                ∂ρ(r)                               (2)       δη          ∂f (r)
            f (r) :=
                           δv(r) ∂N
                                          =
                                                  δv(r)
                                                              =
                                                                      ∂N δv(r)
                                                                                     =
                                                                                              ∂N
                                                                                                         .   (15) (15)         f (r) :=
                                                                                                                                          δv(r) N
                                                                                                                                                   =
                                                                                                                                                        ∂N v(r)
                                                                                                                                                                                                   (22)
                                                                                                                                                                                                 (22)

                                                                                                                                  
                                                                                                               δη             ∂f (r)
             Regions where  f(r) islargeare supposed      to   be those
                                                                             from which f (2) (r)Within                 =difference approximation, ƒ (r)(22)
                                                                                                                the finite                                    can be written
                                                                                                                                                         (2)
             δ ∂Eel   δµ   ∂ δEel [85]  ∂ρ(r)                                                 :=
   f (r) :=reactive
                δ ∂E  attacks  =areδµthe most
                                             =− favorable.
                                                    ∂ δEel = Due          to the
                                                                      ∂ρ(r)   . derivative
                                                                                      (15)                   δv(r)
                                                                                                as the difference
                                                                                                               (2)    Nbetween
                                                                                                                           +   ∂N the− electrophilic
                                                                                                                                       v(r)          and nucleophilic Fukui
                         el                                                                                  f (r) ≈ f (r) − f (r).                                (23)
   f (r) := δv(r) ∂N − = δv(r)         ∂ρ(r) = ∂N δv(r) =              ∂N     .       (15)
                                             
           discontinuities
              δv(r) ∂Nf (r)with      respect
                                 := δv(r)       to ∂N
                                                    ≈ ρleft
                                                   N,       and
                                                        δv(r)
                                                        N (r) − ρright  derivatives are not(16) functions,
                                                                       ∂N
                                                                 N −1 (r)
           equal, and hence, two∂N        different
                                               v(r) quantities are typically reported,

           ƒ–(r) and ƒ+(r),     which    can be  approximated by finite differences:
                                            +
                                       ∂ρ(r)
                         f + (r) :=                 ≈ ρN +1 (r) − ρN (r)                   (17)      f (2) (r) ≈ f + (r) − f − (r).                           (23)    (23)
                                        ∂N v(r)
                                    −
                             ∂ρ(r) −
               f − (r) := ∂ρ(r)
               f − (r) :=
                                            ≈ ρN (r) − ρN −1 (r)                     (16)
                                                                                      (16)
                               ∂N v(r) ≈ ρN (r) − ρN −1 (r)                           (16)
                              ∂N    +v(r)                                                     ƒ(2)(r) takes values between –1 and 1 with negative and positive
              +
    δ ∂Eel f + (r) :=
                            ∂ρ(r)+
                        δµ ∂ρ(r) ∂− δE
                                                  
                                            ≈ elρN +1 (r) ∂ρ(r)
                                                                
                                                          − ρN (r)                    (17)      values indicating susceptibility towards electrophilic and
               f=(r) :=
 δv(r) ∂N   δv(r) ∂N  ∂N
                                = fv(r)
                               ∂N       k ≈
                                               ρN−1
                                            ≈qk,N +1=(r)  − ρN
                                                      −qk,N   (r).        (15)      (17)
                                                                                      (17) (18) nucleophilic attacks,      3 respectively.[92] The dual descriptor was
                                       v(r)δv(r)           ∂N
                                                                                           (19) reported to be more robust than Fukui functions, especially in the
                                                               
    δ ∂Eel              δµ             ∂+ δEel            ∂ρ(r)
                =               = fk ≈ qk,N −=qk,N +1             .        (15)
  δv(r) ∂N            δv(r)          ∂N δv(r)              ∂N                                   frozen-orbital approximation.[94]
                 Hence, ƒ–(r) describes the response of the system towards a                           As a demonstration
                                                                                                                  3           for the identification of reactive sites
            decrease of N, i.e. towards electrophilic attacks and ƒ+(r) the through the inspection of ƒ(2)(r), we consider propanone in the finite-
            response
                  ∂ρ(r)towardsfk− ≈ anqk,N
                                        increase
                                            −1 − qk,Nof N, i.e. the reactivity with respect
                                                                                         (18)      difference approximation (see Fig. 1). We obtained the underlying
                       − −
   f − (r) :=                 f≈   ≈Nq(r)
                                k+ ρ   k,N− −1ρ−   q (r)
                                               N −1k,N                       (16)        (18)
            to nucleophilic
                   ∂N −          attacks.
                              fk+ ≈ qk,N    To   allow
                                          − − qk,N +1    for  an   easier interpretation,   the electronic densities from a DFT/PBE[95,96]/def2-SVP[97,98]
                                                                                         (19)
                          v(r)          f (r)
                                            − q≈k,NρHOMO   (r)                                (20)
                        
   f − (r) :=    ∂ρ(r)
            Fukui       + f≈
                    functions      ≈ qk,N
                                kare
                                   ρNoften
                                      (r)+− not
                                              ρN −1analyzed
                                                     +1
                                                     (r)       in full spatial(16)       (19)
                                                                                resolution, but calculation with the program Orca.[99] The condensed-to-atom
                  ∂ρ(r)
                   ∂N                   f  (r) ≈   ρ      (r),                                (21)
     +
   f (r) := instead in a condensed-to-atom
                          v(r)  ≈ ρN +1 (r) − ρN (r)  representation. Staying
                                                     LUMO
                                                                             (17) in the finite values given in Table 1 were determined from Hirshfeld charges.[87]
                 ∂N +
                  ∂ρ(r) approximation, these Fukui indices ƒk can be obtained The fact that they turned out to be negative at the oxygen atom
            difference    v(r)                                             ±
     +
 f (r) :=           atom k≈     ρN     (r) − ρN(r)                       (17)
    δ ∂Efor
                any            from   the
                                        ∂ atomic           ∂ρ(r)qk and are given in atomic and positive at the adjacent carbon atom agrees with the notion that
                                                       charges
                   ∂N                +1                         
             el         δµv(r)               δEel
            units=as
 δv(r) ∂N   δv(r) 
                                  =                   =              .       (15)                  these are the primary attack sites for electrophilic and nucleophilic
                                 −  ∂N     δv(r)          ∂N 
                               f−  (r) ≈∂ρHOMO     (r) ∂ρ(r)                          (20)      attacks, respectively.
                                                    
    δ ∂Eel              δµ (2)               δE
                                              δη
                               f (r) ≈ ρHOMO (r)el          ∂f  (r)                      (20)
                 =        f +=(r) :=          ==                            .            (15)                        (22)
  δv(r) ∂N           δv(r) f + (r) ∂N δv(r) (r),
                                   ≈ ρLUMO
                                      δv(r)  N                        ∂N
                                                                       ∂N   v(r)                              (21)
                            f −(r)
                fk− ≈ qk,N −1      ≈ ρLUMO
                                 qk,N       (r),                                         (18)                 (21)
                                                                                                             (18)
                   +
                f−    ≈ qk,N − q−k,N       +1                               (19)
                                                                            (18)
               fkk ≈ qk,N  −
                                 −1     qk,N
                  ∂ρ(r)
                   +
   f (r) := k
     −          f     ≈   q       −
                            k,N ≈ k,Nq ρf (r)
                                           +1 − ρN −1 (r)
                                          (2)         +                     (19)
                                                                            (16)      (19) (23)
                                      N      (r)
                                                ≈ f (r) − f (r).
                                                                 −
                ∂N −       v(r)  δη 
                  ∂ρ(r)
                      (2) 
                                                        ∂f  (r) 
   f (r) := ∂ρ(r)
     −              f (r)+ :=≈ ρNδη(r) − ρN        = (r) ∂f (r)             (16)       (22)
                    f (2) (r)v(r)
               
                   ∂N          := δv(r) N =−1 ∂N v(r)                                  (22)
   f + (r) :=                           N +1 (r)N− ρN (r)
                                    ≈ ρδv(r)              ∂N v(r)           (17)
                   ∂N +
               However,     v(r) atomic charges are yet another concept without
                  ∂ρ(r)
                      (r) ≈
   f + (r) :=    f             ρHOMO≈ ρN(r)                                 (20)
                    −
            a unique
                   ∂N       definition.        Numerous
                                          +1 (r)  − ρN (r) mathematical definitions
                                                                            (17)        exist
                    +        v(r)
            because   (r)of≈
                 ff − (r)  ≈theρ    difficulty
                                 LUMO    (r), to define spatial boundaries
                                 HOMO (r) +
                               ρ(2)                                         (21) of an atom
                                                                            (20)
            in a fquantum           (r) ≈ fsuch
                              f (2)system       (r) − f a (r).                         (23)
                                            f + (r) −asf −  molecule (see above). Therefore,
                                                         −
                    +
                      (r) ≈fρLUMO   (r) ≈(r),              (r).             (21)       (23)
             the choice among these can significantly affect the values of the
             resulting
                 fk− ≈ Fukui
                         qk,N −1 indices.
                                 − qk,N [86] Hirshfeld
                                                 3      charges[87] are
                                                                     (18)a commonly                                         Fig. 1. ƒ(2)(r) = ±0.01 a0–3 isosurface for propane. Blue encodes negative
             recommended
                 fk−+ ≈ qk,N−choice.
                                    +1[88] 
                                 qk,N                                (19)                                                   values, orange positive ones.
                 f ≈δη   qk,N −1 − qk,N
                                      ∂f (r)                         (18)
                 Ifk+
      f (2) (r) :=   frozen       = are assumed, ƒ–(r) and ƒ+(r) can
                             orbitals                                (22)be expressed
                       δv(r)           ∂N
                 fk ≈ of
        (2) in terms δη
                         qk,N
                            theN qk,N +1
                               − removed        incoming orbital upon(19)
                                      ∂f (r)or v(r)                    change of the                                        Table 1. Condensed dual descriptor indices for propanone.
                                           
      f (r) :=                    =                                  (22)
             numberδv(r)of electrons,
                               N       ∂N v(r)
                                                                                                                                             Atom                                 ƒk(2)
             f (2)f(r)                3
                   − ≈ f + (r) − f − (r).                                                (23)
                     (r) ≈ ρHOMO (r) 3                                                   (20)                (20)                             OCO                                 –0.1
                  +
              (2)
                f−  (r)
                     ≈≈
             f f(r)(r) f +ρ    − f(r),                                                   (21)
                          ρ(r)       (r).                                                (23)                                                 CCO                                 0.1
                                   −
                            LUMO
                       ≈    HOMO (r)                                                     (20)
                 +
                f (r) ≈ ρLUMO (r),                                                       (21)                (21)                             Cα                                  0.0

                                                                                                                                           H                                  0.0
                        δη           ∂f (r)
                  in terms
             i.e.:=
      f (2) (r)            of HOMO
                                 =     and LUMO densities, ρHOMO(r)    and ρLUMO(r),
                                                                    (22)
                      δv(r) calculated
             respectively,     N    ∂Nasthe v(r) absolute square of the respective                                            Notably, the dual descriptor allowed for a back-on-the-
                        δη           ∂f (r)
      f (2) (r)  := which3 manifests
             orbital,            =                                  (22) Naturally,
                       δv(r) N         ∂Nthe v(r)
                                               connection to FMO theory.                                                    envelope discussion of the Woodward-Hoffmann rules[100,101]
           this interpretation is prone to fail for quasi-degenerate frontier                                               refuting the common belief that one requires an orbital model to
                            3
           orbitals.
               (2)
                       For +example,     it cannot reliably predict the regioselectivity                                    predict pericyclic reactions.
             f     (r) ≈ f
           of electrophilic  (r) −  f − (r). substitutions.[89,90] In orbital-weighted
                                  aromatic                             (23)                                                     Unfortunately, the applicability of Fukui indices and the dual
           Fukui     functions
             f (2) (r) ≈ f + (r) this
                                 − f −behavior
                                        (r).     is cured by the inclusion
                                                                       (23) of further                                      descriptor for predicting nucleophilicity and electrophilicity is
           orbitals – however, at the cost of introducing an empirically                                                    limited to orbital-controlled soft–soft interactions.[102] The most
           motivated weighting factor.[91]                                                                                  reactive sites for hard–hard interactions can be better predicted
                Instead of considering ƒ–(r) and ƒ+(r) separately to identify                                               using atomic charges or with descriptors extracted from the
           nucleophilic and electrophilic sites, Morell et al. suggested the                                                molecular electrostatic potential.[102,103] For example, Fukui
           use of the dual descriptor ƒ(2)(r) defined as the derivative of                                                  functions are known to poorly predict protonation sites.[102]
           the chemical hardness with respect to the external potential or,                                                 Another level of complexity arises due to the fact that for many
           equivalently,3the response of the Fukui function towards changes                                                 chemical reactions neither charge control nor orbital effects are
           in the number of electrons:[92,93]                                                                               highly dominating, but both have to be considered.[103–105]
                             3
316                         
       CHIMIA 2021, 75, No. 4                                                      Laureates: Junior Prizes of the SCS Fall Meeting 2020

3. Limitations of Reactivity Prediction by Local                             out and analyze vast numbers of quantum chemical calculations,
Concepts                                                                     inaccessible to manual work with quantum chemistry programs, and
    As we outlined for the examples above, the predictive power              can therefore deliver the data required.
of a single chemical concept is very limited. Following the Taylor               The compilation of such a test set is non-trivial: Typically
series expansion of the electronic energy, which explicitly states           researchers hand-pick reference reactions based on their prior
that various partial derivatives, now understood as different                knowledge or the literature or make use of databases such as the
chemical concepts, are to be added up, it is not sufficient to limit         aforementioned Mayr database. However, this approach bears the
oneself to one chemical concept when trying to understand the                risk of assembling biased test sets: Chemical concepts are integral
change in energy upon reaction, but several must be considered               parts of chemistry education. That is why it appears reasonable to
simultaneously. They all need to be incorporated into a reactivity           assume that reactions conforming to these are overrepresented in
prediction protocol. The fact that machine-learned models based              the literature and, hence, in the test sets. This should not affect the
on several indicators clearly outperform schemes based on a                  comparison of different evaluation schemes for a given concept.
single selected descriptor underlines this necessity.[106,107]               However, there might be a tendency to overrate the predictive
    The key difficulty of all reactivity descriptors defined in terms        power of chemical concepts as such in a circular-argument
of a Taylor series expansion of the electronic energy around a               situation. Automated reaction network exploration algorithms allow
reference point is that this is always local information in the sense        one to enumerate large numbers of different reaction coordinates
that it is valid for the unreactive and non-activated equilibrium            – including those that appear to be unpromising from a chemist’s
structure of a reactant, i.e. for the reference point: The local slope       perspective. Hence, there is hope that the bias in favor of known
and curvature of the energy functional is supposed to correlate              chemical concepts is less pronounced.
with the barrier height of eventual transition states. As formulated             Furthermore, reaction databases typically only contain
in Klopman’s non-crossing rule different reaction curves shall not           information on successful reactions, but no explicit information
cross between reactants and transition states.[108] In practice, this        about unproductive reaction coordinates. The possibility to sort out
means that the behavior at the onset of reactions has to be decisive         clearly unreactive reaction guesses is, however, one of the key tasks a
for the overall course of the reaction. This is why Geerlings et             useful reactivity predictor must fulfill, which is why this information
al. state that cDFT descriptors are expected to be unreliable for            should be included in a test set. A brute-force enumeration of
late transition states.[48] Making direct use of this statement for          reaction paths by automated exploration algorithms will certainly
reactivity predictions, however, is impossible because transition            also include unreactive scans. Finally, the fact that the target quantity
states are obviously unknown prior to the analysis.                          of the reactivity prediction (e.g. reaction barriers) is calculated with
    This issue may be considered more severe for all but                     the same quantum mechanical method as the reactivity descriptor
intramolecular reactions due to the fact that to project towards             itself, opposed to experimental data, offers the possibility to study
reaction paths and energy barriers the interactions between all              the reliability of the reactivity scheme separated from possible
reactants have to be considered, which will disturb the information          shortcomings of the underlying method compared to experiment.
gathered at the reference point. cDFT descriptors can be evaluated               So far, such extensive data has not been produced yet, but it can
for associated reactive complexes and even along entire reaction             be anticipated that this will change.[4–6,110]
paths,[109] but the relevant complex geometry and especially the path            Once available, the machine learning evaluation of reactivity
are, obviously, not known a priori in prediction tasks. For a list of        descriptors faced with explicit knowledge of vast reaction networks
further approaches which incorporate the effect of a second reactant,        will provide us with means to assess the extent to which we may
see ref. [48]. In principle, it would be possible to evaluate derivatives    prune quantum chemical reactivity explorations by first-principles
of Eel[N,ν;{Rk}] up to arbitrary order. In practice, however, there are      heuristics[34,35] in order to dramatically accelerate the automated, but
mathematical and technical difficulties associated with taking such          computationally costly exploration process.
derivatives.[48] Moreover, even if these derivatives were evaluated,
there is no reason to believe that the radius of convergence of              5. Conclusions
the series expansion is such that it encodes information up to the                In this work, we discussed why chemical concepts are not only
transition state structure. In other words, the series expansion is not      valuable to explain, describe, and categorize reactions a posteriori,
expected to converge for structures farther away from the reference          but also bear hope for reactivity prediction. Mathematical definitions
point on the Born-Oppenheimer surface.                                       allow for the quantification of well-known chemical concepts by
    Due to these arguments, a perfect matching between reactivity            means of quantum chemistry. In particular, partial derivatives of the
descriptors, which are evaluated at equilibrium structures, and              electronic energy with respect to the decisive parameters of a system
activation barriers (and even less so for reaction energies) cannot          (such as the number of its electrons) allow us to clearly define
be expected. They might, however, still be a valuable tool in order          reactivity concepts relying on electronic structure information.
to (de-)prioritize more or less promising reaction guesses for further            However, ambiguities may arise due to competing definitions of
analyses. I.e. if one can tell (for instance, in an automated mechanism      the same concept as well as different approximation and localization
exploration algorithm), which sites are very likely to be unreactive,        schemes. An exact correspondence between chemical concepts and
then this knowledge accelerates the exploration as unreactive paths          reactivity (measured, e.g. in terms of the activation barrier associated
can be omitted by the exploration protocol without the necessity to          with that process) cannot be expected because of the fact that the
consider them explicitly.                                                    concepts are evaluated for a specific molecular structure, whereas
                                                                             barriers require the comparison of energies of two sets of structures,
4. Reactivity Descriptors and Automated Reaction                             i.e. reactants and transition state structures (for an illustration of this
Space Exploration                                                            problem, see Fig. 2). The usefulness of chemical concepts critically
    To verify the usefulness of a reactivity prediction scheme and           depends on whether, despite these expected uncertainties, they still
gather profound knowledge about the associated uncertainties                 allow for discriminating between reactive and unreactive pathways
detailed statistical analyses have to be carried out. Automated              with a high degree of confidence.
reaction space exploration schemes cannot only benefit from the                   Especially automated quantum chemical reactivity exploration
exploitation of chemical concepts in the context of first-principles         can effectively exploit this situation. If, due to the aforementioned
heuristics,[34,35] they also offer the potential to help validating these.   uncertainties, we do not have clearly separable clusters of
For statistically rigorous analyses large and chemically diverse test        reactive and unreactive pairs of atoms, this will be a sign that our
sets have to be investigated. Automated algorithms allow to carry            methodology cannot cut deadwood in reaction space with sufficient
Laureates: Junior Prizes of the SCS Fall Meeting 2020                                                                                   CHIMIA 2021, 75, No. 4   317

                                                                                  [3] F. Jensen, ‘Introduction to Computational Chemistry’, 3rd ed., Wiley: Hoboken
                                                                                       New Jersey, 2017.
                                                                                  [4] W. M. C. Sameera, S. Maeda, K. Morokuma, Acc. Chem. Res. 2016, 49, 763,
                                                                                       https://doi.org/10.1021/acs.accounts.6b00023.
                                                                                  [5] A. L. Dewyer, A. J. Argüelles, P. M. Zimmerman, Wiley Interdiscip. Rev.
                                                                                       Comput. Mol. Sci. 2018, 8, e1354, https://doi.org/10.1002/wcms.1354.
                                                                                  [6] G. N. Simm, A. C. Vaucher, M. Reiher, J. Phys. Chem. A 2019, 123, 385,
                                                                                       https://doi.org/10.1021/acs.jpca.8b10007.
                                                                                  [7] R. J. Kee, J. A. Miller, T. H. Jefferson, ‘CHEMKIN: A General-Purpose,
                                                                                       Problem-Independent, Transportable, FORTRAN Chemical Kinetics Code
Eel                                                                                    Package’, 1980.
                                                                                  [8] R. J. Kee, F. M. Rupley, J. A. Miller, ‘CHEMKIN-II: A Fortran Chemical
                                                                                       Kinetics Package for the Analysis of Gas-Phase Chemical Kinetics’, 1989.
                                                                                  [9] R. J. Kee, F. M. Rupley, E. Meeks, J. A. Miller, ‘CHEMKIN-III: A FORTRAN
                                                                                       Chemical Kinetics Package for the Analysis of Gas-Phase Chemical and
                                                                                       Plasma Kinetics’, 1996, https://doi.org/10.2172/481621.
                                Ea
                                                                                  [10] S. Hoops, S. Sahle, R. Gauges, C. Lee, J. Pahle, N. Simus, M. Singhal,
                            ?                                                          L. Xu, P. Mendes, U. Kummer, Bioinformatics 2006, 22, 3067,
                                                                                       https://doi.org/10.1093/bioinformatics/btl485.
                                                                                  [11] D. G. Goodwin, H. K. Moffat, R. L. Speth, ‘Cantera: An Object-Oriented
                                                                                       Software Toolkit for Chemical Kinetics, Thermodynamics, and Transport
       Concept evaluation                                                              Processes’, 2017.
                                                                                  [12] D. R. Glowacki, C.-H. Liang, C. Morley, M. J. Pilling, S. H. Robertson, J. Phys.
                                                                                       Chem. A 2012, 116, 9545, https://doi.org/10.1021/jp3051033.
                                                                                  [13] J. Proppe, T. Husch, G. N. Simm, M. Reiher, Faraday Discuss. 2017, 195, 497,
                                                                                       https://doi.org/10.1039/C6FD00144K.
                                                                                  [14] R. Shannon, D. R. A. Glowacki, J. Phys. Chem. A 2018, 122, 1531,
            ξ(N0 ,v0 )                                                   ξ             https://doi.org/10.1021/acs.jpca.7b12521.
                                                                                  [15] J. Proppe, M. Reiher, J. Chem. Theory Comput. 2019, 15, 357,
Fig. 2. Concepts for stable reactant structures (red arrow) shall point                https://doi.org/10.1021/acs.jctc.8b00310.
to a reaction path ξ with barrier �a (orange arrow). Taylor expansion             [16] L.      Pauling,     J.    Am.      Chem.      Soc.      1932,    54,      3570,
of the energy �el around these structures (w.r.t. number of electrons Ν                https://doi.org/10.1021/ja01348a011.
and external potential ν) delivers these concepts as partial derivatives.         [17] L. Pauling, ‘The Nature of the Chemical Bond’, 3rd ed., Cornell University
Reactivity prediction based on them will thus require all of them to be                Press: Ithaca, New York, 1980.
considered, but their local nature will lead to increasing uncertainty            [18] R. G. Pearson, J. Am. Chem. Soc. 1963, 85, 3533,
                                                                                       https://doi.org/10.1021/ja00905a001.
along the path (blue area).
                                                                                  [19] R.      G.     Pearson,      J.    Chem.       Educ.      1968,     45,     581,
                                                                                       https://doi.org/10.1021/ed045p581.
precision. However, if we can define a trust margin that will allow               [20] R.      G.     Pearson,      J.    Chem.       Educ.      1968,     45,     643,
                                                                                       https://doi.org/10.1021/ed045p643.
us to include all of the potentially reactive sites, even at the cost             [21] R. F. W. Bader, ‘Atoms in Molecules: A Quantum Theory’, The International
of including some unreactive ones, then that will still represent a                    Series of Monographs on Chemistry, Clarendon Press: Oxford, 1990, Vol. 22.
reliable basis for an efficient heuristic pruning scheme.                         [22] J. Cioslowski, J. Karwowski, in ‘Fundamentals of Molecular Similarity’, Eds:
    Statistically rigorous analyses are required to quantify the                       R. Carbó-Dorca, X. Gironés, P. G. Mezey, Mathematical and Computational
reliability of reactivity prediction schemes and to discover instances                 Chemistry, Springer US: Boston, MA, 2001, pp 101–112, https://doi.
                                                                                       org/10.1007/978-1-4757-3273-3_7.
in which knowledge of reactant properties can, in fact, allow us to               [23] J. S. M. Anderson, P. W. Ayers, J. Phys. Chem. A 2011, 115, 13001,
make useful predictions about paths and barrier heights. Automated                     https://doi.org/10.1021/jp204558n.
reaction exploration algorithms are a suitable tool to provide the vast           [24] M. Reiher, A. Wolf, ‘Relativistic Quantum Chemistry: The Fundamental
amounts of data required for such analyses. They bear the promise                      Theory of Molecular Science’, 2nd ed., Wiley: Weinheim, 2015.
                                                                                  [25] M. Kowalik, C. M. Gothard, A. M. Drews, N. A. Gothard, A. Weckiewicz, P.
to reduce the bias towards elementary steps that align with known                      E. Fuller, B. A. Grzybowski, K. J. M. Bishop, Angew. Chem. Int. Ed. 2012, 51,
concepts in the test sets and allow for efficient high-throughput                      7928, https://doi.org/10.1002/anie.201202209.
analyses with modern statistical techniques.                                      [26] T. Klucznik, B. Mikulak-Klucznik, M. P. McCormack, H. Lima, S. SzymKuc,
    Notorious machine learning techniques provide efficient                            M. Bhowmick, K. Molga, Y. Zhou, L. Rickershauser, E. P. Gajewska, A.
algorithms to crawl through huge data sets produced in automated                       Toutchkine, P. Dittwald, M. P. Startek, G. J. Kirkovits, R. Roszak, A. Admaski,
                                                                                       B. Sieredzinska, M. Mrksich, S. L. J. Trice, B. A. Grzybowski, Chem. 2018, 4,
mechanism explorations. They are a means to navigate reactivity                        522, https://doi.org/10.1016/j.chempr.2018.02.002.
data efficiently to highlight potentially important correlations that             [27] S. Szymkuc, E. P. Gajewska, T. Klucznik, K. Molga, P. Dittwald, M. Startek,
then make the data accessible to our chemical understanding. As                        M. Bajczyk, B.A. Grzybowski, Angew. Chem. Int. Ed. 2016, 55, 5904, https://
such, there is then no contradiction between the production and                        doi.org/10.1002/anie.201506101.
                                                                                  [28] P. Schwaller, T. Laino, ‘Machine Learning in Chemistry: Data-
handling of huge amounts of (calculated) reactivity data and the                       Driven Algorithms, Learning Systems, and Predictions’, ACS
desire to understand the chemistry in terms of concepts that are                       Symp. Ser.; Am. Chem. Soc., 2019, Vol. 1326; Chap. 4, pp 61–79,
simple, rigorous, and effective. Concerns raised in the literature[111]                https://doi.org/10.1021/bk-2019-1326.ch004.
are likely to be seen resolved in a fruitful interplay of numerical data          [29] P. Schwaller, T. Laino, T. Gaudin, P. Bolgar, C. A. Hunter, C. Bekas, A. Lee,
and conceptual understanding.                                                          ACS Cent. Sci. 2019, 5, 1572, https://doi.org/10.1021/acscentsci.9b00576.
                                                                                  [30] IBM RXN for Chemistry. https://rxn.res.ibm.com/.
                                                                                  [31] M. H. S. Segler, M. P. Waller, Chem. Eur. J. 2017, 23, 5966,
Acknowledgments                                                                        https://doi.org/10.1002/chem.201605499.
    This work has been financially supported by the Swiss National                [32] M. H. S. Segler, M. P. Waller, Chem. Eur. J. 2017, 23, 6118,
Science Foundation (Project No. 200021 182400).                                        https://doi.org/10.1002/chem.201604556.
                                                                                  [33] M. H. S. Segler, M. Preuss, M. P. Waller, Nature 2018, 555, 604,
                                                                                       https://doi.org/10.1038/nature25978.
                                                Received: February 3, 2021        [34] M. Bergeler, G. N. Simm, J. Proppe, M. Reiher, J. Chem. Theory Comput.
                                                                                       2015, 11, 5712, https://doi.org/10.1021/acs.jctc.5b00866.
                                                                                  [35] S. A. Grimmel, M. Reiher, Faraday Discuss. 2019, 220, 443,
                                                                                       https://doi.org/10.1039/C9FD00061E.
[1] J. H. Warren, L. Radom, P. von R. Schleyer, J. Pople, ‘Ab Initio Molecular    [36] R. A. Miranda-Quintana, M. Franco-Pérez, J. L. Gázquez, P. W. Ayers, A. Vela,
    Orbital Theory’, Wiley: New York, 1986.                                            J. Chem. Phys. 2018, 149, 124110, https://doi.org/10.1063/1.5040889.
[2] C. J. Cramer, ‘Essentials of Computational Chemistry: Theories and Models’,   [37] J. L. Gázquez, M. Franco-Pérez, P. W. Ayers, A. Vela, Int. J. Quantum Chem.
    2nd ed., Wiley: Chichester, 2006.                                                  2019, 119, e25797, https://doi.org/10.1002/qua.25797.
You can also read