NP Bayes Functional Regression for a PK/PD Semi-Mechanistic Model: A talk for advertisement and discussion

Page created by Neil Parsons

Cars & Machinery

English

Like
Share
Embed
Fullscreen
Slides
Download HTML
Download PDF
Abuse

←

→

Page content transcription

If your browser does not render page correctly, please read the page content below

NP Bayes Functional Regression for a PK/PD Semi-Mechanistic Model: A talk for advertisement and discussion

NP Bayes Functional Regression for a PK/PD
         Semi-Mechanistic Model:
  A talk for advertisement and discussion

                     Michele Guindani
(joint work with Peter Müller, Gary Rosner and L. Friberg)

                  Department of Biostatistics
                UT MD Anderson Cancer Center

                 SAMSI, Wed 14th, 2010

Outline

    ¬ Semi-mechanistic PK/PD models

     Bayesian joint PK/PD modeling

    ® The advantage of a Non-parametric (NP) Bayesian Approach

    ¯ A pair of plots to prove we can do things.

    ¯ Discussion, problems, issues þ things to work on!

¬ Semimechanistic PK/PD models: what are those?

 Bayesian joint PK/PD Modeling

® Non-parametric (NP) Bayes

¯ We can do things!

PK/PD from the perspective of a dummy statistician

    + Population Pharmacokinetics (PK) studies the behavior of a
      drug in the body over a period of time (absorption,
      distribution, metabolism, excretion).

       Often, we say that PK is the study of what the body does to
       a drug.

    þ Plays a pivotal role in direct patient care for the construction
      of patient dosing strategies.

PK Time Concentration profiles
      GOAL: quantitatively assess some typical pharmacokinetic
      parameters, upon observation/estimation of the individual plasma
      concentration profiles over time

      Source:Wikipedia!

Data

   ý Data usually follow a precise administration/measurement
     schedule.

       For example, in one of the studies reported in Friberg et al,
       2002, data are from 45 patients with different cancer forms,
       who received paclitaxel in a total of 196 cycles (varying
       between one and 18 cycles per patient; median, three cycles),
       were analyzed.

       Paclitaxel was administered as a 3-hour infusion, with an
       initial dose of 175 mg/m2 every 3rd week. Dose adjustments
       were guided by hematological and nonhematological toxicity,
       which resulted in a final dose range of 110 to 232 mg/m2.
       Plasma concentrations were monitored on course 1 and course
       3, with an average of 3.5 samples per patient and course.

PK model

   ý Typically, the PK models try to artificially “replicate” what
     happens to the drug once in the body.
   ý Graphically, we can represent a simple model as follows (from
     AdaptGuide)

      This is described as a linear two-compartment model. In our
      application, we use it for modeling the unbound plasma
      concentrations of paclitaxel.

ý Mathematically, the response (concentration) in a sample of
  individuals is assumed to reflect both measurement error and
  intersubject variability,
        K
       yij = fij (θiK , xij ) + εij ,   i = 1, . . . , N, j = 1, . . . , ni ,

   where fij (θiK , xij ) is a function for predicting the jth response
   in subject i, θi is a vector of individual PK parameters and xij
   is a vector of known quantities or covariates.
          k denotes the observed concentration for individual i at
   Here yij
   time j, or Ci (tj ).

A system of ODE for the PK

    ý The graphical scheme shown above represents systematically the
      following system of differential equations

                                CL CLd                   CLd
                                         
               dxc (t)
                       =−          +          xc (t) +       xp (t) + r(t)
                 dt             Vc   Vc                  Vp
               dxp (t) CL         CLd
                      = c xc (t) − p xp (t)
                 dt    V          V
                       xc (t)
                 C(t) = c
                        V

        where CL is the system clearance, V c and V p are the volumes of
        distribution of the central and peripheral compartment, CLp is the
        intercompartmental clearance and r(t) is the rate of infusion into
        the first compartment.

    I   C(t) þ time course of the unbound plasma concentrations.

A system of ODE for the PK

    ý The system of linear ODEs for PK modeling is “doable”: it can be
      estimated with non linear least square techniques (ODE solver - but
      see Paolo Vicini’s and Lang Li’s talks yesterday - we need
      prior/penalty terms to enforce identifiability).
    ý PK modelers most often assume that

                                 θi = g(φ, xi ) + ηi

       where φ is a vector of population parameters, xi is a vector of
       known individual specific covariates (held constant across cycles)
       and ηi is the individual random effect.
       Estimation is done through non linear mixed effects models and
       related algorithms. For example, in NONMEM the mixed effect
       model is linearized by using the first order Taylor series expansion
       with respect to ηi (and εij ). (check S. Gosh, R. Leary, P. Vicini)

A system of ODE for the PK

    ý The system of linear ODEs for PK modeling is “doable”: it can be
      estimated with non linear least square techniques (ODE solver - but
      see Paolo Vicini’s and Lang Li’s talks yesterday - we need
      prior/penalty terms to enforce identifiability).
    ý PK modelers most often assume that

                                 θi = g(φ, xi ) + ηi

       where φ is a vector of population parameters, xi is a vector of
       known individual specific covariates (held constant across cycles)
       and ηi is the individual random effect.
       Estimation is done through non linear mixed effects models and
       related algorithms. For example, in NONMEM the mixed effect
       model is linearized by using the first order Taylor series expansion
       with respect to ηi (and εij ). (check S. Gosh, R. Leary, P. Vicini)

Dependence on Covariates

    ü The PK model is often completed by equations, relating specific
      covariates to the parameters of the model; for example,

                    log CL =β1 + β2 × (BSA) + β3 × (BIL)
                     log V c =β4 + β5 × (BSA)
                     log V p =β6 + β7 × (BSA),

       where BSA is the body surface area and BIL is bilirubin
       (hematoidin, excreted in bile).
       For each subject i,

                 θiP K = (β1,i , β2,i , β3,i , β4,i , β5,i , β6,i , β7,i , CLpi )

Dependence on Covariates

    ü The PK model is often completed by equations, relating specific
      covariates to the parameters of the model; for example,

                    log CL =β1 + β2 × (BSA) + β3 × (BIL)
                     log V c =β4 + β5 × (BSA)
                     log V p =β6 + β7 × (BSA),

       where BSA is the body surface area and BIL is bilirubin
       (hematoidin, excreted in bile).
       For each subject i,

                 θiP K = (β1,i , β2,i , β3,i , β4,i , β5,i , β6,i , β7,i , CLpi )

Pharmacodynamics

   + Pharmacodynamics (PD) refers to the time-course and
     intensity of drug action or response.

      We can say that PD studies what a drug does to the body.

   + The pharmacologic response depends on the drug binding to
     its target. Receptors determine the quantitative relationship
     between drug dose and pharmacologic effect. The
     concentration of the drug at the receptor site influences the
     drug’s effect. Hence, PK and PD are inherently related.

+ Joint kinetic–dynamic modeling is important to predict how
  drug concentration affects the response.

       ý GOAL: Establish relationships between drug
         concentrations and individual responses (e.g, in terms
         of myelosuppression)
       þ produce therapeutic benefits while minimizing
         side-effects
       þ find Optimal Dose/Administration schedule

       “The major challenge for health care professionals involved in
    clinical psychopharmacology is to understand and compensate for
          individual variations in drug response” (Greenblat et. al
        -Psychopharmacology -the Fourth Generation of Progress).

+ Joint kinetic–dynamic modeling is important to predict how
  drug concentration affects the response.

       ý GOAL: Establish relationships between drug
         concentrations and individual responses (e.g, in terms
         of myelosuppression)
       þ produce therapeutic benefits while minimizing
         side-effects
       þ find Optimal Dose/Administration schedule

       “The major challenge for health care professionals involved in
    clinical psychopharmacology is to understand and compensate for
          individual variations in drug response” (Greenblat et. al
        -Psychopharmacology -the Fourth Generation of Progress).

Empirical and Mechanistic Dose/Response models

    + Empirical Dose/Response models relate drug exposure (AUC,
      time above threshold) of anticancer drugs to some measure of
      the drug’s effect, such as the nadir of leukopenia or surviving
      fraction of leukocytes at nadir
    + The Emax model is a common descriptor of dose–response
      relationships,
                                      Emax C
                              E=
                                    EC50 + C
       where Emax is the maximum response of the system to the
       drug and EC50 is that concentration of drug producing 50% of
       Emax .

Physiology based Semi-Mechanistic (SM) PK/PD models

   + Physiology based models with parameters that refer to actual
     processes and conditions may be preferable.
       Ideal physiology-based models separate system parameters,
       common across drugs, from drug–specific parameters.

    4 SM PK/PD models describe the entire course of the response
      profile (e.g., leukopenia) by using the entire time course of
      plasma concentration as input.
       (e.g. Minami et al., 1998, 2001, Friberg et al. 2000, 2002,
       2003)

PD model
     For example, Friberg et al. (2002) develop a structural model
     for myelosupression consisting of

   ¬ One compartment representing stem cells and progenitor cells
     (i.e. Proliferative cells, sensitive to drugs – Prol)
    Three transit compartments with maturing cells (– Transit)
   ® A compartment of circulating observed blood cells (– Circ)

PD model

   o The model consists of a system of non-linear differential equations
     (Friberg, 2002)
                                                      γ
                                                 Circ0
          dP rol/dt = kprol P rol (1 − Edrug )            − ktr P rol
                                                 Circ
                    dT ransit1 /dt = ktr P rol − ktr T ransit1
                    dT ransit2 /dt = ktr T ransit1 − ktr T ransit2
                    dT ransit3 /dt = ktr T ransit2 − ktr T ransit3
                        dCirc/dt = ktr T ransit3 − ktr Circ

      The effect of the PK component on the PD model is captured by a
      term like
                    Edrugs = Sl × C(t) or an Emax model

                         θiP D = (Circ0,i , γi , ktr,i , Sli )

PD model

   o The model consists of a system of non-linear differential equations
     (Friberg, 2002)
                                                      γ
                                                 Circ0
          dP rol/dt = kprol P rol (1 − Edrug )            − ktr P rol
                                                 Circ
                    dT ransit1 /dt = ktr P rol − ktr T ransit1
                    dT ransit2 /dt = ktr T ransit1 − ktr T ransit2
                    dT ransit3 /dt = ktr T ransit2 − ktr T ransit3
                        dCirc/dt = ktr T ransit3 − ktr Circ

      The effect of the PK component on the PD model is captured by a
      term like
                    Edrugs = Sl × C(t) or an Emax model

                         θiP D = (Circ0,i , γi , ktr,i , Sli )

PD model

   o Hence, the model explicitly separate between system parameters...
    • mean transit time: M T T = (n + 1)/ktr where n is the number of
      transit compartments
    • baseline: Circ0
    • feedback: γ
    • ..and drug specific parameters: Sl or Emax , and EC50

      The feedback was necessary to describe the rebound of cells
      (overshoot compared with the baseline value Circ0 ). The
      proliferation rate can be affected by endogenous growth factors and
      the G-CSF levels increase when the neutrophil counts are low.

Data
   + The collection of data on the response effect doesn’t seem to
     be as systematic as in the case of the measurements of drug
     concentration.
   + In order to fit the solution of the previous system of ODE’s by
     non-linear least square techniques, we typically need more and
     well spaced points.
                        14
                        12

                                                            ●

                             ●
                        10

                                           ●
                  ANC

                        8
                        6
                        4

                                 ●

                             0       200        400   600       800

                                               time

Data
   + The collection of data on the response effect doesn’t seem to
     be as systematic as in the case of the measurements of drug
     concentration.
   + In order to fit the solution of the previous system of ODE’s by
     non-linear least square techniques, we typically need more and
     well spaced points.
                        9
                        8
                        7
                        6
                  ANC

                        5

                                ●
                        4
                        3
                        2

                            0       200    400   600   800

                                          time

Data
   + The collection of data on the response effect doesn’t seem to
     be as systematic as in the case of the measurements of drug
     concentration.
   + In order to fit the solution of the previous system of ODE’s by
     non-linear least square techniques, we typically need more and
     well spaced points.
                        12
                        11
                        10
                  ANC

                        9
                        8

                                                 ●
                        7

                                 ●
                        6

                             0   200       400       600   800
                                       ●
                                       time

Data
   + The collection of data on the response effect doesn’t seem to
     be as systematic as in the case of the measurements of drug
     concentration.
   + In order to fit the solution of the previous system of ODE’s by
     non-linear least square techniques, we typically need more and
     well spaced points.

                                       ●
                        15
                        10
                  ANC

                        5

                                 ●
                        0

                             0   200        400   600   800

                                           time

¬ Semimechanistic PK/PD models: what are those?

 Bayesian joint PK/PD Modeling

® Non-parametric (NP) Bayes

¯ We can do things!

Bayesian Approach

    o ADVANTAGES:

    • The Bayesian approach allows incorporation of prior
      information (e.g. from existing literature)

    • There are no hidden assumptions: priors make us honest!

    • Inference on the parameters of interest is summarized in a
      posterior distribution, with proper assessment of the
      estimation uncertainty.

    • The estimation of the PK/PD parameters can be obtained
      simultaneously.

Bayesian Approach

    o DATA: concentration and ANC measurements for
      i = 1, . . . , N patients
                      
                        C (t) = f P K (θiP K ) + εi
                       i
                      
                      

                 
                  Circi (t) = f P D (θiP K , θiP D ) + ηi
                 
                 

                with      εi ∼ N (0, σ12 )    ηi ∼ N (0, σ22 )

                       θi = (θiP K , θiP D )∼ N (Θ, Σ)

                            π(σ1 ) π(σ2 ) π(Θ) π(Σ)

Comments related to yesterday’s talks

    þ It’s possible to incorporate covariate information, e.g. by
      assuming
                            θiP K ∼ N (βiK Xi , ΣK
                                                 i )

    þ Prior elicitation and prior regularization are important issues,
      although here I am not concentrating on those - see Johnson’s
      and Thall’s talks yesterday on prior choice.

Bayesian Approach

    o PK models:
      Gelman et al (1996)
      Stroud et al (2001)
      Winbugs implementations:
      Lunn et al (2002)
      Winbugs + Full PK/PD model
      Kathman et al (2007)

   ý Clustering of the patient specific time courses may help
     improve the assessment of the optimal dose for anticancer
     treatments

An argument for Clustering
                    ý MLE estimates for the PD model

                                                                                                                  14
                    6

                                                                                                                  12
                    5

                                                                                                                  10
                        ●
                    4
   ANC patient 3

                                                                                                  ANC patient 7

                                                                                                                  8
                    3

                                                                                                                  6
                                                                    ●

                            ●

                                                                                                                  4
                                                                                                                                                          ●
                    2

                                ●

                                                                                                                        ●

                                                                                                                  2
                                        ●     ●
                                                                                                                              ●
                    1

                                                                                                                                  ●
                                                                                                                                                ●

                                                      ●                                                                                 ●   ●

                                                                                                                  0
                        0                   200               400          600       800   1000                         0         200               400          600   800   1000

                                                                    time                                                                                  time

                                                                                                                       o The different shapes may
                    5

                        ●

                                                                    ●

                                                                                                                            suggest clustering of
                    4

                                                                                 ●

                                                                                                                            patients’ profiles
   ANC patient 14

                    3
                    2

                                    ●
                    1

                                              ●
                                                          ●
                                                                                                                  ý NP Bayes Approach
                        0                   200   ●           400          600       800   1000

                                                                    time

¬ Semimechanistic PK/PD models: what are those?

 Bayesian joint PK/PD Modeling

® Non-parametric (NP) Bayes

¯ We can do things!

NP Bayes: DP Model
  NP Bayes model: Our prior probability is also considered
     “uncertain”,

                        θ | G ∼ G(θ),   G ∼ P (G).

    • One of the most used NP prior is the DP prior.
    • Many alternative definitions are possible (check P. Mueller
      (alias W. Johnson)’s talk on Monday).
      For example,
                                   X∞
                          G(·) =       pk δθk∗ (·),
                                    k=1
               i.i.d.                   Qk−1
      where θk∗ ∼ G∗ , and pk = qk        i=1 (1   − qi ), qi ∼ Beta(1, α),

                             ï G ∼ DP (α, G∗ ),

NP Bayes: DP Model
  NP Bayes model: Our prior probability is also considered
     “uncertain”,

                        θ | G ∼ G(θ),   G ∼ P (G).

    • One of the most used NP prior is the DP prior.
    • Many alternative definitions are possible (check P. Mueller
      (alias W. Johnson)’s talk on Monday).
      For example,
                                   X∞
                          G(·) =       pk δθk∗ (·),
                                    k=1
               i.i.d.                   Qk−1
      where θk∗ ∼ G∗ , and pk = qk        i=1 (1   − qi ), qi ∼ Beta(1, α),

                             ï G ∼ DP (α, G∗ ),

NP Bayes: DP Model
  NP Bayes model: Our prior probability is also considered
     “uncertain”,

                        θ | G ∼ G(θ),   G ∼ P (G).

    • One of the most used NP prior is the DP prior.
    • Many alternative definitions are possible (check P. Mueller
      (alias W. Johnson)’s talk on Monday).
      For example,
                                   X∞
                          G(·) =       pk δθk∗ (·),
                                    k=1
               i.i.d.                   Qk−1
      where θk∗ ∼ G∗ , and pk = qk        i=1 (1   − qi ), qi ∼ Beta(1, α),

                             ï G ∼ DP (α, G∗ ),

NP Bayes: DP Model

  Some properties.

     I   E(G) = G∗ and α is called the mass or precision parameter.
     I   G is discrete ï positive probability of ties of θi ’s.
                       ï Clustering
     I   Predictive distribution (a.k.a. Chinese restaurant process
         or species sampling characterization.)

Unsupervised model-based clustering

NP Bayes Approach
    o In the previous Bayesian model, we substitute the NP
      specification
                
                 C (t) = f P K (θiP K ) + εi
                 i
                
                

                
                 Circi (t) = f P D (θiP K , θiP D ) + ηi
                
                

               with    εi ∼ N (0, σ12 )       ηi ∼ N (0, σ22 )

                 θi = (θiP K , θiP D )|G∼ G
                                    G∼ DP (α, G0 )
                                   G0 ≡ N (Θ, Σ)

                         π(σ1 ) π(σ2 ) π(Θ) π(Σ)

GOAL:

   o We want to provide a coherent probability model that tries to
     address the previously mentioned challenge:
        “The major challenge for health care professionals involved in
        clinical psychopharmacology is to understand and compensate
        for individual variations in drug response”

   þ Cluster the patients according to their PK/PD profiles
   þ Predict an individual PD profiles on the basis of its PK profile
     (or PK parameters)
   þ This is achieved by joint modeling of the PK and PD curves
     and joint inference on the vector parameter θi (þ check back
     Dunson’s talk on Monday)

Some Issues and Challenges

    5 We can model the ODE’s parameters with a DP þ use an
      MCMC algorithm þ describe the full time course of the PK
      and PD and obtain inference on between and within subject
      variability (inter-occasion/inter-individual).
    5 A full, complete, MCMC requires solving the systems of
      ODE’s at each iteration ý it can be slow and painful !!

    5 The system of ODE’s for the PD model is non-linear and
      highly unstable (especially if we have just a few data!)

    5 The likelihood is presumably extremely multimodal

      A possibility: use some approximation of the likelihood; for
      example we could linearize around the value of the MLE
      estimates.

Some Issues and Challenges

    5 We can model the ODE’s parameters with a DP þ use an
      MCMC algorithm þ describe the full time course of the PK
      and PD and obtain inference on between and within subject
      variability (inter-occasion/inter-individual).
    5 A full, complete, MCMC requires solving the systems of
      ODE’s at each iteration ý it can be slow and painful !!

    5 The system of ODE’s for the PD model is non-linear and
      highly unstable (especially if we have just a few data!)

    5 The likelihood is presumably extremely multimodal

      A possibility: use some approximation of the likelihood; for
      example we could linearize around the value of the MLE
      estimates.

Some Issues and Challenges

    5 We can model the ODE’s parameters with a DP þ use an
      MCMC algorithm þ describe the full time course of the PK
      and PD and obtain inference on between and within subject
      variability (inter-occasion/inter-individual).
    5 A full, complete, MCMC requires solving the systems of
      ODE’s at each iteration ý it can be slow and painful !!

    5 The system of ODE’s for the PD model is non-linear and
      highly unstable (especially if we have just a few data!)

    5 The likelihood is presumably extremely multimodal

      A possibility: use some approximation of the likelihood; for
      example we could linearize around the value of the MLE
      estimates.

Some Issues and Challenges

    5 We can model the ODE’s parameters with a DP þ use an
      MCMC algorithm þ describe the full time course of the PK
      and PD and obtain inference on between and within subject
      variability (inter-occasion/inter-individual).
    5 A full, complete, MCMC requires solving the systems of
      ODE’s at each iteration ý it can be slow and painful !!

    5 The system of ODE’s for the PD model is non-linear and
      highly unstable (especially if we have just a few data!)

    5 The likelihood is presumably extremely multimodal

      A possibility: use some approximation of the likelihood; for
      example we could linearize around the value of the MLE
      estimates.

Some Issues and Challenges

    5 We can model the ODE’s parameters with a DP þ use an
      MCMC algorithm þ describe the full time course of the PK
      and PD and obtain inference on between and within subject
      variability (inter-occasion/inter-individual).
    5 A full, complete, MCMC requires solving the systems of
      ODE’s at each iteration ý it can be slow and painful !!

    5 The system of ODE’s for the PD model is non-linear and
      highly unstable (especially if we have just a few data!)

    5 The likelihood is presumably extremely multimodal

      A possibility: use some approximation of the likelihood; for
      example we could linearize around the value of the MLE
      estimates.

Gaussian approximation around the MLE’s

                    θ̂iK |βiK , Xi ∼ N (βiK Xi , Σ̂ki )
         θ̂iD | θiK , βiK , Xi , θiD ∼ N (Ĥi (βiK Xi − θ̂iK ) + θiD , Σ̂D
                                                                         i )

   where θ̂iK , θ̂iD are the MLE estimates and Σ̂ki , Σ̂D
                                                        i the
   corresponding (marginal and conditional) covariance matrices.
   The model is completed by assigning appropriate priors to the
   parameters of interest; in particular,

     θi = (vec(βiP K ), θiP D )|G∼ G
                                G∼ DP (α, G0 )
                               G0 ∼ NP K (β0 , ∆β ) × NP D|P K (θ0D , ∆D
                                                                       0 )

Gaussian approximation around the MLE’s

                    θ̂iK |βiK , Xi ∼ N (βiK Xi , Σ̂ki )
         θ̂iD | θiK , βiK , Xi , θiD ∼ N (Ĥi (βiK Xi − θ̂iK ) + θiD , Σ̂D
                                                                         i )

   where θ̂iK , θ̂iD are the MLE estimates and Σ̂ki , Σ̂D
                                                        i the
   corresponding (marginal and conditional) covariance matrices.
   The model is completed by assigning appropriate priors to the
   parameters of interest; in particular,

     θi = (vec(βiP K ), θiP D )|G∼ G
                                G∼ DP (α, G0 )
                               G0 ∼ NP K (β0 , ∆β ) × NP D|P K (θ0D , ∆D
                                                                       0 )

¬ Semimechanistic PK/PD models: what are those?

 Bayesian joint PK/PD Modeling

® Non-parametric (NP) Bayes

¯ We can do things!

Some plots.
    o Clustering of the MLE estimates

                 250
                 200
                 150
                 100
                 50
                 0

                       3   4            5           6              7   8

                               # clusters across MCMC iterations

Predictive inference.
    o We observe only the concentration profile for some patients
      and want to predict their PD profile

                                        ●
                            6
                            5

                                        ●
                            4
            Concentration

                                        ●
                            3

                                    ●

                                            ●
                            2
                            1

                                                ●

                                                    ●
                                                        ●

                                                             ●
                                                                         ●              ●
                            0

                                0                       10       20          30   40   50

                                                                      time

Predictive inference.
    o Predicted PD profile and comparison with actual (known) data

                            6

                                                                                       ●
            ANC patient 7

                            4

                                ●
                            2

                                          ●
                                              ●                 ●

                                                    ●     ●
                            0

                                0   100       200       300          400   500   600       700

                                                              time

Conclusions/Discussion

    o We can provide a coherent probability model for the analysis
      of PK/PD mechanistic models.

    o By using a NP bayes approach, we obtain inference on
      patients’ clustering according to their concentration/response
      profiles.

    o Eventually, the clustering specification will allow the
      prediction of new patients’ PD profiles on the basis of PK
      profiles (and/or other covariate information)

    o Many challenges and opportunities, connected to the nature
      and availability of the data, the depth of knowledge of the
      individual PK/PD dynamics, the NP machinery we use ad the
      concrete applications.

You can also read