Not playing by the rules: Play, problems, and human cognition - Laura Schulz Early Childhood Cognition Lab BAICS Talk, April, 2020 - Bridging AI ...
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Not playing by the rules: Laura Schulz Early Childhood Play, problems, and human Cognition Lab BAICS Talk, cognition April, 2020
Not playing by the rules: Laura Schulz Early Childhood Play, problems, and human Cognition Lab BAICS Talk, cognition April, 2020
… and so hard to substantiate “The most irritating feature of “No behavioral concept has play is not the perceptual proved more ill-defined, incoherence, as such, but rather, elusive, controversial, and that play taunts us with even unfashionable than inaccessibility. We feel that play” (E. O. Wilson, 1975) something is behind it all, but we do not know, or have forgotten how to see it.” (Fagen, 1981 in Sutton-Smith, 1997)
What is play for? ❖ Non-cognitive ends ❖ For pleasure (Buhler, 1935; Buytendijk, 1933; Gilmore, 1966)
What is play for? ❖ Non-cognitive ends ❖ For pleasure (Buhler, 1935; Buytendijk, 1933; Gilmore, 1966)
What is play for? ❖ Non-cognitive ends ❖ For pleasure (Buhler, 1935; Buytendijk, 1933; Gilmore, 1966) ❖ For performance (Hawkes. & Bird, 2002; Hingham, 2014; Smith, 1976; Zahavi & Zahavi, 1997)
What is play for? ❖ Non-cognitive ends ❖ For pleasure (Buhler, 1935; BuytendiBaldwin & Baldwin, 1974; Bekoff, 1974; Berman, 1982; Drea et al., 1996; Pellis & Pellis, 1992; Porier & Smith, 1974; Spinka et al., 2001; Soderquist & Serena, 2000jk, 1933; Gilmore, 1966) ❖ For performance (Hawkes. & Bird, 2002; Hingham, 2014; Smith, 1976; Zahavi & Zahavi, 1997) ❖ For peace-making (Baldwin & Baldwin, 1974; Bekoff, 1974; Berman, 1982; Drea et al., 1996; Pellis & Pellis, 1992; Porier & Smith, 1974; Spinka et al., 2001; Soderquist & Serena, 2000)
What is play for? ❖ Non-cognitive ends ❖ For pleasure (Buhler, 1935; BuytendiBaldwin & Baldwin, 1974; Bekoff, 1974; Berman, 1982; Drea et al., 1996; Pellis & Pellis, 1992; Porier & Smith, 1974; Spinka et al., 2001; Soderquist & Serena, 2000jk, 1933; Gilmore, 1966) ❖ For performance (Hawkes. & Bird, 2002; Hingham, 2014; Smith, 1976; Zahavi & Zahavi, 1997) ❖ For peace-making (Baldwin & Baldwin, 1974; Bekoff, 1974; Berman, 1982; Drea et al., 1996; Pellis & Pellis, 1992; Porier & Smith, 1974; Spinka et al., 2001; Soderquist & Serena, 2000)
What is play for? ❖ Cognitive ends ❖ For practice (Groos, 1898; see also Bekoff & Byers, 1998; Burghardt, 2005; Fagen, 1981; Pelligrini, Dupuis, & Smith, 2006)
But see Caro, 1980; Sharpe, 2005; Sharpe & Cherry, 2003; Watson, 1993
What is play for? ❖ Cognitive ends ❖ For practice (Groos, 1898; see also Bekoff & Byers, 1998; Burghardt, 2005; Fagen, 1981; Pelligrini, Dupuis, & Smith, 2006)
Why do we play? ❖ Cognitive reasons ❖ For practice (Groos, 1898; see also Bekoff & Byers, 1998; Burghardt, 2005; Fagen, 1981; Pelligrini, Dupuis, & Smith, 2006) ❖ For prediction (Berlyne, 1966; Bonawitz, van Schijndel, Butler & Markman, 2012; Bonawitz, et al., 2011; Buchsbaum, Bridgers, Skolnick Weisberg, & Gopnik, 2012; Chitnis, Silver, Tenenbaum, Kaelbing, & Perez, 2020; Cook, Goodman, & Schulz, 2011; Jirout & Klahr 2012; Gottleib, Oudeyer, Lopes, & Baranes, 2013; Florensa, Held, Geng, & Abbeel, 2017; Gopnik & Walker, 2013; Haber, Mrowca, Wang, Li, & Yamins, 2018; Jabria Eysenbach, Gupta, Levine, & Finn, 2019; Kang et al., 2006; Kulkarni, Narasimhan, Saeedi, & Tenenbaum, 2016; Legare, 2012; Oudeyer, Gottleib, & Lopes, 2016; Oudeyer & Smith, 2016; Pathak, Agrawal, Efros & Darrell, 2017; Schmidhuber, 2011; 2013; Schulz & Bonawitz, 2007; Schulz & Standing, 2008; Sim & Xu, 2008; Singh, Lewis, Barto, & Sorg, 2019; van Schijndel,; Visser, van Bers, & Raijmakers, 2015; Wood-Gush & Vestergaard, 1991)
Even infants explore in ways that are sensitive to expected information gain R ES E A RC H | R E S EA R C H A R T I C LE S R ES E A RC H | R E S EA R C H A R T I C LE S violation events, here we gave all infants the such auditory-visual “matching” is the pattern when the novel sound played (experiment 2) violation events, here we gave all infants the such auditory-visual “matching” is the pattern when the novel sound played (experiment 2) same limited visual exposure to the Knowledge- typically observed in studies of infants’ mapping [F(1,36) = 5.349, P = 0.03, partial h2 = 0.129] same limited visual exposure to the Knowledge- typically observed in studies of infants’ mapping [F(1,36) = 5.349, P = 0.03, partial h2 = 0.129] Consistent and Knowledge-Violation outcomes; abilities (21). (18). This confirms that infants’ performance Consistent and Knowledge-Violation outcomes; abilities (21). (18). This confirms that infants’ performance all infants had just 10 s to encode the event out- We found that infants’ learning of the object- in experiment 1 reflected successful learning all infants had just 10 s to encode the event out- We found that infants’ learning of the object- in experiment 1 reflected successful learning come. A univariate analysis of variance (ANOVA), sound mapping depended on whether they had Piantadosi et al. of an object property, rather than heightened Page 20 come. A univariate analysis of variance (ANOVA), sound mapping depended on whether they had of an object property, rather than heightened with looking time to the event outcome as the de- just seen a Knowledge-Consistent or a Knowledge- visual preference for an object that had violated with looking time to the event outcome as the de- just seen a Knowledge-Consistent or a Knowledge- visual preference for an object that had violated pendent variable and event type (Solidity or Con- Violation event. A univariate ANOVA, with learn- expectations. pendent variable and event type (Solidity or Con- Violation event. A univariate ANOVA, with learn- expectations. tinuity) and outcome type (Knowledge-Consistent ing score as the dependent variable and event In experiment 3, we asked whether viola- tinuity) and outcome type (Knowledge-Consistent ing score as the dependent variable and event In experiment 3, we asked whether viola- or Knowledge-Violation) as fixed factors, showed type (Solidity or Continuity) and outcome type tions of expectation enhance learning specif- or Knowledge-Violation) as fixed factors, showed type (Solidity or Continuity) and outcome type tions of expectation enhance learning specif- no main effect of outcome type [F(1,36) = 0.002, (Knowledge-Consistent or Knowledge-Violation) ically about objects that violated expectations, no main effect of outcome type [F(1,36) = 0.002, (Knowledge-Consistent or Knowledge-Violation) ically about objects that violated expectations, NIH-PA Author Manuscript P = 0.96] (18) (table S1), which was as predicted as fixed factors, yielded only a significant main rather than about anything that might follow P = 0.96] (18) (table S1), which was as predicted as fixed factors, yielded only a significant main rather than about anything that might follow given the short encoding window in our de- effect of outcome type [F(1,36) = 10.691, P = 0.002, a violation. We showed infants (n = 10) the given the short encoding window in our de- effect of outcome type [F(1,36) = 10.691, P = 0.002, a violation. We showed infants (n = 10) the sign. Thus, 2any subsequent differences in learn- partial h2 = 0.229]. Infants’ learning scores were continuity violation from experiment 1, with an sign. Thus, any subsequent differences in learn- partial h = 0.229]. Infants’ learning scores were continuity violation from experiment 1, with an ing cannot be attributed to longer perceptual significantly greater after Knowledge-Violation object (i.e., ball) hidden behind the left screen ing cannot be attributed to longer perceptual significantly greater after Knowledge-Violation object (i.e., ball) hidden behind the left screen exposure to the object in the Knowledge-Violation events than after Knowledge-Consistent events but revealed behind the right. After the object exposure to the object in the Knowledge-Violation events than after Knowledge-Consistent events but revealed behind the right. After the object events. events. (Fig. 2A). We then compared infants’ learning (Fig. 2A). We then compared infants’ learning was revealed in the surprising location, the was revealed in the surprising location, the Immediately after this 10-s exposure to the Immediately scores to chance after(zero). this 10-s Infants exposure showed tono theevi- scores to chancereached experimenter (zero). Infants in with showed a new object no evi- (i.e., experimenter reached in with a new object (i.e., outcome of the Knowledge-Consistent or the outcome dence of of learning the Knowledge-Consistent after events consistent or the with dence of learning a block) after events that and demonstrated consistent it had witha hid- a block) and demonstrated that it had a hid- Knowledge-Violation event, we taught all in- Knowledge-Violation object solidity [t(9) event, = –1.088, we Ptaught = 0.31]allorin- con- object den solidity auditory[t(9) = –1.088, property (e.g.,P it= 0.31] or con- squeaked). We den auditory property (e.g., it squeaked). We fants new information about the object in the fants new [t(9) tinuity information = 1.62, Pabout = 0.14] thebut object in thesig- showed tinuity then [t(9) = 1.62,infants’ measured P = 0.14] but showed learning aboutsig- this then measured infants’ learning about this event. The experimenter demonstrated that the event. The learning nificant experimenter demonstrated after violations thatsolidity to object the nificant learning new object. Asafter violations to1,object in experiment solidity 354 we calculated new object.Infants As in first experiment 1, we calculated saw a Familiarization trial as in Experiment 1. All infants then saw the same object had a hidden auditory property (e.g., it object [t(9)had a hidden = 3.092, auditory P = 0.01] property (e.g., con- and spatiotemporal it [t(9) = 3.092, learning P = 0.01] scores and spatiotemporal by determining con- the proportion learning scores by determining the proportion squeaked) by moving it up and down while the squeaked) by moving tinuity [t(9) = 3.715,itPup and down = 0.005] while2A (18) (Fig. theand tinuity of time [t(9)that = 3.715, infants P =looked 0.005] at (18)this (Fig.new and 355 2A object of time solidity thatevent infantsas infants looked in atthethis Surprising condition of Experiment 1: a Target Object was placed new object sound played synchronously from a hidden sound tableplayed S1). synchronously from a hidden table S1). to a distractor object) during the si- (relative to a distractor object) during the si- (relative central 356 at the top of the ramp and then released so that it rolled Figure down and1.passed behind the occluding In location for 2,12we s. Our askeddependent mea- In lentexperiment 2, wesubtracting asked whether this pat- from lent baseline, then subtracting this value from Piantidosi, Kidd, & Aslin, 2014 central location for 12 s. Our dependent mea- experiment whether this pat- baseline, then this value sure was infants’ learning of this object-sound ternthereflected actualoflearning or simply 1(a):indicated at An examplethe proportion display of time from they looked atthree it dur-boxes and an object (a bottle) appearing from behind one sure was infants’ learning of this object-sound tern reflected actual learning or simply indicated proportion time they looked it dur- 357 screen, above whichKPA the redshowing wall protruded. The experimenter then lifted the screen to reveal that mapping. In the test trial, infants saw the tar- mapping. greater In the test to attention trial, infants objects thatsaw hadthe tar- violated greater ing the attention mappingtotest, objects when thatthehadtaught sound ing the mapping test, when the taught sound violated get object from the preceding event and a new getexpectations. object from the As preceding in experiment event1,and a new infants saw expectations. played. We As in experiment found that infants 1, shaped infants did notsaw relationship map 358 played. between We found the object had come look-away thatto infants probability rest on thedid far not map side of (y-axis), and log probability (x-axis), with raw binned dat the wall, as though it had passed through it. As in distractor object resting silently on the stage distractor an object object violateresting the core silently on the principle of stage solidity anthe object sound violate to the thenewcoreobject principle in theof solidity mapping the sound to the newAdditive Generalized object in Model the mapping(Hastie & Tibshirani 1990), both collapsing over subjects. R ES E A RC H | R ES E A R C H A R T I C L E S (baseline; 5 s). For half the infants, the ball was (baseline; (n = 10)5 s). or For half the(n continuity infants, = 10) and the ball werewas then (n test; = 10)theiror continuity learning scores (n = 10) didand notwere differthen from 359 test; Experiment their learning scores 1, infants haddid not differ 10 seconds from to view this surprising outcome. Then the experimenter the target and either the car or the block was thetaught target that and the either the car or the object had a hidden auditory block was taught chance that the=object [t(9) 0.074,had P = a0.94] hidden (table S1). An chance [t(9) = 0.074, P = 0.94] (table S1). An auditory theproperty distractor; thisitexploration Infants’ was reversed for the other andHowever, hypothesis property to infants(e.g., it squeaked). was arbitrary, in the However, it did during 360 reached thatdown independent-samples and picked t totestup and rotated thethat confirmed red wall, placing it so that its front face was visible the distractor; this was reversed for the other (e.g., squeaked). during independent-samples t sense test that confirmed kinds thatof information are relevant the half. testing after violations of expectation notmapping clearly causally relate to the surprising vio- novelnature of thisthe surprising pattern event? Whensignificantly an ob- half. Then the experimenter moved both ob- theThen mapping the experimenter test, we playedmoved both ob- an entirely novel the this pattern test, we played differed an entirely significantly from that 361 differed from that NIH-PA Author Manuscript Our finding that violations shaped infants’ lations (because the sound made by an object servation to infants. conflicts with priorWhereas in Experiment knowledge, an 1 the red wall was revealed to be solid (Figure 2, Panel 4A), jects up and down simultaneously while the jects up and sound (e.g., down simultaneously rattling). This time, whiledid way,infants the not sound (e.g., of experiment rattling). This time, 1, inexplanation which infants infants for itswere did be- taught not of experiment 1, in which infants were taught Liu, Ulman,Tenenbaum & Spelke, 2017 learning in a targeted enhancing learn- does not offer a direct effective learning strategy would be to seek previously taught sound (e.g., squeaking) played previously increasetaughttheir ing only sound about (e.g., proportion objects ofsqueaking) lookingto to relevant played the the tar- observed increase about their havior). the very Besides proportion of looking object learning enhancing that had to the evidence for violated such tar- con- 362 about that inthe could verytheobject Experiment explain 2 the redthat discrepancy wall be-had wasviolated revealed tocon-have a large opening in its face (Figure 2, Panel from a hidden central location (mapping test; fromgetaobject hidden central violation, when thelocation raises a further novel (mapping sound question played test; about the after getarbitrary object[t(18) tinuity when=the mappings novel [like 2.126, P =sound those acquired 0.048] played after by Hence, (18). tween what tinuity [t(18)and was predicted = 2.126, P = 0.048] (18). Hence, what is observed. 10 violations s). For each nature of thewe infant new informationalearned. calculated learning In ex- nonhumanofanimals violations either (22)], do violations solidity [t(9) =of1.453, ex- Older P = children engage violations 363 inof thisexpectation 4B). Infants’ kind of hypothesis looking wasenhanced measured during learning the 20 seconds that followed. 10 s). For each infant we calculated a learning of either solidity [t(9) = 1.453, P = violations of expectation enhanced learning periments 1 to 3, the new information taught pectation privilege the learning of particular testing, performing targeted actions to support score by determining the proportion of time score 0.18] byordetermining continuity [t(9) the=proportion 0.036, P = 0.97] of time(table 0.18] onlyor forcontinuity the object [t(9)involved = 0.036, Pin= the 0.97]violation (table only for the object involved in the violation or rule out possible explanations for an event that infants looked at the target object (relative that infants S1). looked atANOVA, A univariate the target withobject (relative learning score S1).event, A univariate not for ANOVA,unrelatedwith learning objects. score Further, 364 event, not The exploration forwhether unrelated phase objects. that Further, followed wasin- as in Experiment 1. The experimenter emerged (23, in- 24). But it is unknown preverbal Infant learningthe to the new distractor object) during the base- to the as thenew distractorvariable dependent object) during and eventfollowing: base- type (So- as fants’ the dependent failure tovariable and event learn about the new type object (So- fants’test infants actively failure to learn hypotheses about about events, the new object 365 holding line, then subtracting this value from the pro- line, thenorsubtracting lidity Continuity) thisand valuesound Knowledge-Consistent from the(taught type pro- lidity shows or Continuity) that the enhanced and sound learningtype in especiallyshows (taught experi- events that the involving Target Object theviolations enhanced of core (e.g., truck) and a novel Distractor Object (e.g., ball), said, “Look at learning in experi- knowledge. portion of time they looked at the target object portion soundoffrom timethe theyKnowledge-Violation looked at the targetcondition object sound mentfrom 1 wasthe notKnowledge-Violation due to general arousal condition or nov- mentthese!” 1 was and not placed due toboth general arousal or nov- tray. Infants had 30 seconds to explore the Spelke,, Wynn, Xu, Woodward, etc. Event (Solidity) 366 In experiment 4, we first asked whetherobjects in- on the high-chair during the mapping test, when the taught during the mapping of experiment test, when Knowledge-Violation 1 orEvent novel sound (Solidity) from theexperiment taught of elty. experiment When 1taughtor novelaboutsoundanfrom experiment object thatfantswas(N =elty. When taught 40) preferentially about an object that was seek information sound played (table S1). If infants had success- sound 2) asplayed (table yielded fixed factors, S1). If infants had success- only a significant main 2) completely as fixed factors, yielded only perceptually a significant novel (becausemain itfromhadan367completely object objects. that perceptually violated expectationsnovel over (because it had fully learned the object-sound mapping, they fully learned effect of soundthe object-sound type. Infants’mapping, learning they scores effect never of been soundseen type.before) Infants’ butlearning did notscores an object never violate that didbeen seen before) not. Infants saw an event but did not violate should that eitherany accorded with or violated the prin- should increase the proportion of time they were increase significantly thegreater proportionwhen the Knowledge-Consistent of taught time they sound wereanysignificantly expectations, greater when infants the taught showed sound no evidence expectations, infants showed no evidence Event (Continuity) ciples of object solidity or (extending our in- looked at the target object when the sound played; looked playedat the in target object the mapping when the sound played; test (experiment Knowledge-Violation 1) than played in the mapping test (experiment 1) than of learning. of learning. quiry to another principle) object support (18) Event (Continuity) (movies S5 and S6). The solidity events were identical to those in experiment 1 (Knowledge- -0.3 -0.2 -0.1 0 0.1 0.2 0.3 Consistent outcome, n = 10; Knowledge-Violation Map to Distractor Object Learning Score Map to Target Object 417 effect outcome, of(Fig. n = 10) Object 1A). InType, (1, 36) the support = 19.09, p = .00, η2 = .35, and no Condition by Object Type event (Fig. 1C), infants saw an object (e.g., car) either pushed interaction. 418 along a surface while remaining Infants manuallycom- explored equally after seeing a surprising event accompanied by an Infant exploration following: pletely supported, thereby according with expec- Knowledge-Consistent tations about support (Knowledge-Consistent Event (Solidity) 419 explanation outcome, (Mpreference n = 10), or pushed = -4.50, over the surfaceSEM = 4.48, CI [-10.73, 1.73]) and after seeing an expected event edge without falling, thereby violating expect- NIH-PA Author Manuscript Knowledge-Violation Event (Solidity) ations about 420 support = (Mpreference (Knowledge-Violation 2.38, SEM = 2.76, out-CI [-4.66, 7.86]). come, n 368 = 10) (25). As before, we limited infants’ visual exposure to the event outcomes; a uni- variate ANOVA, with looking Knowledge-Consistent 369 Figure time to the 2. Schematic ofevent Experiment 2. The experimenter released the Target Object at the top of Event (Support) outcome as the dependent variable and event Knowledge-Violation type (Solidity or Support) and outcome type Event (Support) 370 the ramp (Panel 1). The Target Object rolled down the ramp and disappeared behind an (Knowledge-Consistent or Knowledge-Violation) as fixed factors, showed no main effect of Knowledge- Knowledge- Knowledge- Knowledge- Knowledge- -30 Knowledge- -20 -10 0 Knowledge- Knowledge- 10 20 30 Knowledge- Knowledge- outcome type Knowledge- Knowledge- [F(1,36) = 0.794, P = 0.379] (18) Consistent Violation Consistent Consistent Violation Prefer Violation Distractor Object Exploration Score Consistent Consistent Prefer Target Object Violation Violation (table S2). Consistent Violation After infants saw the outcome of the solid- 17 ity or support event, we gave them two objects Fig. 1. Knowledge-Consistent and Knowledge-Violation outcomes in experiments Fig. 1. Knowledge-Consistent and1Knowledge-Violation to 4. (A) Solidity events Infant (movies behavior outcomes inS1experiments following: and S2). (B)1 Continuity events to freelyevents to 4. (A) Solidity explore(movies for 60 s:S1the target and S2).object (B) Continuity events (movies S3 and S4). (C) Support events (movies S5 and S6). from the preceding event (e.g., car) and a new (movies S3 and S4). (C) Support events (movies S5 and S6). Knowledge-Consistent Event (Solidity) distractor object (e.g., ball; for half the infants the car was the target and the ball was the Teglas, et al., 2011 Knowledge-Violation distractor, and for the other half this was re- 92 3 APRIL 2015 • VOL 348 ISSUE 6230 92 3 APRIL 2015 • VOL 348 ISSUE 6230 Event (Solidity) sciencemag.org SCIENCE sciencemag.org SCIENCE versed). We calculated infants’ exploration pre- ference scores by subtracting the amount of Knowledge-Consistent time they explored the new distractor object Event (Support) from the amount of time they explored the tar- Knowledge-Violation get object (table S2). We predicted that infants Event (Support) 421 who had seen a Knowledge-Consistent event would show no preference, whereas infants who -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 422 had seenFigure 3. Visual exploration a Knowledge-Violation Dev Sci.event preference would Author scores manuscript; (looking available to Target in PMC Object 2015 May 01.– looking to Distractor More Dropping Action Tendency Score (z-score) More Banging prefer to explore the object that had just vio- lated their expectations. A univariate ANOVA, Perez & Feigenson, 2020 423 Object) in Experiments 1 and 2. Errors bars reflect SEM. * p < .05. with infants’ exploration preference score as Fig. 2. Results from experiments 1 and 4. (A) Infants’ learning after Knowledge-Consistent and the dependent variable and event type (Solid- Stahl & Feigenson, 2015 Knowledge-Violation events in experiment 1. Bars represent average learning scores (proportion of looking at target object during mapping test minus proportion of looking at target object during baseline). (B) Infants’ exploration after Knowledge-Consistent and Knowledge-Violation events in experiment 4. Bars represent looking at and/or touching the target object minus looking at and/or 424 ity or Support) and outcome type (Knowledge- Consistent or Knowledge-Violation) as fixed factors, yielded a significant main effect of out- 425 Finally, as a further test of whether explanations drove infants’ exploratory behavior, we come type [F(1,36) = 5.933, P = 0.02, partial h2 = touching the new distractor object. (C) Infants’ exploratory behaviors on the target object after 0.14]: Infants who had seen the Knowledge- Knowledge-Consistent and Knowledge-Violation events in experiment 4. Bars represent infants’ 426 examined Violation individual event explored the targetdifferences object more in infants’ looking times and their possible link to subsequent z-scored object-banging behaviors minus z-scored object-dropping behaviors. All error bars rep- than infants who had seen the Knowledge- resent SEM. Consistent event. We then compared infants’ 427 exploration. Because we had hypothesized that seeing an explanation for a surprising event SCIENCE sciencemag.org 428 would abolish surprise-induced 3 APRIL 2015 • VOL 348 ISSUE 6230 93exploration, we also predicted that the longer infants looked at 429 the explanation (i.e., the red wall with the opening in it), the less they would subsequently 430 explore the Target Object. That is, the more infants processed the explanation, the less drive they
• Introduced to “birthday machine” and two objects – Plays happy birthday when “zando” is on top – Does nothing when “not a zando” is on top And children’s exploration • Asked becomes increasingly counterfactuals sophisticated – “if thisthrough early one was not a zando childhood what would happen if we put Advanced Review EDs to the child. A short video demo of the toy can be found at: https://osf.io/e2hwq/ wires.wiley.com/cogsci iew only=bbf6c632dcb646f9a6dc0ac12379ef99. it See onalsothe machine?” Figure 1. – “if this one was a zando, T.J.P. van Schijndel et al. / Journal of Experimental Child Psychology 131 (2015) 104–119 107 have clear hypotheses on children’s knowledge acquisition, we did this in an exploratory manner as 108 opposed T.J.P. to a confirmatory van Schijndel one. et al. / Journal of Experimental Child Psychology 131 (2015) 104–119 1.3 Procedure Method what would happen if we put The experiment was designed for Rule 1 children; the evidence either conflicted with or confirmed the Rule 1Participants group’s theory. The responses on the pre-task were used to select the Rule 1 group. How- ever, because the final classification into theory groups was performed after data collection on the basis of the The total sample before classification into theory groups consisted of 102 4- to 9-year-olds (45 boys results of the latent variable technique (see Appendix), all children were administered and 57 girls, Mage = 66.07 months, SD = 15.56) who were recruited from two primary schools. An addi- ch child was tested in a private room inside the preschool. The experimenter and the all phases tional of the12experiment. children were Importantly, recruited but notweincluded report in only the the results analyses; of theeight of these, Rule 1 group children werein this article it on the machine?” because no clear because excluded hypotheses could an error be formulated was made about in administering theoreffect the pre- of (seven post-task evidence onpushed children the other theory the and group’s play light button during the pre- or post-task and got feedback, and for one child the test leader did learning. not time the free play episode) and four children were excluded because no complete video-recordings of the free play episode were available. The sample characteristics were chosen to allow for an optimal ild sat at a table, facing each other. On the table was the occluder with the toy inside. Introduction to the shadow discrimination school age range, The experimenter machine of a Rule introduced 1 group. Even though we expected to find Rule 1 children mainly in the pre- the sample’s ageshadow the range wasmachine taken widerbytopointing guarantee out sufficient power sources the light for reliably and the pup- classifying children into theory groups. When using a latent variable technique to detect subgroups pets of different sizes and by demonstrating how the puppets could be placed close to or farther away of children, the subgroups need to be large enough to be separated from each other. In the preschool from the light sources. She then to finddemonstrated only a small grouphow to make using the shadows. She therefore, placed two equally Bonawitz, Schijndel, Friel & Schulz, 2012 age range, we expected of children a more advanced rule; sized puppets at equal we included olderdistances from children in the thetolight sample ensuresources, pushed that we could detectthe light button, this advanced and(see rule group said, ‘‘Do you FIGURE 2 | Various reactions from a child interacting with two ‘Blicket Detectors’ (originally used by Gopnik and Sobel65 ), from the protocol Appendix). This way, we avoided preschoolers having an advanced rule being incorrectly assigned to see the shadows? This one [pointing to the left shadow] is equally big as this one [pointing to the right the Rule 1 group. In Results, we describe the Rule 1 sample after classification into theory groups. described in Legare.66 shadow]. They are the same.’’ Legare, 2012; Sobel & Legare, 2014 Buchsbaum, Bridgers, Weisberg & Pre-task Materials yesterday and whether it rains today are statistically the detector dependent on the presence of another Gopnik, 2012 The shadow machine, the setup of the shadow task (Inhelder & Piaget, 1958; Siegler, 1978, 1981), The experimenter was used for all introduced four phases of the thepre-task experiment.byThe saying, ‘‘Now machine we are consisted going of two lightto play aa game. sources, screen Each time I related, as is whether it rains today and whether it object—that is they examined whether children obey will put puppets placed 50in cmplace. from the You lightthen sayand sources, whether youcould puppets that think that the be placed shadow between onsources the light this side and will be the will rain tomorrow. Given those dependencies, it is the Markov assumption. Three- and 4-year-olds were biggest [pointing screen (seetoFig. the1). left Whenside of the a button wasscreen], the pressed, the shadow lights on this(they were activated sidestayed willlitbeasthe long biggest as the [pointing button was held) and shadows of the puppets were portrayed on the screen. There were two small to the right side of the screen], or that they will be the same.’’ She then administered 12 items to the also true that raining yesterday and raining tomor- trained to know that objects that activated the detector puppets (7.5 ! 2.25 cm) and two large puppets (10 ! 3 cm) that could be placed at three distances child: six size from items, the lightinsources which(10, the20,size and of 30 the cm). puppets was varied Relative shadow but the size depended ondistance from both the size the puppets to of the row are dependent. The Markov Assumption states were called ‘blickets’. Then, children observed a set of the light sources object and was thekept constant distance from the (see Fig.to2A), object andsources the light six distance items, (the distance in which from the lightthe distance sources to from the that raining yesterday and tomorrow are independent trials in which objects either independently activated puppets tothethescreen lightwas kept constant). sources was varied but the size of the puppets was kept constant (see Fig. 2B). The given the knowledge of whether it rains today. The the machine, or did so only dependent on the presence items wereProcedure administered in one of two fixed semi-random orders. For each item, the experimenter put only influence raining yesterday has on raining tomor- of another object. Specifically, on the one cause trials, two puppets in place and said, ‘‘I put this puppet here and this puppet here. When I make the shadows, which one will be the Children were biggest? This oneby[pointing tested individually one of twoto the left side experimenters in aof the screen], private thisschool. room at their one [pointing to row is through whether it rains today. children were shown two objects. One object (A) acti- The child and experimenter sat at the same side of a table facing the shadow machine. The child the right side of the screen], or will they be the same?’’ Importantly, during the pre-task, the child did How could we examine whether children are vated the detector by itself. The other object (B) did was first introduced to the machine and then participated in four experimental phases: pre-task, evi- not see shadows and, therefore, did not dence exposure, free play episode, andget any feedback. post-task. Responses The total experiment tookon the pre-task approximately were scored tri- 20 min. reasoning about the relations among events using the not. Children then saw objects A and B activate the chotomously: correct, incorrect ‘‘the same,’’ or incorrect not ‘‘the same.’’ A latent variable technique Markov assumption? One difficulty in answering this detector together (twice). Children labeled only object was used to determine children’s naive theories on the basis of these trichotomous responses. This question is that we need a method that allows us to A as a blicket. Even though object B activated the procedure is explained in the Appendix. test whether children recognize the conditional inde- detector 2 out of the 3 times it was placed on it, it only Evidence exposure pendence relations among events separately from their did so dependent on the presence of object A. If chil- To enable random assignment (stratified by age and sex) of Rule 1 children to the evidence prior knowledge about how these events are related. dren reasoned according to the Markov assumption, conditions (conflicting and confirming condition), the pre-task scores were used to perform an One such experimental paradigm was developed by they would not use the positive association between Gopnik and Sobel,65 who introduced children to a object B and the machine’s activation to infer the effi- Fig. 1. The shadow machine. During test administration for the study described in this article, no people but the child and the ‘blicket detector’ (see Figure 2), a machine that lights cacy of object B, but rather recognize that such efficacy experimenter were present. Photography: Hanne Nijhuis. up and plays music when certain objects are placed is conditionally dependent on the presence of object upon it. The detector presents a novel, nonobvious A. Remove object A from the equation, and object B property of each object: its activation potential. (The lacks efficacy. Children reasoned in this manner and machine is controlled through an ‘enabling’ switch. stated that object B was not a blicket. In contrast, When the switch is on, any object will activate the in an analogous two cause condition, in which an detector. When it is off, no object will activate the object activated the detector independently two out of detector). Because the machine is novel, children three times, children were likely to label the B object a have few expectations about what kinds of objects blicket. Here, object B activates the machine indepen- Fig. 2. Examples of different types of items and experiments on the shadow machine. For the pre- and post-tasks, size items (A) activate it. dent from object A, and thus children should use the and distance items (B) were used (without feedback/shadows). For the evidence exposure, a conflict item (C) was used for the associations they observe.11 Van Shinjndel, Visser, Van Bers, Using this paradigm, researchers have found that conflicting condition and a confound item (D) was used for the confirming condition (with feedback/shadows). For the free play episode, the experiments that children performed were assigned to the same categories or in the categories of equal items (E) or children treat objects that activate the detector by Various investigations have extended these irrelevant items (F). These figures present examples of the different types of experiments (L = large puppet, S = small puppet), themselves differently from objects that only activate findings to younger children,37 other domains of Raijmakers, 2015 although other variations are possible. Irrelevant experiments also included experiments with one, three, or four puppets. © 2014 John Wiley & Sons, Ltd. Wu & Gweon, 2019 Figure 1: Design and procedure Schulz & Bonawitz, 2007
the box for both the Uncertainty and exploratory playtest trials. A activate “hiding theme song) from (9 marbles actually go into the box) mask the soun placed into the h Max Siegel Procedure. Chi to the task as a g matic of task in Experiments Rachel Magid 1-3. Placem their goal was t Siegel, Magid, Pelz, Tenenbaum, & Schulz, in press, Nature Communications
n motion (i.e., excluding any pauses; see SI). Here we report the results of the since the relationship between uncertainty and exploration may be best ng time the children could have been planning subsequent actions and A B data they generated but the primary results hold for all measures (see Fig. 3A, Probability Probability ize for individual differences in children’s exploratory behavior, we computed d spent exploring on each trial as a proportion of the child’s total playtime ls, and multiplied this proportion by the number of trials in the experiment. less than 1 represents less playtime Internal Numerical Representation (and a proportion more than 1, more Internal Numerical Representation uld be expected if children distributed their playtime evenly across trials. proportional playtime to control for individual differences in length of play, all Models of (log) untransformed children’s playtimeinternal representation reported in seconds of (see SI). y the number, showing of discriminability (A)different normal distributions contrasts, we with adopted a variant of the ectionfixed model variance in which defined over logarithmic shaking a box with space m marbles (but in it would produce a awn visualized from someover linear space) probability and (B) normal distribution over a high-dimensional acoustic distributions be projected downwith to avariance proportional one-dimensional space to mean of abstract numerosity defined sentations inover linear space. number system (Dehaene). We modeled the the approximate tion for each auditorily Measure perceived number the discriminability of each as a normal distribution on a log with equal variances contrast between ! but llogarithmically and m marbles spaced means, and computed the each contrast between l and m marbles presented in Experiments 4-7 in terms |&'( &* | ex " # = , where ,- = log 1 and ,2 = log 3. See SI for a summary of + able S1), as well as a discussion of alternative ways of estimating ncluding different mathematical models, and an empirical estimate from psychophysical data), which produce nearly identicalNresults = 96, for 24 our purposes. children ages 4-8 years (mage= 5.83 years) ren’s intuitions about task difficulty as proportional to this d’ measure. Note
VIDEO
Children’s exploration precisely tracked the discriminability of the contrast … A B A B Experiment 4 Experiment 4 Experiment 4 Experiment 4 Experiment 5 Experiment 5 Proportion Playtime * 4 Experiment 5 Experiment 5 Proportion Playtime * 4 Experiment 6 Experiment 6 Experiment 6 Experiment 6 Experiment 7 Experiment 7 Experiment 7 Experiment 7 0.95 r = 0.94 0.95 r = 0.94 r = 0.92 0.89 0.89r = 0.92 Negative Discriminability Negative Discriminability CC Video coded (including “thinking” time) Arduino motion sensor Siegel, Magid, Pelz, Tenenbaum, &Figure Schulz, 3.inChildren’s Figure press, Nature 3. Children’s Communications proportional proportional
0.95 r = 0.94 .. and was independent of the actual number of marbles inC the box. Figure 3. Children’s exploration times as negative discriminab Proportion Playtime * 4 across Experiments 4 by hand (A) or by the children’s exploratio strongly with the diff discrimination. By co exploration was unre number of marbles in bars indicate SEMs. Number of Marbles In Box Strikingly, children’s exploration time was independent of the number of marbles ac box (Fig. 3C; β=0.0065, 95% CI [-0.0094, 0.022]). Thus, although the sensorimotor of shaking Siegel, Magid, Pelz, a box containing Tenenbaum, only & marbles one or two Schulz, in press, was Nature quite Communications different from shakin
Why do we play? ❖ Cognitive reasons ❖ For practice (Groos, 1898; see also Bekoff & Byers, 1998; Burghardt, 2005; Fagen, 1981; Pelligrini, Dupuis, & Smith, 2006) ❖ For prediction (Berlyne, 1966; Bonawitz, van Schijndel, Butler & Markman, 2012; Bonawitz, et al., 2011; Buchsbaum, Bridgers, Skolnick Weisberg, & Gopnik, 2012; Chitnis, Silver, Tenenbaum, Kaelbing, & Perez, 2020; Cook, Goodman, & Schulz, 2011; Jirout & Klahr 2012; Gottleib, Oudeyer, Lopes, & Baranes, 2013; Florensa, Held, Geng, & Abbeel, 2017; Gopnik & Walker, 2013; Haber, Mrowca, Wang, Li, & Yamins, 2018; Jabria Eysenbach, Gupta, Levine, & Finn, 2019; Kang et al., 2006; Kulkarni, Narasimhan, Saeedi, & Tenenbaum, 2016; Legare, 2012; Oudeyer, Gottleib, & Lopes, 2016; Oudeyer & Smith, 2016; Pathak, Agrawal, Efros & Darrell, 2017; Schmidhuber, 2011; 2013; Schulz & Bonawitz, 2007; Schulz & Standing, 2008; Sim & Xu, 2008; Singh, Lewis, Barto, & Sorg, 2019; van Schijndel,; Visser, van Bers, & Raijmakers, 2015; Wood-Gush & Vestergaard, 1991)
SWITCH Schulz, Gopnik, & Glymour, 2007, Developmental Science
Interventions on each causal structure will produce different patterns of evidence. 1 2 S S 3 4 S S Schulz, Gopnik, & Glymour, 2007, Developmental Science
42
Problems and play ❖ Play might be pleasurable, let us show off, help us bond, teach us real life skills, and improve our predictive models of the world … ❖ But when a child tries to reunite a toy octopus with his mama in a stacking cup … ❖ or catch a velociraptor by sticking play dough under the couch ❖ … these accounts don’t seem very satisfying. ❖ We can use play to assess children’s sensitivity to uncertainty and expected information gain but that’s not necessarily the best characterization of what children use it for.
Problems and play ❖ However, I do think one thing is true of play — both exploratory play and pretend play ❖ Children make up problems, and invent plans to try solve them ❖ Can I turn the gears into puppets? Can I hear inside? Can I reunite the octopus with her mom? ❖ The sheer arbitrariness of these problems may be the point … ❖ the problems and solutions don’t matter. ❖ what matters is the ability to invent new problems and use them to bootstrap new plans and solutions. ❖ Why? Because the hard problem of cognition is not learning (even deep learning networks can do that ; )). ❖ The hard problem of cognition is thinking.
Program induction workshop: Cogsci 2018 “Coming up with the right hypotheses and theories in the first place is often much harder than ruling among them.” ”How do people, and how can machines, expand their hypothesis spaces to generate wholly new ideas, plans, and solutions?” “How do people learn rich representations and action plans (expressable as programs) through observing and interacting with the world?”
Problems and play ❖ So why create problems you don’t have? Why set arbitrary goals? ❖ Because problems and goals — all problems and goals — support search. ❖ Problems impose valuable constraints on hypothesis generation and planning.
Problems are rich in all kinds of information ❖ Consider the information contained in question words (even before we get to the content of the questions) … Where? When? Who? What? How? Which? Why?
this rule or empirical engage in some Why generalization hold? unexpected thought does …? Why did … she…? or action? … Why did this unexpected event Why did occur? the it’s a joke …? chicken … …? … some desirable or Why Why did it’s a rant seemingly possible can’t …? thing happen? … Trump … …?
Problems are rich in all kinds of information ❖ We know a lot about our problems before we can solve them ❖ We can have a sense of being on the right track well before we can better predict or explain observed data… ❖ We can think something is a “great idea” even when we know it’s wrong. ❖ We might be able to constrain our proposals on two separate dimensions ❖ How well they fit the data — “TRUTH” ❖ How likely they would be to solve our problem if they were true — “TRUTHINESS”
Problems are rich in all kinds of information ❖ Consider the information contained in question words (even before we get to the content of the questions) … Where? When? Who? What? How? Which? Why?
Problems are rich in all kinds of information ❖ Consider the information contained in all kinds of representations — independent of domain. Continuous vs discrete Proportion Unimodal vs bimodal Variance Linear vs exponential Cyclic amplitude
Which group of aliens bought which set of rocket ships? Maddie Pelz Pelz & Schulz, in prep
Which set of factories produced which set of candies? Year 1 Year 2 Year 3 Year 4 Year 1 Year 2 Year 3 Year 4 Year 1 Year 1 Year 2 Year 3 Year 4 Pelz & Schulz, in prep
Pelz & Schulz, in prep
Pelz & Schulz, in prep
Problems and play ❖ Note that there is no fact of the matter … These answers are not necessarily right but at least they could be right. ❖ The structure of the problem allows children to endorse plausible hypotheses that go beyond the data.
Problems and play ❖ The point of play is not that the ideas children propose in play are accurate or even verifiable or that the plans are achievable (i.e., play is not chiefly about getting the world right) ❖ The point of play is that it sets up problems — and gives you new things to be right (or wrong) about. ❖ We may be motivated to play and explore, not only by the progress we make in learning (e.g., Oudeyer, Gottleib, & Lopes, 2016) but by the progress we make in thinking. ❖ The fact that a problem contains enough information to let us generate a thought or plan might itself motivating — independent of whether those ideas are right are wrong.
Problems and play ❖ Finally, although my work and many other people’s has been motivated by treating play as a kind of rational exploration … ❖ the problems we set up in play differ in critical ways from those we undertake when we are not playing.
Goal-directed action Goal-directed play “There are stickers in the box. Can “There are stickers in the box. you go in here and try to get one?” Can you play in here and try to get one?”
Goal-directed action Goal-directed play “There are stickers in the box. Can “There are stickers in the box. you go in here and try to get one?” Can you play in here and try to Experiment 2 100% get one?” Percentage of subjects 75% 50% 25% 0% Instrumental play “I need a pencil to fill out this Goal “I need to fill out this form. While form. Can you go over there Retrieval Choice I’m doing that can you play over and try to get a pencil?” Efficient there and try to get a pencil?” Inefficient Junyi Chu Chu & Schulz, in review
Problems and play ❖ Children violate principles of rational action in play, but they do not act either randomly or irrationally ❖ Even when children opt for the harder task, they behave efficiently with respect to that task (adhering close to the twirly path, jumping directly towards the pencils …) ❖ Behavior in play is conditionally rational …rational with respect to a manipulated utility function.
Problems and play ❖ In play, neither the costs nor the reward are real; If they are, you are no longer playing. ❖ In this sense, all play is pretend play. ❖ In play, we “hack” our own utility function to create novel goals. ❖ And as a species, this allows us to take on innumerable goals.
We populate the world with problems of our own making— we want to end poverty, cure cancer, write the Great American novel, achieve enlightenment, eat more hot dogs than anyone else …
Problems and play ❖ One reason our motivational system may be as rich as it is, is because the diversity of goals confer an advantage for learning ❖ As humans, we can endogenously fix our utilities on anything. ❖ Epistemic goals are not the only — or necessarily even the best — ❖ Being able to want anything at all (as a species) might let us explore a vast space of possible plans and ideas.
Problems and play ❖ The world is full of unknown unknowns — as great as our uncertainty about the world is, there are even more things we don’t even know we don’t know. ❖ If we only explored in ways that tried to maximize expected information gain — we would miss the chance to gain unexpected information. ❖ Creating new problems with no obvious utility in themselves may be the best way to discover (genuinely) new things
What is play for? ❖ For pleasure ❖ For performance ❖ For peacemaking ❖ For practice ❖ For prediction ❖ For posing new problems
Thanks! Max Siegel Rachel Magid Maddie Pelz Junyi Chu
You can also read