The free food (contrafreeloading) phenomenon: A reviewand analysis - ZOOSnippets
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Animal Leaming & Behavior 1977, 5 (3), 221-235 The free food (contrafreeloading) phenomenon: A review and analysis STEVE R. OSBORNE Arizona State University, Tempe, Arizona 85281 Anima1s will perform an operant response to obtain food when abundant free food is availab1e. These data have implications for current 1earning theories, especial1y in terms of the motiva- tiona1 variables associated with such behavior. The present paper reviews the literature and provides an analysis that suggests that responding for food in the presence of free food is im- portantly controlled by stimulus change attendant upon response-dependent food presenta- tion. This apparent stimulus-reinforcer effect on behavior is compared to that observed in other areas of anima1 1earning research that include preference between schedu1es of response- dependent and response-independent reinforcement, preference between schedu1es of signa1ed and unsigna1ed reinforcement, autoshaping and automaintenance, and self-reinforcement in anima1s. When an animal is provided a choice between demonstrated with both water (Knutson & Carlson, making an operant response to obtain food or by 1973; PoweIl, 1974; Robertson & Anderson, 1975; eating it from a source of continuously available Taylor, 1972, 1975) and sucrose solution reinforcers free food, a curious thing happens. Animals not only (Carder, 1972; Tarte, Townsend, Vernon, & Rovner, acquire and continue to make the operant response, 1974). Although not food deprived, rats (Davidson, but they often appear to "prefer" the response- 1971; Kopp, Bourland, Tarte, & Vernon, 1976; dependent food. That animals respond for food in McLaughlin, Kleinman, & Vaughn, 1973; Tarte & the presence of identical free food is by now a well- Synder, 1972), pigeons, (Bilbery et al. , 1973; established finding in the experimentalliterature. Neuringer, 1969; Wallace, Osborne, Norborg, & Behavior has been maintained on both fixed-ratio Fantino, 1973) and chickens (Duncan & Hughes, (Alferink, Crossman, & Cheney, 1973; Atnip & 1972) have continued to respond for food in the Hothersall, 1973; Carder & Berkowitz, 1970; presence of free food. Davidson, 1971; Hothersall, Huey, & Thatcher, Two separate, but related, issues are involved in 1973; Tarte & Vernon, 1974) and variable-interval the free food phenomenon (0' Amato, 1974; Taylor, (Bilbrey, Patterson, & Winokur, 1973; Neuringer, 1972). First, under what conditions, i.e., to what 1970; Rachlin & Baum, 1972; Sawisch & Denny, extent, will animals "work" für food when free food 1973) schedules of reinforcement when free food was is available? Second, do animals prefer to work for concurrently available. Different strains of rats food when free food is available? In attempting to (Atnip & Hothersall, 1973; Hothersall et al. , 1973; address the second issue, free food experiments fre- Powell, i 974), rats with septal and ventromedial quently have been oriented towards specifying the hypothalamic lesions (Singh, 1972a), mice (Pallaud, preference relationship between free and response- 1971), chickens (Duncan & Hughes, 1972), pigeons dependent food. Preference for a food source exists (Neuringer, 1969, 1970), crows (PoweIl, 1974), cats when the proportion of food obtained from that (Koffer & Coulson, 1971), gerbils (Lambe & Guy, source exceeds 50010 of the total food obtained during 1973), Siamese fighting fish (Baenninger, & a session. The data from studies of preference often Mattleman, 1973), and humans (Singh, 1970; Singh vary enormously; some studies report a large pre- & Query, 1971; Tarte, 1972) have been shown to ference for response-dependent food, others report work for reinforcers when equivalent free reinforcers an equally large preference for free food (cf. Carder were available. Although food reinforcers typically & Berkowitz, 1970; Taylor, 1972). Inconsistency of have been used, the phenomenon also has been results from similar studies, coupled with difficulty in specifying the variables that control free food re- sponding, may be responsible for the paucity of I thank L. Meyerson, S. Parkinson, and members of the "Glass attempts to integrate free food data into existing Bead Gang,". especially P. Killeen, for their helpful comments on an earlier draft of this paper, and Ada Cardenas for her help in theory, or to relate it to other areas of research in preparing tlre" manuscript, Reprints may be obtained from S. animal learning. However, recent research suggests Osborne, Department of Psychology, Arizona Stare University, new perspectives that a110w an explanation of the Tempe, Arizona 85281. phenomenon that not only is parsimonious with 221
222 OSBORNE regard to existing free food data, but one that is dency, Brown and Jenkins (1968) found that experi- relevant for other research areas as weil. mentally naive pigeons came to peck a briefly lighted response key when keylight offset was followed by Definition of Free Food response-independent food. This "autoshaping" Some experimenters refer to intermittently avail- phenomenon has been shown with a number of non- able food presented on response-independent avian species (see Hearst & J enkins, 1974) and schedules as free food; however, food availability appears to persist when responses result in reinforcer is specified by the experimenter. Free food in the omission (Stiers & Silberberg, 1974; Williams & present review refers to food that is both response- Williams, 1969). independent and continuouslyavailable. The acquisition and maintenance of responding on response-independent schedules is not easily TUEORETICAL IMPLICATIONS accounted for by reinforcement theory (cf. Bindra, 1974; Bolles, 1972), despite the suggestion that Least Effort responding is superstitiously maintained by adven- Substantive experimental evidence suggests that titious correlations between behavior and reinforce- animals tend to maximize reward while simul- ment (Herrnstein, 1966; Zeiler, 1968). taneously minimizing effort. For example, rats prefer The acquisition and maintenance of food- the shorter, less effortful path to a goal (DeCamp, acquiring responses when free food is available also 1920; Gengerelli, 1930; Kuo, 1922; Logan, 1960; challenges a reinforcement analysis. If reinforcement Thompson, 1944; Tolman, 1955; Waters, 1934; strengthens those behaviors which precede it, then Yoshioka, 1929). More recently, Herrnstein (1958) free food should rein force nonresponding (cf , showed that pigeons permitted to choose between Gibbon, Berryman, & Thompson, 1974). Making a responding for food on different fixed-ratio specific response for food in the presence of free schedules of reinforcement that were concurrently food appears to be another example of behavior in- available invariably chose the smaller ratio schedule. adequately accounted for by reinforcement theory. The least effort hypothesis, as exemplified by these data, occupies a central position among most RESPONDING FOR REINFORCERS IN TUE current learning theories (e.g., Ferster & Skinner, PRESENCE OF FREE REINFORCERS 1957; Hull, 1943; Tolman, 1955). Although some studies have found that animals In the premier experiment (Jensen, 1963), rats were make the least effortful response (see Lewis, 1965, shaped to leverpress for food pellets. Following 40, for a review), other studies suggest either that re- 80, 160, 320, 640, or 1,280 continuously reinforced sponse effort has little effect on choice between responses, a cup of free pellets was placed in the rear similar rewards or that the more effortful response of the chamber. Jensen found that in one 40-min is preferred (Collier & Jennings, 1969; Jennings & session the proportion of pellets obtained by lever- Collier, 1970; Kanarek & Collier, 1973). That pressing increased as a logarithmic function of the animals respond for reinforcers when abundant free number of reinforced prechoice responses. Rats that reinforcers are available appears particularly difficult received 40 reinforced responses prior to choice test- to reconcile with least effort concepts, especially ing responded for about 20070 of their food, whereas when many responses per reinforcement are emitted rats that received 1,280 reinforced responses before (e.g., Neuringer, 1970). choice testing responded for 75070 of their food. This finding was subsequently extended by Neuringer Response Dependency (1969). After shaping the keypeck response and The major tenet of reinforcement theory is that receiving seven training sessions on CRF, two reinforcement increases the probability that those pigeons were housed in chambers where food was responses which precede it will recur (Skinner, 1938, available either by keypecking or from a cup of free 1948). Arranging to have reinforcers dependent food located in the rear of the chamber. Although upon a response insures that the response will occur food was continuously available, these animals in close temporal proximity to reinforcement and emitted approximately 420 and 1,200 responses per allows the reinforcement process to operate at an day, respectively, for food. A similar finding was optimal level. One way to assess the functional role reported for two rats, although the average number of response dependencies is by scheduling rein- of responses was lower than for the pigeons. Condi- forcers independently of behavior. Differences be- tions were then employed to control for the possible tween behavior initiated or maintained by schedules reinforcing effects of stimulus feedback associated of response-dependent and response-independent with response-dependent food presentation. For one reinforcement can then be attributed to the presence pigeon, a transparent shield was placed across the or absence of an explicit response-reinforcer depen- feeder opening, for the other the grain hopper was
FREE FOOD 223 emptied, and for the rats, responses produced only that pigeons continued to respond for food on a VI the auditory feedback provided by the pellet 3-min schedule when free food was available. Al- dispenser. In all cases, responding decreased to though Neuringer (1970) and Rachlin and Baum nearly zero. Therefore, the maintenance of respond- (1972) used well-trained pigeons maintained at 80070 ing did not appear attributable to the reinforcing of their free-feeding weights, such conditions are effect such stimulus feedback might have acquired as not necessary für the maintenance of responding on a result of pairing with food presentations or to the intermittent schedules of reinforcement. Bilbrey reinforcing properties of stimulus feedback alone. In et al. (1973) found that pigeons allowed continuous a second experiment, Neuringer (1969) showed that access to free food, both in the horne cage and during responding could not be attributed to habit strength experimental sessions, continued to respond for food of a previously learned response or to special pro- on VI 3-min and VI 30-sec schedules. In a second perties of the keypeck: Two experimentally naive study, they showed that previous experience with pigeons lived in their chambers where grain was keypecking for grain had only minor effects on the dependent on an operant response or available from rate of responding maintained by a VI 30-sec a cup of free food. For one pigeon, the response schedule. was the depression of alever, for the other a key- Sawish and Denny (1973) also reinforced pigeons' peck. In both instances, pigeons learned, without keypecks for food on a VI I-min schedule, but they prior shaping, to respond for food and continued to made the opportunity to respond for food dependent respond despite the availability of free food. on eating or not eating free food. They found that It also has been shown that rats continuously eating free food increased when the opportunity to housed in achamber with free food available readily respond was dependent on eating free food and de- acquire and maintain a food-producing operant re- creased when the opportunity to respond was depen- sponse (Coburn & Tarte, 1976; Kopp et al., 1976; dent on not eating free food for aperiod of 30 or McLaughlin et al. , 1973). Similarly, Baeninger and 60 sec. Sawisch and Denny suggested their data were Mattleman (1973) found that Siamese fighting fish evidence of the operation of the Premack principle acquired and continued to emit an operant response (cf. Premack, 1959) in a free food situation; that is, that produced a mirror to which the fish could the use of a high-probability response (keypecking) display, although a free mirror was continuously to reinforce a low-probability response (eating free present. Therefore, responding for reinforcers in the food). presence of free reinforcers includes both the acquisi- tion and the maintenance of behavior. Preference for Free Food Despite the apparent robustness of the phenomenon Schedule of Reinforcement and the suggestion of earlier research that animals In the Jensen and Neuringer studies, responses prefer response-dependent food, subsequent studies were continuously reinforced. Responding also has produced conflicting data. In one experiment been maintained on intermittent schedules of re- (Taylor, 1972), choice testing was preceded by 1,000 inforcement when free food was concurrently avail- continuously reinforced barpresses and then a single able. Carder and Berkowitz (1970) preceded choice session where only free food was available. During testing with three sessions of free food training and the initial test sessions of 20 min, rats responded for then six sessions of barpress training. They found over 50070 of their food; however, by the 15th test that when only 1 or 2 responses were required per session, preference for response-dependent food reinforcement, rats responded for 80070-100070 of decreased to 20070. Taylor suggested that the differ- their food. However, when the response requirement ence between his and Jensen's data was that Jensen was increased to 10, preference for response- conducted only one session of preference testing dependent food decreased to 10070-30%. Tarte and and hence failed to observe the changes toward free Vernon (1974) investigated barpressing for food food that occur over consecutive sessions. An across a greater range of fixed-ratio schedules and alternative explanation for the inconsistency in found a linear decrease in preference for response- preference is that Taylor used sessions of shorter dependent food as the ratio was increased from FR 1 length. Tarte and Snyder (1973) used a procedure (65070) to FR 21 (11070). similar to that of Taylor, but conducted 10 Free food responding also has been demonstrated l-h sessions of preference testing. They found a on aperiodic schedules of reinforcement. Neuringer consistent preference for response-dependent food (1970) found that pigeons emitted over 40 keypecks with no systematic change in preference evidenced per reinforcement when keypecks were reinforced across consecutive sessions. That Taylor allowed following irregular time intervals that averaged 1 min prechoice training with free food also may have (VI I-min schedule) despite the availability of free affected preferences (see Food Training Procedures food. Similarly, Rachlin and Baum (1972) showed below). Nonetheless, other studies support the find-
224 OSBORNE ing that animals prefer free food. For example, two fairly constant across sessions. By the third choice direct replications of the Carder and Berkowitz session there was a clear difference between the (1970) study showed that rats preferred free over groups: animals in the food-reinforcer group bar- response-dependent food when responses were re- pressed for 14070 (20-mg pellets) and 40070 (45-mg inforced by FR 1, FR 2, and FR 10 schedules (Atnip pellets) of their reinforcers; rats in the water- & Hothersall, 1973; Hothersall et al., 1973). Koffer reinforcer group barpressed for 4070 (0.01 cc) and and Coulson (1971) also found that cats, given a 31070 (0.1 cc) of their reinforcers. Thus the data of choice between responding for fish or obtaining it Robertson and Anderson suggest that reinforcer freely, consistently chose free fish, and Lambe and magnitude exerts a strong effect on preference. Guy (1973) found that both rats and gerbils pre- Moreover, it appears that there is a greater prefer- ferred free over response-dependent food. ence for response-dependent food, relative to response-dependent water reinforcers, even when the Reinforcer-Motivational Variables magnitude of the water reinforcer (by weight) is twice The ability of qualitatively different reinforcers that of food; that is, 0.1 cc of water represents a to maintain responding in the presence of equivalent, larger magnitude reinforcer than does a 45-mg food free reinforcers also has been studied. Choice pro- pellet, yet under these magnitude conditions rats bar- portions vary between experiments, but. it has been pressed for a greater proportion of their food demonstrated that animals show a stronger prefer- reinforcers. ence for response-dependent food reinforcers than Variation in the magnitude of water reinforcers for response-dependent water reinforcers under may account for some of the variability in previously conditions of continuous reinforcement (Carder, reported choice data. For example, Powell (1974) 1972; Roberts & Anderson, 1975; Tarte et al., 1974; and Taylor (1972, 1975) both reported that rats bar- Taylor, 1972), and intermittent reinforcement (cf. pressed for 5070 or less of their water reinforcers, but Tarte & Vernon, 1974; Taylor, 1975). Carder (1972) they used response-dependent reinforcers of rela- exposed rats to a situation where either water or tively small magnitude (0.01-0.02 cc). Alternatively, sucrose solution reinforcers were available freely Carder (1972) and Tarte et al. (1974) found that rats or by barpressing. He found that rats obtained 83070 barpressed for 26070 and 54070, respectively, of their of their sucrose reinforcers but only 23070 of their water reinforcers when the magnitude of the response- water reinforcers by barpressing. Similarly, Tarte dependent reinforcer was 0.1 cc. Knutson and et al. (1974) reported preferences of 79070 and 54070, Carlson (1973) reported that rats barpressed more respectively, for response-dependent food and for water reinforcers than for food reinforcers when response-dependent water reinforcers. equivalent free reinforcers were present. They used Altering the physical quality of the reinforcer also standard 45-mg food pellets and 0.01 cc of water as affects preference. Carder (1972) found that when the response-dependent reinforcers. Therefore, al- both free and response-dependent reinforcers were though rats responded more for water reinforcers, adulterated with quinine, preference for response- they still may have obtained a larger proportion of dependent sucrose decreased as a function of adulter- their food reinforcers by responding (cf. Robertson ation level. Tarte et al. (1974) found that a non- & Anderson, 1975). nutritive reinforcer (saccharin) maintained little In sum, preference for response-dependent rein- responding with either food- or water-deprived rats forcements appears to be an increasing function of when saccharin was freely available, and Davidson the effectiveness of the reinforcer, which depends and Davis (1975) found that the proportion of food upon reinforcer-motivational variables (cf', Bindra, acquired by barpressing decreased following in- 1974), such as rate and magnitude of reinforcement jections of known anorexic drugs. as well as the physical quality of the reinforcer. There Robertson and Anderson (1975) investigated the also is a greater preference for response-dependent effects of varying the magnitude of the response- food compared to response-dependent water rein- dependent reinforcer using both food (20- or 45-mg forcers when equivalent free reinforcers are avail- pellets) and water (0.01 or 0.1 cc) reinforcers. They able. This latter finding is consistent with other re- reported that differences in choice proportions across search (e.g., Fallon, Thompson, & Schild, 1965) reinforcer magnitudes were not statistically signifi- showing that water is a less potent reinforcer than cant. However, no prechoice exposure was provided food when deprivation duration and reinforcer to the free reinforcers and all three choice sessions weight are equated. were included in their data analysis. A close examina- An indirect way to manipulate reinforcer effective- tion of the data shows that preference for small- ness is by varying the severity of reinforcer depriva- magnitude response-dependent reinforcers decreased tion. Given the above data, food deprivation and linearly over sessions, whereas preference for large- preference for response-dependent food might be magnitude response-dependent reinforcers remained expected to covary. In the single experiment in which
FREE FOOD 225 food deprivation was varied systematically (Tarte reinforced responses (Jensen, 1963). McLaughlin & Synder, 1972), preference for response-dependent et al. (1973) found that rats left in experimental food increased as food deprivation time was extend- chambers für 7 to 10 days acquired a barpress ed from 0 to 92 h. Morgan, Einon, and Nicholas response with both free food and water contin- (1975) also found that rats deprived of food for 23 h uously available. Although response rates initially responded for a significantly greater proportion of were low, removal of the free reinforcers for 24 h their food than rats fed immediately before a session. increased barpressing tenfold once a choice between However, other researchers failed to detect similar free and response-dependent food or water was rein- deprivation effects.' Duncan and Hughes (1972) stated. In a more systematic investigation of the found no difference in the proportion of food effects of prechoice training, Tarte and Synder (1973) chickens obtained by keypecking between conditions found that rats barpressed for 72070 of their food of continuous access to food and 22-h food depriva- following three free food sessions and six sessions tion. Davidson (1971) trained rats to keypress of barpress training, but they barpressed for only for food on a multiple schedule of reinforcement. 15070-20070 of their food following equalized food When the chamber was illuminated, every 10th re- training procedures. sponse produced reinforcement (FR 10) and ex- Mitchell et al. (1973) suggested that the prechoice tinguished the chamber illumination, signaling that training schedule, per se, is a less important variable response-dependent food was not available. Cessa- than the amount and/or recency of familiarity with tion of responding for 20 sec (DRL 20 sec) reinstated the different food sources. After shaping, rats were the chamber illumination and the FR 10 schedule. allowed to make 140 continuously reinforced re- When free food was provided, it was consumed dur- sponses and were then given 10 training sessions. One ing the DRL component, but rarely during the FR group received seven sessions of barpress training component. Providing continuous access to food in followed by three sessions of free food training; a the horne cage eliminated within-session consump- second group received the same distribution of train- tion of free food but failed to abolish keypressing ing sessions, but in reverse order. When choice test- in the FR component, ing immediately followed barpress training rats bar- Several investigators (Davidson, 1971; Knutson & pressed for 50070 of their food, but they barpressed Carlson, 1973; MitchelI, Scott, & Williams, 1973; for only 1070 of their food when choice testing im- Robertson & Anderson, 1975) noted that the prob- mediately followed free food training. A third group ability of a response within an experimental session of rats given three alternating free and response- increases over time; free food typically is consumed dependent food training sessions followed by four during the first part of a session. It appears, then, sessions of barpress training responded for 34070 of that probability of responding and within-session their food. Thus preference for free food appeared to food deprivation are inversely related. It remains be an increasing function of the recency of free food possible that within- and between-session food training. However, free food training sessions were deprivation have different effects on responding. conducted with the response bar present but in- Variables that control cessation of responding for operable. Consequently, barpressing for food may food (or eating) are poorly understood (Morgan, have undergone extinction during free food training 1974a). At present the effects of food deprivation on and the decreased preference for response-dependent preference for response-dependent food are food rnay have been due to the recency of equivocal. However, variables other than "hunger extinction. drive" or food deprivation appear responsible for the There also is some evidence that familiarity with maintenance of free food responding, since several the food containers (Mitchell et al., 1973; MitchelI, reports have shown that animals may fail to consume Williams & Sutter, 1974; but see Carder & Beckman, up to 25070 of the food they have responded for 1975) or the food substance (Carlson & Riccio, 1976) (Mitchell et al. , 1973; Neuringer, 1969; Tarte & affects preference. Mitchell et al. (1973) found that Synder, 1973) and that considerable responding is rats barpressed for 48070 of their food when free maintained in the absence of food deprivation (e.g., food was placed in a novel container and response Kopp et al., 1976; Neuringer, 1969; Wallace et al., dependent food was presented in a familiar con- 1973). tainer, but they barpressed for only 1070 of their food when conditions were reversed. Carlson and Riccio Food Training Procedure (1976) found that providing prechoice exposure to Providing differential prechoice training with food pellets of the type used during experimental response-dependent food significantly increases pre- sessions, either in the horne cage or in the test ference for response-dependent food. As previously chamber, reduced subsequent preferences for noted, preference for response-dependent Iood in- response-dependent food. In a second experiment, it creases as a function of the number of prechoice was shown that changing the flavor of both free and
226 OSBORNE response-dependent pellets between conditions of sponse of the rat. Wild rats tend to avoid strange prechoiee training and testing decreased preferences food and strange food sources (Barnett, 1963); for response-dependent food. laboratory rats apparently share this disposition. Stolz and Lott (1964) investigated the effects of That rats run over a pile of pellets in a straight alley prechoiee training on a running response. Rats were (Stolz & Lott, 1964) seems explicable by such an trained to run an 8-ft straight alley for a single food analysis. Rats prefer to eat in that part of the pellet; then, during choiee trials, a pile of 1,500 apparatus where they have been accustomed to pellets was placed midway down the alley. Rats that eating; for example, they will carry food found in the received more prechoice training more often ran to stern of a maze to the goalbox before consuming the goalbox than animals given less training. How- it (Cohen-Salmon & Blancheteau, 1967). On the ever, two subsequent studies suggest that providing other hand, Carlson and Riecio (1976) argued that prechoiee training for a running response increases during barpress training the reinforcer becomes a preference for free food. Leung, Jensen, and Tapley discriminative stimulus for the operant response (1968) allowed two groups of rats either 75 or 285 that produces it (cf. Denny, 1971), and thereby main- reinforced runs down a straight alley before giving tains barpressing in the presence of free food. Yet them a choiee between eating free food in the start- another interpretation of the effects of prechoiee box or running the alley for a single pellet. They food training was provided by Taylor (1975), who found that rats given fewer prechoice trials ate less suggested that preference for response-dependent free food and spent less time in the startbox before reinforcers results from a failure to discriminate be- traversing the alley. Jensen, Leung, and Hess tween conditions of testing and training. He found (1970), using a similar procedure, allowed three that preference for response-dependent water rein- groups of rats 0, 40, or 285 reinforced runs before forcers decreased whenever the response requirement choiee testing. They found that rats given no pre- was increased or decreased from its prechoiee train- choiee training spent less time eating free food before ing value. The wide range of species, reinforcers, and running the alley than rats given 40 or 285 prechoiee procedures used in free "food" studies makes it trials; there was no signifieant difference between difficult to ascribe the effects of training procedures the latter two groups. to food source familiarity or discrimination effects. Free food experiments typically invo1ve a choice In many cases there is little to distinguish between between topographically dissimilar operants that such accounts because common predietions are often lead to the same reinforcer. Larson and Tarte (1976) made by all. However, the discriminative stimulus assessed the effects of prechoiee training when the hypothesis is consistent with data collected outside choiee responses involved topographically similar the free food area (cf, Rescorla & Skucy, 1969) and operants. They found that rats initially preferred the it can more easily account for the effects of pre- longer of two paths to a goalbox in a T-maze if they choiee training when only a single food source is were given more prechoice training with the longer availab1e (e.g., Kleinman et al., 1976). Nevertheless, path. However, over successive choice trials, prefer- of the studies reviewed, an explicit preference for ence for the shorter path increased to about 90070. response-dependent reinforcement was obtained only Rats given an equal number of prechoice trials for when no prechoice training for the free reinforcer both paths preferred the shorter path from the out- was provided (e.g., Jensen, 1963) or when prechoice set of testing. Kleinman, McLaugh1in, Gerard, training procedures that favored responding were Bosza, and Clipper (1976) found that rats consistent- used (e.g., Carder & Berkowitz, 1970). Together, ly preferred the larger of concurrently available FR 1 these data help resolve the preference issue: animals and FR 3 schedu1es following differential training appear to prefer response-dependent reinforcers with the larger ratio schedule. Taken together these only after asymmetrieal food training procedures. two studies suggest that prechoiee training effects are However, the issue of why animals respond at all transient if the difference in "effortfulness" be- when free reinforcers are available is left unresolved. tween the two alternatives is large (e.g., running a path of 122 or 244 cm; Larson & Tarte, 1976) Stimulus Change compared to the more enduring effects that resu1t Several recent experiments suggest that free food when the difference in effortfulness is small (e.g., responding is maintained by stimulus changes cor- barpressing on an FR 1 or FR 3 schedule or, per- related with response-dependent food presentation. haps, barpressing on an FR 1 schedu1e for pellets or Wallace et al. (1973) provided naive pigeons with a obtaining them freely). choiee between two sources of free food. They found Several explanations have been proposed for the that when entries into one food source were accom- effects of prechoice food training. Mitchell and his panied by a change in ambient stimulus conditions, associates suggested that the effect of food source pigeons consistently preferred the food source associ- familiarity on preference is due to a neophobic re- ated with stimulus change. In a second experiment,
FREE FOOD 227 two pigeons lived in experimental chambers where Two explanations have been proposed for the food was avilable by keypecking or from a cup of effects of stimulus change in free food experiments. free food. Whenever keypecks produced both food One suggests that stimulus change associated with and stimulus change associated with food presenta- response-dependent food serves as a conditioned tion, more than 100 continuously reinforced re- reinforcer; the other maintains that stimulus change sponses were made per session. However, respond- derives its reinforcing properties through sensory ing decreased to nearly 0 if stimulus change was reinforcement effects. presented contemporaneously with free food acquisi- tion, and responses produced food without con- Conditioned Reinforcement comitant stimulus change. Alferink et al. (1973) Alferink et al. (1973) and Tarte and Synder (1972) trained pigeons to respond for grain on a FR 300 suggested a conditioned reinforcement explanation schedule and then propped up the grain hopper so of the free food phenomenon. Conditioned rein- that grain was available without responding. Re- forcers are stimuli that acquire their reinforcing sponding was maintained when every 300 responses property through association with an unconditioned continued to produce the hopper light, but decreased reinforcer. Two sources of stimulus change associ- to a low level when responses had no scheduled ated with food presentation in free food experiments effect. Reinstating the hopper-light-on contingency are those afforded by the response operandum and returned responding to its previous rate. Responding the food delivery system, and any scheduled stimu- appeared to be controlled by the response-dependent lus changes. Through repeated pairing with food presentation of a stimulus (hopper light) previously presentation, stimulus change may acquire condi- associated with response-dependent food. tioned reinforcing properties. Responding for food Osborne and Shelby (1975) extended these findings in the presence of free food might be attributed to to a nonavian species. Following six alternating free the combined reinforcing effectiveness of response- and response-dependent food training sessions, rats produced food and conditioned reinforcement were provided a choice between barpressing for provided by stimulus change. Neither response- pellets on CRF or obtaining them freely from a cup produced food alone (Osborne & Shelby, 1975; of 500 pellets. When abrief auditory and visual Wallace et al., 1973) nor resonse-produced stimulus stimulus accompanied barpresses, these animals change alone (Davidson, 1971; Duncan & Hughes, responded for 28070 of their food; however, removal 1972; Enkema, Slavin, Spaeth, & Neuringer, 1972; of the stimulus as a consequence of barpressing de- Neuringer, 1969; Osborne & Shelby, 1975; Tarte, creased responding to nearly zero. Subsequent condi- 1974) are sufficient to maintain responding when free tions revealed that the amount of maintained re- food is available. Yet, when responses produce both sponding, and consequently preference for response- food and stimulus change, many responses are dependent food, varied as a function of stimulus emitted. The failure to maintain responding with change conditions attendant upon response-dependent stimulus change alone is consistent with the finding food presentation. For example, there was a signifi- that stimuli which serve as conditioned reinforcers cantly greater preference for response-dependent often lose their effectiveness when primary reinforce- food when barpresses were accompanied by both ment is withheld (Kelleher & Gollub, 1962). auditory and visual feedback than when barpresses were accompanied by auditory or visual feedback Sensor Reinforcement alone. Response-dependent sensory reinforcers unrelated A fortuitous example of the effects of stimulus to primary appetitive reinforcers are reinforcing change was provided by Mitchell et al. (1973). They (cf. Kish, 1966). Naive pigeons acquire and main- allowed one group of rats a choice between bar- tain a keypeck response when the sole consequence pressing for pellets or obtaining them freely from a of responding is a change in ambient stimulus condi- pile of 300 pellets in an adjacent food dish. To con- tions or a change in response key color (Appel, 1963; trol for the possible novel effect of bulk reinforce- Herrnstein & Loveland, 1972). Osborne and Shelby ment (300 pellets), a second group of rats received (1975) suggested that animals in free food experi- a food dish that contained only 20 free pellets; an ments respond for food plus stimulus change not additional pellet was dispensed by the experimenter because stimulus change is a conditioned reinforcer, each time a free pellet was removed. Although both but because the reinforcing value of stimulus change groups preferred free food, rats provided with free increases in the presence of other more potent food accompanied by the sound of the pellet dis- primary reinforcers. Sensory reinforcement effects penser consumed less free food than rats provided may interact with the primary reinforcement effects free food without the sound of the feeder. In short, of response-dependent food such that the total rein- equating stimulus change conditions for both free forcement obtained by responding is sufficient to and response-dependent food essentially eliminated maintain responding when free food is available. barpressing. In all the free food experiments reviewed here,
228 OSBORNE stimulus change was paired with food presentation. different environments. Coburn and Tarte (1976) This makes it difficult to distinguish between condi- reared groups of rats in either stimulus-impoverished tioned and sensory reinforcement effects, because or stimulus-enriched environments and then housed many of the variables that affect the reinforcing them in chambers where both free and response- effectiveness of conditioned reinforcers have a dependent food were avilable. Animals in the im- similar effect on the reinforcing potential of sensory poverished group readily acquired the barpress re- reinforcers. However, some free food data appear sponse and consistently responded for ab out 600;0 of more amenable to a sensory reinforcement analysis. their daily food intake; rats in the enriched group First, if conditioned reinforcers acquire their rein- responded at a low level throughout the experiment forcing properties through repeated association with and earned less than 1010 of their food by responding. primary reinforcers, then acquisition of a food- Morgan et al. (1975) assessed choice behavior of obtaining response in the presence of free food (Kopp impoverished and enriched rats on a concurrent et al. , 1976; Neuringer, 1969) is difficult to explain schedule of reinforcement. On one side of a by conditioned reinforcing effects of stimulus change. chamber, leverpresses produced a food pellet once On the other hand, sensory, or unconditioned every 30 sec; on the other side of the chamber, a reinforcing stimuli, need no prior exposure to affect pellet was delivered once every 30 sec, independently behavior and they presumably would be reinforcing of the animal's behavior. In this situation there was from the outset. Second, Tarte and Synder (1973) no difference in the choice behavior of impoverished required that 0, 25, or 50 continuously reinforced and enriched rats. When these same rats were sub- responses occur prior to placing free food in the sequently provided a choice between free food and chamber. During the ensuing choice period, respond- leverpressing for pellets on a schedule of continuous ing decreased as the number of reinforced prechoice reinforcement, the impoverished rats responded for responses increased. Although the change was not 42010 of their food whereas animals in the enriched statistically significant, it is noteworthy that this group responded for only 19010 of their food. In the requirement should have increased the conditioned first experiment, identical stimulus change ac- reinforcing strength of stimulus change because of companied both response-dependent and response- the extra number of stimulus-food pairings; yet independent pellet deliveries, but in the second preference for response-dependent food decreased. experiment, stimulus change only accompanied Alternatively, with such a procedure, sensory rein- response-dependent pellets. Therefore, differences forcement effects might have undergone partial satia- in preference behavior between enriched and im- tion. Finally, following a condition where no poverished animals were evident only when differ- stimulus feedback accompanied responding, Osborne ential stimulus change accompanied response- and Shelby (1975) exposed some rats to a condition produced pellets. where an auditory stimulus accompanied res pond- Davis, Beighley, Libretto, Mollenhour, and ing; for others a visual stimulus accompanied re- Prytula (1975) also found that rats reared in stimulus- sponding. Addition of the auditory stimulus in- deprived environments preferred response-dependent creased responding, although this stimulus previous- food, whereas rats reared in either normal or stimulus- ly had not been associated with food presentation. enriched environments preferred free food (but see Subsequently, when both auditory and visual feed- Tarte, Townsend, & Vernon, 1973). back accompanied response-produced food, re- Numerous behavioral differences exist between sponding increased above the level maintained by animals reared in impoverished and enriched en- either auditory or visual feedback alone. Egger and vironments (cf. Morgan, 1973, Morgan et al., 1975; Miller (1962, 1963) have shown that, when stimuli Wallace, 1974). For example, impoverished rats provide redundant information ab out impending show heightened levels of exploratory behavior and reinforcement, such stimuli do not become condi- are slower to habituate to environmental (sensory) tioned reinforcers even though they are temporally stimuli. It might be argued, then, that impoverished contiguous with reinforcement. Adding stimulus rats are more likely to respond for stimulus change feedback from a different stimulus modality, as or satiate more slowly than normals. This interpreta- Osborne and Shelby did, added no information to tion is not only consistent with the data of Coburn the situation, yet responding increased. This squares and Tarte (1976) and Morgan et al. (1975), but also with the fact that increasing the complexity, the is supported by a large body of literature which inconstancy, or the novelty of a sensory reinforcer in- shows that the ability of sensory reinforcers to main- creases its reinforcing potential (Barnes & Baron, tain responding is directly related to antedating 1961; Dember, 1956; Welker & King, 1962). stimulus deprivation conditions (cf. Fowler, 1971; Sensory reinforcement also may account for some Kish,1966). of the differences in acquisition and maintenance of Although a sensory reinforcement explanation free-food responding between animals reared in of stimulus change appears to account for much of
FREE FOOD 229 the data, it does not account for the effects of pre- to encompass existing data collected within the free choice training on preference. Here a conditioned food paradigm: The tendency to respond for rein- reinforcement explanation is more consistent; that is, forcers when physically equivalent free reinforcers the strength of a conditioned reinforcing stimulus are available (a) increases whenever reinforcer presumably increases with repeated stimulus- effectiveness increases, and/or when the reinforcing reinforcer pairings. effectiveness of the stimulus change associated with its presentation increases, and (b) decreases when- Controlling Variables ever the reinforcing effectiveness of the reinforcer or Prechoice food training appears to affect prefer- the stimulus change associated with its presentation ence between free and response-dependent food. decreases. The reinforcing effectiveness of primary However, differential prechoice training with reinforcers like food and water is determined by a response-dependent food is neither necessary nor host of variables that include the rate and magnitude sufficient for the acquisition or maintenance of of reinforcement, the physical quality of the rein- responding for reinforcers in the presence of forcer, and reinforcer deprivation. Several variables equivalent free reinforcers. Although food source that contribute to the reinforcing effectiveness of familiarity or discriminative stimulus hypotheses sensory reinforcers include the complexity, novelty, may aid our understanding of the effects of prechoice and inconstancy of the stimulus change as well as training on preference, neither adequately accounts antedating stimulus deprivation conditions. Al- for the wide range of data reviewed here. First, though the emphasis here is on the sensory rein- differential prechoice training with response- forcing properties of stimulus change, a comparable dependent food does not insure preference for argument can be made for the conditioned rein- response-dependent food (Atnip & Hothersall, 1973; forcing properties of stimulus change, i.e., the Hothersall et al. , 1973). Second, both rats and strength of a conditioned reinforcing stimulus pre- pigeons readily acquire an operant response for food sumably covaries with the reinforcing effectiveness or water reinforcers in the presence of equivalent of the primary reinforcer with which it is associated. free reinforcers (Kopp et al., 1976; McLaughlin et al., It remains possible that stimulus change serves both 1973; Neuringer, 1969), and continue to respond for a conditioned and a sensory reinforcement function. reinforcers after equalized training procedures (Osborne & Shelby, 1975; Tarte & Synder, 1973). Finally, prechoice training appears to have only a Interexperiment Variability minor effect on the rate of maintained responding Considerable interexperiment variability has in avian species (Bilbrey et al., 1973; see also Duncan occurred in studies of preference between free and & Hughes, 1972). Alternatively, responding fails to response-dependent food. Much confusion and develop unless differential stimulus change difficulty in integrating free food data into existing accompanies response-dependent food presentation theory has resulted. Methodological differences are (Mitchell et al. , 1973); well-trained responses are an obvious source of variance, although the problem extinguished readily if the stimulus change that is complicated further by the use of different de- normally accompanies response-dependent food pendent measures. Data have been reported as presentation is withheld (Alterink et al. , 1973; number of responses (Neuringer , 1969), propor- Wallace et al., 1973); and preference for response- tional amounts of free and response-dependent food dependent food appears to vary systematically as a consumed (Carder & Berkowitz, 1970), and latency function of the stimulus change conditions that to eat free or response-dependent food (Jensen et al., accompany responses. Despite the present difficulty 1970; Koffer & Coulson, 1971). Nonetheless, inter- in ascribing a conditioned or sensory reinforcement experimentvariability in preference data appear to be function to stimulus change, it seems clear that attributable largely to differences in prechoice food responding for reinforcers when physically equiv- training procedures and to stimulus change condi- alent reinforcers are freely available is attributable to tions associated with response-dependent food, al- the differential reinforcement afforded by response- though session duration differences and the brevity dependent food and attendant stimulus change. of choice testing may also contribute to variability. However, this appears to be an interactive effect; Differences in stimulus change conditions between that is, the amount of responding maintained by experiments may be no greater than those resulting food-plus-stimulus change cannot be predicted by a from apparatus differences. Such differences may simple additive combination of the amount of nonetheless affect choice behavior and thereby pro- responding maintained by each (Os borne & Shelby, vide a possible explanation for differences in ob- 1975; cf. Herrnstein & Loveland, 1972). tained preferences between experiments that used In considering the interactive effect of food and seemingly identical procedures (cf'. Atnip & stimulus change, several general staternents appear Hothersall, 1973; Carder & Berkowitz, 1970).
230 OSBORNE RELATEDRESEARCH are indifferent between response-independent and response-dependent schedules of reinforcement when Response-Independent and Response- the rate of reinforcement is constant for each; devia- Dependent Reinforcement tions from indifference appear to be attributable to Studies by Morgan (1974b) and Singh (1970) asymmetrical stimulus conditions. That animals typically have been included in evaluations of the free show a near-exclusive preference for free over food phenomenon. Because "free food" was not response-dependent food when stimulus change continuously available in these studies, they differ conditions are equalized may result from differ- from the procedural arrangement used in other free ences in rate of reinforcement. food studies. Singh (1970) allowed rats a choice be- tween barpressing for pellets on one side of a Signaled Reinforcement chamber or receiving free pellets on the other side. Animals and humans permitted to choose between "Free pellets" were dispensed only if the animal a primary reinforcer preceded by a signal and the remained in the free food compartment for a time same reinforcer unsignaled prefer the signaled rein- inter val equated with the average obtained inter- forcer (Cantor, 1971; Cantor & LoLordo, 1970, reinforcement interval of concurrently available 1972; Furedy & Klajner, 1972; Lewis, Lewin, response-dependent food. When barpresses were Muehliesen, & Stoyak, 1974; but see Hershiser & reinforced on fixed-ratio schedules (FR 1, FR 3, and Trapold, 1971). Cantor and LoLordo (1970) ad- FR 11), rats responded for about 70070 of their food, ministered 0.5-sec trains of electrical stimulation of and they continued to prefer response-dependent the brain (ESB) to rats according to a variable-time food even when free food was available at a faster I-min schedule. The rats were placed in a shuttlebox rate. and provided a choice between receiving ESB pre- The finding of a consistent preference for response- ceded by a signal on one side of the chamber and dependent food appears inconsistent with the stimu- the simultaneous presentation of ESB and signal lus change hypothesis outlined above inasmuch as on the other. They found that rats spent approxi- stimulus change presumably was equalized for both mately 70070 of their time in that part of the chamber food sources. However, free pellets could arrive where ESB was preceded by a signal. Lewis, Lewin, when the rat was anywhere in the free food compart- Muehliesen, and Stoyak (1974) reinforced pigeons' ment, whereas on the response-dependent side the keypecks with grain on a VI l-min schedule. A peck rat had to be in contact with the response lever and to a second, concurrently available changeover key in close physical proximity to the food cup at time of produced a VI l-min schedule for 1 min, during pellet delivery. Consequently, the occurrence of which reinforcement was preceded by a 5-sec tone stimulus change and food acquisition could be separ- stimulus. Under this procedure, pigeons pecked the ated temporally on the reponse-independent side but changeover key at a rate sufficient to maintain the presumably would occur in close temporal proximity signaled schedule in effect 90070 of the time. In on the response-dependent side. Within a similar studies of signaled reinforcement, a stimulus pre- paradigmatic arrangement, Morgan (1974b) found cedes and is invariantly correlated with reinforce- that food consumption latencies for response- ment. In free food experiments, a response is independent food were much higher than for followed by a brief stimulus and food. Despite pro- response-dependent food. Morgau's data suggest cedural differences between the two paradigms, the that the temporal relationship between stimulus stimulus effects on behavior are similar. Differ- change and food acquisition may be an important entially adding a stimulus that precedes reinforce- variable. When Morgan equated stimulus change ment in a choice situation increases preference for conditions attendant upon food acquisition, he that alternative. found that rats responded for only 30070-40070 of their food. The obtained rate of reinforcement Autosbaping and Automaintenance (number of pellets/time) was higher on the response- Experimentally naive animals "autoshape" to dependent side and yielded a close approximation stimuli that signal response-independent reinforce- to the "matching law" (Herrnstein, 1970). When an ment (e.g., Brown & Jenkins, 1968; Peterson, Ackil, added stimulus signaled that a barpress would be Frommer, & Hearst, 1972) and still respond when reinforced, rate of reinforcement and preference responses prevent the delivery of reinforcement for that side increased. When rate of reinforcement (Stiers & Silberberg, 1974; Williams & Williams, and stimulus conditions were equalized for both 1969). Autoshaping and free food experiments differ food sources, an equivalent amount of food was in terms of alternative sources of reinforcement, but obtained from each. In short, the Singh and Morgan are similar because responding in both cases is data are consistent with other research (Killeen, 1968; initiated and maintained in a situation where rein- Moore & Fantino, 1975) which indicates that animals forcers are available independently of behavior.
FREE FOOD 231 In arecent monograph, Hearst and Jenkins (1974) reinforcement may explain preference for signaled suggested that animals orient towards, approach, over unsignaled schedules of reinforcement, it has and make contact with stimuli that signal impending difficulty explaining some free food data and appears reinforcement and avoid or withdraw from stimuli to be an unsuitable explanation for autoshaping and that signal reinforcer unavailability, a behavioral automaintenance data (cf. Hearst & Jenkins, 1974; process they call "sign-tracking." Autoshaping and but see Hursh, Navarick, & Fantino, 1974; Zentall related phenomena are attributed to stirnulus- & Hogan, 1975). Presently, a single unifying account reinforcer effects; i.e., stimuli that precede reinforce- that encompasses the range of stimulus-reinforcer ment elicit approach and contact behavior to those effects reviewed here is lacking, but future research stimuli. Unconditioned stimulus events also appear on sign-tracking behavior may fill this void. A sign- to elicit exploratory reactions of approach, investiga- tracking analysis of the free food phenomenon is not tion, and contact behavior (cf. Fowler, 1971). Such currently possible because of the paucity of data on "sensory reinforcers" appear to have an effect on the effects of systematically varying stimulus condi- the initiation and maintenance of responding when tions correlated with response-dependent food. This free food is available, and they may contribute to the deficit itself suggests a productive avenue for future initiation and maintenance of autoshaped responses. free food research. Data forthcoming from such Herrnstein and Loveland (1972) found that naive research not only may allow closure on the free food pigeons autoshaped to a stimulus change reinforcer. phenomenon, but may provide data relevant for They concluded that "this kind of reinforcement other areas of research in animallearning as well. (stimulus change) would by itself, make possible the phenomenon of autoshaping." However, the SUMMARY phenomenon is greatly enhanced by correlating food with stimulus change. Similarly, Blanchard and A plethora of theoretical accounts have been Honig (1976) and Downing and Neuringer (1976) adduced to explain the free food phenomenon. Sorne found that naive pigeons pecked an illuminated of these include: an intrinsic appeal of performing response key the first time it was presented and be- the operant (Jensen, 1963), responses serving as their fore pairing with grain if sufficient hopper training own motivation and reinforcement (Neuringer, 1969; was provided. Kopp & Bourland, Note 1), discriminative stimulus Not all stimuli have an equiva1ent effect on be- properties of reinforcers (Carlson & Riccio, 1976), havior, either prior to (Kish, 1966) or after the con- environmental competence (Singh, 1970; Stephens, ditioning process (Revusky & Garcia, 1970; Seligman, Metze, & Craig, 1975), arousal properties of rein- 1970). Stimulus events therefore may derive their forcers (Carder, 1972), deprivation (Tarte & Synder, reinforcing effectiveness collectively from both con- 1972), habit strength or behavioral persistence ditioned and unconditioned properties (cf. Osborne (Morgan, 1974a; Tarte & Synder, 1973), neophobia & Shelby, 1975). (Mitchell et al., 1973), Premack's principle (Sawisch & Denny, 1973), and discrimination (Taylor, 1975). Self-Reinforcement While all of these theories may account for part of Mahoney and Bandura (1972) found that pigeons the data, none appears to encompass the broad range trained to peck a briefly illuminated response key of experimental findings reviewed here. before eating continued to respond when the response- It has perhaps been the mistaken belief that free dependency for food was removed. They interpreted and response-dependent food are equally attractive their data as an anima1 analogue of human self- (reinforcing) that has engendered both surprise and reinforcement (Mahoney, 1972). However, respond- interest in the free food phenomenon. Results of ing in Mahoney and Bandura's experiment was main- these studies have tempted some (Singh, 1972b) to tained only when responses were accompanied by draw analogies between the propensity of animals a brief stimulus change. Consequently, responding to work for reinforcers in the presence of free rein- in this case appears similar to responding main- forcers and the human welfare system. Others may tained in free food experiments; i.e., animals appear have construed such results as an indictment of to prefer food-plus-stimulus change over food alone. current learning theories, As shown by the data re- The data reviewed here show that stimulus- viewed above, however, the free food anomaly may reinforcer relationships have a strong effect on be- be more in perception than in reality; it is the havior; however, the noteworthy point is the magni- combined reinforcement provided by response- tude of their effect on behavior. The other con- dependent reinforcers and attendant stimulus change spicuous element of these studies is the lack of a that maintains responding for reinforcers in the common explanation. Conditioned reinforcement presence of equivalent free reinforcers. This stimulus- most often is adduced to account for stimu1us- reinforcer interaction serves as both a necessary and reinforcer effects on behavior. Although conditioned a sufficient condition for the emergence and main-
You can also read