Neuroscience

Information, certainty, and learning

Justin A Harris author has email address
Charles R Gallistel

The University of Sydney, Sydney, Australia
Rutgers University, New Brunswick, United States

https://doi.org/10.7554/eLife.102155.3

Open access
Copyright information

Figures and data

Median reinforcements to acquisition plotted against the C/T ratio
Asterisks show the data from Gibbon and Balsam’s (1981) metanalysis of acquisition in pigeons. The two open circles are from Jenkins, Barnes and Barrera (1981) who used even bigger C/T ratios in some groups. The log of the informativeness is the mutual information, I, between the CS and the expected wait for reinforcement (λ_R; lower x axis). The learning rate (right axis, reversed) is the reciprocal of reinforcements to acquisition (thus the axis increases towards zero). The regression model was fit only to the asterisks, but it also predicts the Jenkins et al. data. The value k is the x intercept, the informativeness that produces one-trial learning. The regression model curves upward to infinity as informativeness goes to 1 and mutual information to 0 because, when the CS rate does not differ from the contextual rate of reinforcement, differential responding to the CS does not emerge (Rescorla, 1967, 1968).

Summary of groups.

Number of trials for acquisition of responding to the CS, plotted against CS informativeness (C/T ratio).
A, B and C plot the number of trials to reach three different acquisition criteria for CS response rates exceeding the contextual rate. Each plot shows the median number of trials to criterion for each group (filled coloured circles) and the number of trials for each rat (unfilled coloured diamonds) plotted against informativeness of the CS. These data are superimposed onto the data (black asterisks) from Gibbon & Balsam (1981) and the black regression line shown in Figure 1. The dashed coloured lines show the regression model, log[RtoAcq] = log[k·(C/T-1)^s], fitted to the group medians. The slope, s, and R² of each regression line is: in A, s = -0.68 [95% confidence intervals = -0.94, -0.42], R² = 0.73; in B, s = -0.66 [-0.86, -0.46], R² = 0.81; in C, s = -0.6 [-0.76, -0.45], R² = 0.86. The seven plots in D show the cumulative fraction of rats that had acquired responding to the CS after n reinforced trials (x-axis) according to the same 3 acquisition criteria in A to C. In each plot, the data from two groups with similar informativeness (i, iota) are combined to increase the sample size.

Terminal response rates (final 5 sessions) of individual rats as a function of the reinforcement rate on double-logarithmic coordinates.
Red circles show response rates during the CS (number of pokes after the first poke divided by the remaining CS time when the rat’s head was out of the magazine) plotted against the CS reinforcement rate (1/T). Black circles show response rates during the inter-trial interval plotted against the baseline reinforcement rate (1/C). Filled circles show data of rats given 420 reinforced trials (Groups with C/T ratios ≤ 72) and open circles show data of rats given 126 reinforced trials (Groups with C/T ratios ≥ 110). The Model in A gives the equation for the solid black regression line (R² = 0.81). The slope of this line is virtually indistinguishable from 1 (R² = 0.80, when slope fixed at 1). The dashed red and grey lines show regression lines, with slopes fixed at 1, fitted separately to the CS response rates and ITI responses rates. In B, the blue regression line has slope fixed at 1, and the ITI response rates (black circles) are plotted against the overall reinforcement rate (1/C) divided by 1.9.

Number of trials to each decile of peak response rate.
The coloured line plots show the mean number of trials to each decile for each rat in each of the 14 groups (identified by the C/T ratio). The thick black line in each plot is the average for that group. The slopes of those black lines for each group are plotted against each group’s T in the rightmost plot of the third row. The final plot in the bottom right corner of the figure plots the Bayesian Information Criterion (BIC) for an exponential function against the BIC for a straight line when each was fitted to the trials-to-criterion data of individual rats (each open black circle shows the BICs for one rat). For each point in the top left of the plot (above the solid orange line), BIC_line < BIC_exponential, meaning that the data were better accounted for by the line than the exponential function (and vice versa for points in the bottom right half of the plot). The two dashed orange lines mark where the difference in BICs equals 4.6 which equates to odds of 10:1 in favour of the function with the lower BIC.

Modelling response probability across trials as a function of informativeness.
A: The cumulative probability of a response, p(R), calculated using Equation 6, as a function of number of reinforced trials. Each curve plots p(R) for one of the 14 levels of informativeness used in the current experiment, ι = 1.5 to 300. B: The black line plots the number of trials taken for p(R) to reach 0.5 as a function of the same 14 levels of informativeness. The plot includes the median trials to acquisition (t2acq) for the 14 groups in the current experiment, replotted from Figure 2A. D to I: The cumulative fraction of rats that had acquired responding (orange lines) for pairs of groups with similar informativeness, replotted from Figure 2D. The black and grey curves plot the cumulative p(R) using the same two values of ι (the black line is based on the higher of the two values).

nD_KL and parsed response rates for two rats.
Panels A to D show data from Rat 3; panels E to H show the corresponding data from Rat 176. For each rat, the top two panels plot the cumulative rate of poking during the CS, λ_r|CS (solid black line), during pre-CS ITIs, λ_r|ITI (dashed black line), and the contextual rate, λ_r|C (dotted black line), as functions of the number of trials (1 reinforcement per trial). In each plot, the red curve is the signed nD_KL plotted against the right axis. The black vertical line marks when the cumulative CS response rate permanently exceeded the cumulative context rate. The two red dashed vertical lines to the right of the black line mark when the nD_KL reached 0.82 (Odds 4:1 that CS rate > Context rate) and 1.92 (p <.05 that CS rate = Context rate), and the unbroken red line marks the minimum of the nD_KL. In A and E, response rates were calculated conventionally, as number of responses divided by total time. In B and F, response rates were calculated as the number of responses excluding the first response in each CS divided by the remaining time (after the 1^st response) out of the magazine. The bottom two panels for each rat show the parsed estimates of λ_r|CS and λ_r|ITI. In C and G, the x-axis has been right-cropped to better reveal early changes. In all plots, the vertical red lines mark estimates of acquisition based on the nD_KL as the Earliest estimate (leftmost), and when the odds against the null hypothesis that the parsed CS and ITI rates were equal reached 4:1, 10:1, 20:1, and 100:1.

Sign up for email alerts