Chapter 5: Learning

Operant Conditioning: Learning based on consequences

* 4 main terms:
   Reinforcement: increases the probability that the behavior will happen again
   Punishment: decreases the probability that the behavior will happen again
   Positive: application of a stimulus
   Negative: removal of a stimulus

       Positive Reinforcement: Providing something pleasant
       Negative Reinforcement: Removing something unpleasant
       Positive Punishment: Providing something unpleasant
       Negative Punishment: Removing something pleasant
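
The four combinations above follow mechanically from the two pairs of terms, which a small lookup can make explicit. This is only an illustrative sketch: the function name `classify` and its argument values ("applied"/"removed", "increases"/"decreases") are made up for this example and are not standard terminology.

```python
def classify(stimulus: str, effect: str) -> str:
    """Name the operant-conditioning procedure.

    stimulus: "applied" or "removed" (what happens to the stimulus)
    effect:   "increases" or "decreases" (what happens to the behavior)
    """
    # Positive/Negative describes the stimulus, not whether it is "good" or "bad".
    sign = {"applied": "Positive", "removed": "Negative"}[stimulus]
    # Reinforcement/Punishment describes the effect on the behavior.
    kind = {"increases": "Reinforcement", "decreases": "Punishment"}[effect]
    return f"{sign} {kind}"


print(classify("removed", "increases"))  # Negative Reinforcement
print(classify("applied", "decreases"))  # Positive Punishment
```

Note that "negative" never means "bad" here: negative reinforcement still makes the behavior more likely.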

* Shaping: speeding up the learning process by rewarding successive approximations of the desired behavior

* Which is more effective: punishment or reinforcement?
   -> reinforcement: punishment only suppresses a behavior; it does not provide an alternative, more appropriate behavior

* Schedules of reinforcement:
    Interval schedules: based on time
    Ratio schedules: based on number of responses

     
Fixed interval schedule: the reward is given for the first desired behavior after a set amount of time. The rate of responses drops off right after the reward, but increases near the end of the interval.


     
Variable interval schedule: the reward is given for the first desired behavior after a random amount of time (within limits). The rate of responses is regular but slow.


     
Fixed ratio schedule: the reward is given after a set number of responses. The rate of responses is fast, but there is a pause after each reward.


     
Variable ratio schedule: the reward is given after a random number of responses (within limits). The rate of responses is regular and fast.
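
The four schedules differ only in the rule that decides whether a given response earns a reward, so they can be sketched as four small functions. This is a rough illustration, not a model of real behavior: the function names, the uniform random ranges used for the "variable" schedules, and the use of a plain number `t` for the time of a response are all assumptions made for this example.

```python
import random


def fixed_ratio(n):
    """Reward every n-th response."""
    count = 0

    def respond(t):
        nonlocal count
        count += 1
        if count >= n:
            count = 0
            return True
        return False

    return respond


def variable_ratio(mean_n, rng):
    """Reward after a random number of responses (1 to 2*mean_n - 1)."""
    count = 0
    target = rng.randint(1, 2 * mean_n - 1)

    def respond(t):
        nonlocal count, target
        count += 1
        if count >= target:
            count = 0
            target = rng.randint(1, 2 * mean_n - 1)  # draw the next requirement
            return True
        return False

    return respond


def fixed_interval(seconds):
    """Reward the first response made at least `seconds` after the last reward."""
    last = 0.0

    def respond(t):
        nonlocal last
        if t - last >= seconds:
            last = t
            return True
        return False

    return respond


def variable_interval(mean_seconds, rng):
    """Reward the first response after a random wait (0 to 2*mean_seconds)."""
    last = 0.0
    wait = rng.uniform(0, 2 * mean_seconds)

    def respond(t):
        nonlocal last, wait
        if t - last >= wait:
            last = t
            wait = rng.uniform(0, 2 * mean_seconds)  # draw the next wait
            return True
        return False

    return respond


# One response per time step, t = 1..6, on a fixed ratio of 3:
fr = fixed_ratio(3)
print([fr(t) for t in range(1, 7)])
# [False, False, True, False, False, True]

# Responses at chosen times on a fixed interval of 10:
fi = fixed_interval(10)
print([fi(t) for t in (2, 9, 10, 12, 19, 21)])
# [False, False, True, False, False, True]
```

On the ratio schedules, responding faster earns rewards faster, whereas on the interval schedules extra responses before the time is up earn nothing. This matches the fast response rates described for ratio schedules above.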