| Chapter 5: Learning Operant Conditioning: Learning based on consequences * 4 main terms: Reinforcement: increases the probability that the behavior will happen again Punishment: decreases the probability that the behavior will happen again Positive: application of a stimulus Negative: removal of a stimulus Positive Reinforcement: Providing something pleasant Negative Reinforcement: Removing something unpleasant Positive Punishment: Providing something unpleasant Negative Punishment: Removing something pleasant * Shaping: speeding up the learning process by rewarding approximations of the desired behavior * Which is more effective: punishment or reinforcement? -> punishment does not provide an alternative, more appropriate behavior * Schedules of reinforcement: Interval schedules: based on time Ratio schedules: based on number of responses Fixed interval schedule: the reward is given for the first desired behavior after a set amount of time. The rate of responses levels off after the reward, but increases near the end of the interval. Variable interval schedule: the reward is given for the first desired behavior after a random amount of time (within limits). The rate of responses is regular but slow. Fixed ratio schedule: the reward is given after a set number of responses. The rate of responses is fast, but there is a pause after each reward. Variable ratio schedule: the reward is given after a random number of responses (within limits). The rate of responses is regular and fast. |