After the test we began the unit on learning. We will have a fifth test, in order
to give folks another chance to bring their test average up. (I drop the
lowest of the test scores.) It will be a fun format, with a guarantee
of passing just for taking the test, and it will be given on the last day of class. Also,
I will be planning a review session before the final for people who want to
participate.
1. Learning: Learning is defined as a relatively enduring
change in behavior as a result of experience.
Learning allows us to respond flexibly to an ever-changing environment.
Learned vs. innate behaviors: Humans have very few innate
(inborn) behaviors;
most of what they need to know in order to survive must be learned. (Contrast this with
baby ducks, which follow the first moving thing they see after hatching
(usually their mother), know how to swim and how to eat the same foods their mother eats, and
crouch and hold still when the outline of a hawk passes overhead, even though they have
never had the opportunity to learn any of these things.) An infant knows how to suck on a
nipple and automatically turns toward something that touches his/her cheek, but is primed
to learn very rapidly by making associations: connections between things that
happen concurrently. For example, the infant rapidly learns that the smell, sound, and
sight of the primary caretaker (most often the mother) are associated with food and
comfort: a hungry infant cries until fed, but after a short time, reacts to the smell or
sound of the mother by quieting even when he/she has not yet been fed.
Research on infant rats ('pups') shows the power of these earliest learning
experiences. Rat pups were fed milk from a lemon-scented nipple. When given an empty
(surrogate) nipple to suck, they would suckle for 80% of a ten-minute period, while pups in a
control group who had only been fed from a normal-smelling nipple suckled only 20%
of the time. While most learning depends on repeated exposure to paired stimuli, the
pups learned this new set of associations (lemon scent = milk) with only one exposure to
the two combined stimuli. And this period of almost instantaneous learning is unique
to newborn pups: older rat pups didn't learn as quickly and had to have the two stimuli
(milk and lemon scent) presented closer in time than the newborn pups did. The scientists
hypothesize that this powerful learning mechanism exists because milk is so critical
to the pup's survival, and because the mother's odor, under natural
circumstances, is such a significant signal for milk, that rat pups are 'hard wired'
to learn this association as soon as possible. (Human infants also very quickly learn
their primary caretaker's scent as well as the smell of the milk. The findings of this
type of study have practical applications in helping infants transition to other
caretakers or formulas when substitutes must be found.)
2. Classical conditioning: the simplest type of learning in
which the subject comes to make associations between stimuli or antecedent conditions.
This is a passive form of learning in which the subject develops mental expectations from
past events occurring at the same time or in the same sequence (as in the rat pups making
the connection between 'milk' and 'lemon scent').
History of classical conditioning:
- Pavlov: Russian scientist, interested in digestion, notices that the
dogs he is working with salivate not only when presented with food, but when they see
signals that food is coming. He experiments with pairing the food with other stimuli that
do not normally have anything to do with food (ringing a bell when food is given) and discovers
that after a number of pairings of food and bell, not only will the dogs salivate to the bell
even when no food is presented, but they will also salivate to other stimuli that are then
paired with the bell alone. He called this process conditioning; it is now
referred to as classical conditioning.
- Watson: applied the concept of conditioning to humans. Example: showed
'Little Albert', about a year old, a white rat, which interested him initially. Then
the rat was presented to 'Albert' at the same time as a very loud noise was made, and he
reacted in fright to the noise. After repeated pairings (rat and noise), he learned
the association of rat to noise and began to cry at the sight of the rat even when there
was no noise.
- Advertisers rely on this type of conditioning to this day: they count on our
having positive reactions to certain stimuli (the sight of people having a good time or a
beautiful outdoor setting) and then try to get us to make the positive association with
their product (a brand of soda or those 'Golden Arches'...).
3. Terms/concepts in classical conditioning:
- Unconditioned Stimulus (US): a stimulus that normally produces an
involuntary response (food, a positive stimulus, activates salivation; pain, an
aversive stimulus, activates pulling away or fear. In the case of 'little Albert',
loud noises are aversive to babies.)
- Unconditioned Response (UR): the subject's natural reaction to the
unconditioned stimulus (salivation, fear, etc).
- Conditioned Stimulus (CS): another stimulus in the environment that
initially does not elicit a response (a 'neutral stimulus') which the subject learns to
associate with the unconditioned stimulus (the bell when the food is given,
the rat when the loud noise occurs).
- Conditioned Response (CR): this is the same response
(salivation) that the unconditioned stimulus (meat) originally elicited, but now it is a
reaction to the conditioned stimulus (bell) as well.
- Acquisition phase: the period when the neutral
stimulus is becoming an acquired or conditioned stimulus.
- Acquisition depends on a number of factors:
  - Frequency: the pairing of the unconditioned stimulus (US) and the neutral
    stimulus has to be repeated until the learner's behavioral response to both is
    the same. Then the former neutral stimulus has become a conditioned stimulus
    (CS). (If 'Little Albert's' white rat had not been consistently paired with the
    startling noise, the noise might have been associated instead with some other
    aspect of the environment that was consistently present.)
  - Contingency: The two kinds of stimuli have to be introduced at nearly the same
    time, with the CS preceding (or accompanying) the US. If the noise had been made
    well before Albert was shown the rat, he might not have become afraid of the rat.
  - Consistency: When first learning a new association between stimuli, they must be
    consistently paired. If the pairing only happens sometimes, the learner may not
    make the connection between the two.
  - Timing ('latency'): the two stimuli have to occur close enough together in time
    that the learner makes the connection. (If too much time passes between the
    presentation of the neutral stimulus and the US, the association may not get
    made.)
There are a few exceptions to these
'rules' of acquisition. We discussed taste aversion: if you get sick
from spoiled or poisonous food, the nausea may not occur until hours
later, but you still are classically conditioned to avoid that
food's taste or smell. This is obviously adaptive, as it's rare that
something poisonous causes instant illness. This is also a case
where a single exposure causes learning; you don't want to have to
eat a poisonous mushroom twice to learn that it causes you to have a
very unpleasant response.
Situations that are highly emotional are also not
as dependent on frequency, contingency, consistency, and
timing. Often a phobia (see below) is the result of a
single exposure to a very frightening situation.
- Generalization: the subject responds to other stimuli that are similar
in some way to the conditioned stimulus (Albert's fear of other white furry animals, not
just the white rat). This is actually very useful in terms of survival: if you have been
bitten by one snake, you are then wary of all snakes that look the same; you don't have
to personally test each one to see if it bites! The problem with generalization is that
we can carry it too far, resulting in phobias.
- Phobias: unreasonable or unrealistic fears that are
severe enough to interfere with normal behavior. These are a result of classical
conditioning and generalization, and are also called conditioned emotional
responses. (Example: a small child accidentally locks himself in an abandoned
refrigerator or small closet. He then becomes panic-stricken when he is in any small
enclosed space, such as an elevator, which he previously may have enjoyed.)
Phobias can be treated by utilizing relaxation techniques (you cannot be relaxed and
panic-stricken at the same time; the nervous system doesn't work that way. Remember the
sympathetic and parasympathetic nervous systems?). Then, when the person is relaxed, he/she
is introduced in a non-threatening way, a little at a time, to the situation that makes
him/her phobic, until the panic response is extinguished.
- Stimulus Discrimination: The way we cope with the problem of
over-generalization is by learning stimulus discrimination, the ability to tell the
difference between varied stimuli - and to respond differently to them. (You learn to panic
when it is a real rattlesnake at your feet, but not at the plastic toy one your little brother
likes to toss at you now and then! It may look real, but you can discriminate between the
real thing and the 'toy'.)
- Higher order conditioning: when a well-learned conditioned
stimulus can be used to reinforce further learning. Pavlov could have
conditioned the dogs further by pairing the ringing
of the bell with another stimulus, such as flickering the lights. My
cat originally was conditioned to the sound of the actual opening of the
can of cat food, but now she comes running when I take the can opener
out of the drawer. This is
higher order conditioning, which is what advertisers rely on as you watch TV.
(They don't actually give you the beer when they show you the mountains, people
having a good time, etc. It is 'higher order' conditioning, using your associations with
a positive experience to imply that their brand of beer is best.)
- Extinction: If a conditioned stimulus is presented long enough without
being paired with the unconditioned stimulus, the conditioned response weakens and
eventually ceases, becomes 'extinct'. (For instance, my dog loves carrots and has
learned that the sound of the vegetable peeler means she will get a taste of carrot; she
comes running. If I were to stop giving her a piece of carrot each time I peel them, she
would eventually stop coming at the sound of the peeler. She would 'unlearn' the
association of that sound with a food she likes; it would have become extinct. And I
could cause extinction even faster if I only used the peeler on onions, which she hates.)
- Spontaneous recovery: Conditioning which has undergone extinction
may reappear at a later time, and even a number of times before it disappears completely. This reappearance of a
behavior after apparent extinction is called spontaneous recovery. In a way, this prepares us
to deal with life's changing conditions: just because something didn't work today doesn't
necessarily mean it won't work tomorrow. (Remember the soda machine? Well, maybe today it
will work, because perhaps the repairman came since you last tried it and lost all your
change!)
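A side note, not from the lecture itself: for anyone who likes to see the mechanics spelled out, here is a minimal toy sketch in Python of acquisition and extinction. The delta-rule update and the learning rate are illustrative assumptions, not a specific published model; associative strength simply climbs during CS-US pairings and decays during CS-alone trials.

```python
# Toy model of classical conditioning (illustrative assumptions only):
# associative strength moves toward 1 when the US accompanies the CS
# (acquisition) and back toward 0 when the CS appears alone (extinction).

def run_trials(strength, n_trials, us_present, rate=0.3):
    """Apply a simple delta-rule update once per trial."""
    history = []
    for _ in range(n_trials):
        target = 1.0 if us_present else 0.0
        strength += rate * (target - strength)
        history.append(round(strength, 3))
    return strength, history

strength = 0.0
strength, acquisition = run_trials(strength, 10, us_present=True)  # bell + food
strength, extinction = run_trials(strength, 10, us_present=False)  # bell alone
print("acquisition:", acquisition)  # rises toward 1.0 over paired trials
print("extinction: ", extinction)   # falls back toward 0.0 without the US
```

Notice that frequency and contingency both show up even in this toy version: strength grows only across repeated trials, and only when the US actually accompanies the CS.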
- Vicarious conditioning: often referred to as 'secondhand'
or 'social' learning. Because we are born with so few natural instincts, we have to
learn what to fear, what to like, etc., and if we couldn't learn to avoid deadly
situations from others' experiences, many of us would never survive to
grow up! Many of these associations are learned by observing how others
react; we don't have to be bitten by a poisonous snake to learn to fear it (the learning
might kill us!). If others around us react to the sight of a snake with fear, we can learn
to fear (and avoid) snakes and thus avoid being hurt in the learning.
The same goes for a small child learning to like new foods: children take their
cues from how we react. The outcome of this type of classical conditioning is a Conditioned
Emotional Response (CER).
4. Operant Conditioning: Learning from the results of what we do.
Behavior that occurs in order to make something happen is called operant or instrumental
behavior. The early behaviorists (remember Skinner?) believed that you could teach a
person to do or become anything you wanted if you had total control over the conditions of
his/her life. By rewarding some behaviors and punishing others, you could completely
control the individual's behaviors. Thorndike called this the "Law of Effect."
The easy way to remember these behavioral principles of learning is to think of the ABCs:
- A = Antecedent: the conditions (stimuli) presented to the
learner that indicate how likely a consequence is to occur
- B = Behavior: how the learner responds under these
conditions, what to do or not to do to have a good outcome.
- C = Consequences: the consequences or result of the behavior that
either encourage the subject to repeat the behavior (reinforcement)
or that discourage a repeat of the behavior (punishment).
How operant conditioning works: the results of your behavior have
consequences. (Where have you heard this before?) "If you study hard, you will get
good grades." Well, no, not if you are not a student... There may
be other reasons to study, but it won't get you good grades.
But under specific circumstances (A, the antecedents),
certain Behaviors result in particular
Consequences.
If you are a student, then whether you study or not has consequences, whether the positive
reinforcement of praise or good grades (and, when I was a kid, a monetary reward for a good
report card), or punishment ("Since your report card is so bad, you
can't go out on week nights anymore.") Reinforcement increases
the chance the behavior will occur; punishment decreases the
likelihood that you will repeat the behavior.
There are actually four kinds of consequences: positive
reinforcement, negative reinforcement, positive punishment, and
negative punishment. ('Positive' and 'negative' do not refer to whether
something is good or bad, but to whether something is given to or done
to the person or taken away from the person.)
When I come home from work (A), my dog barks (B) to be let out (C): I am
reinforcing her behavior of barking by giving her what she wants.
(Positive reinforcement for her.)
When my dog barks as I first come home from work (A), I let her out (B)
in order to stop her noise (C). (Negative reinforcement for me.)
When my daughter used to come home from school (A), the dog would bark
(B), and my daughter would yell at her to shut up (C). (Positive
punishment.)
When I came home from work and found that the dog had peed on the floor (A), and
my daughter had just yelled at the dog instead of putting her out (B), I would make
my daughter clean it up (C) (positive punishment) and also not let her
borrow my car (C). (Negative punishment, loss of a privilege.)
During the acquisition phase of operant conditioning,
the learner rarely can accomplish a desired
behavior by learning all the steps at once: the behavior must be 'shaped'. Shaping
the behavior and chaining all the needed steps in the right order are accomplished
by reinforcing successively closer attempts at achieving the goal. The
less-than-perfect steps are called 'approximations'. (For instance, if you're teaching a
child to tie his shoe, you can't just show him once and expect him to do it right. You
first reinforce (praise) just his attempts to cross the laces. Once the child has that down,
you prompt him to pass one shoestring under the other, then to make a loop, etc. And
maybe at first, the shoes aren't tied tightly enough to stay tied for long, but this is
still another successive approximation in the whole process.)
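Here is a rough sketch of the shaping loop itself, in Python (my own illustration; the 'learner', the numbers, and the update rule are all invented assumptions, not a model of a real organism): reinforce attempts that meet the current criterion, then raise the criterion toward the full target behavior.

```python
import random

# Rough sketch of shaping (all parameters invented for illustration):
# attempts that meet the current criterion are reinforced, which improves
# the learner's skill, and the trainer then raises the criterion.

random.seed(1)
skill = 0.1      # how good the learner's attempts tend to be (0..1)
criterion = 0.2  # how close an attempt must come to earn reinforcement

for trial in range(1, 31):
    attempt = max(0.0, min(1.0, random.gauss(skill, 0.1)))
    if attempt >= criterion:
        skill = min(1.0, skill + 0.05)          # reinforced attempts improve skill
        criterion = min(1.0, criterion + 0.05)  # the trainer raises the bar
        print(f"trial {trial:2d}: attempt {attempt:.2f} reinforced; "
              f"next criterion {criterion:.2f}")
```

Each printed line is one reinforced 'successive approximation'; by the time the criterion reaches 1.0, only the complete behavior earns the reward.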
Again, there are four aspects of the acquisition phase that affect how quickly and
thoroughly conditioning occurs:
- Frequency: the sequence (the ABC's of
learning) has to be repeated until the learner understands that the
antecedent conditions (stimuli),
behavior and consequences are all connected.
- Contingency: Only the desired behavior is reinforced (or the
undesired behavior punished), and the reinforcement or punishment has to
follow, not precede, the behavior. In other words, the only thing which causes that
result is that particular behavior.
- Consistency: When first learning a new behavior, each and every
successful attempt should be rewarded. If it only happens sometimes, the learner will not
make the connection between behavior and result as quickly.
- Timing ('latency'): the resulting reinforcement or punishment has to
happen soon enough after the behavior that the learner makes the connection. (If you spank
the puppy an hour after he peed on the floor of the kitchen, he won't know why he is being
punished. Best: Catch him in the act; he starts peeing, he gets yelled
at or a swat on the rump.)
And, as in classical conditioning, there are exceptions to
these rules of acquisition, in that very powerful reinforcers or punishments
can cause conditioning to happen rapidly or even with one incident.
(You will never again stick your finger in a live electrical outlet!)
Using Reinforcers and Punishments
- Each training situation requires that the trainer figure out the particular reinforcers
that work best for each subject. This is highly individual. (If you don't like chocolate,
then chocolate candy bars are not reinforcing to you.)
- Primary reinforcers are those that meet survival ('primary') needs and
drives (food, drink, sex, sleep, etc.). The problem with primary reinforcers is that the drive
or need can be satiated (fully met), so one more chocolate candy bar won't motivate the
subject: he's full!
- Secondary reinforcers are those which are learned. These include praise, money,
social approval, etc. (Under some circumstances, primary reinforcers can be
secondary ones as well, as when the primal need has been met and the subject has learned to
save the chocolate candy he just got as a reinforcer for a time when he is hungry.
In this sense, the candy has become a 'token', something which can satisfy a need in the
future, just as money can be saved to satisfy needs we have in the future.)
A useful grid of the kinds of consequences is as follows:
- Consequences that increase the frequency or likelihood of a behavior recurring:
  - Positive reinforcers: something needed or wanted by the learner, a reward for the
    behavior.
  - Negative reinforcers: the end of an aversive stimulus or situation ("It feels so
    good when it stops!").
- Consequences that decrease the frequency or likelihood of a behavior recurring:
  - Punishment: something painful, unpleasant, or otherwise aversive that occurs as
    the result of the behavior.
  - Response cost: a kind of punishment which involves the loss of something needed
    or wanted as a result of the behavior (stay out late and lose the use of your
    parents' car).
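Because 'positive/negative' and 'reinforcement/punishment' are two independent yes/no questions (was a stimulus added or taken away? did the behavior become more or less likely?), the whole grid fits in a few lines of code. Here is a mnemonic sketch in Python (my own addition, using the lecture's labels):

```python
# Mnemonic for the 2x2 grid of consequences:
# 'positive'/'negative' = stimulus added vs. taken away;
# reinforcement/punishment = behavior made more vs. less likely.

def classify(stimulus_added: bool, behavior_increases: bool) -> str:
    sign = "positive" if stimulus_added else "negative"
    kind = "reinforcement" if behavior_increases else "punishment"
    if sign == "negative" and kind == "punishment":
        return "negative punishment (response cost)"  # loss of a privilege
    return f"{sign} {kind}"

print(classify(True, True))    # dog barks, gets let out -> positive reinforcement
print(classify(False, True))   # barking stops when I let her out -> negative reinforcement
print(classify(True, False))   # yelling at the dog -> positive punishment
print(classify(False, False))  # losing the car -> negative punishment (response cost)
```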
Once the acquisition phase is over and the
behavior is thoroughly learned, it is usually not necessary to reinforce every
correct behavior. How often and when reinforcement occurs, however, does affect the
frequency of the behavior, as well as how long the behavior is retained after
reinforcement stops occurring (extinction):
Reinforcement schedules: for each type, consider when and how often reinforcement
occurs, how powerful it is in motivating the frequency of the behavior, and how durable
the behavior is (how long before extinction occurs when reinforcement ceases).
- Continuous reinforcement:
  - When: reinforcement occurs every time the behavior is performed. (Ex.: my
    neighbor started giving her child a penny every time she pulled a weed.)
  - Power: works until satiation or exhaustion sets in; the learner can always take
    a break and pick up where he/she left off.
  - Durability: not durable. The learner has the expectation that every behavior
    will be rewarded; once it stops, he/she stops soon after.
- Fixed ratio:
  - When: reinforcement takes place after a complete (set) number of times the
    behavior is performed. (Ex.: my neighbor was running out of pennies, so she paid
    her child a nickel for every five weeds pulled from the lawn.)
  - Power: very powerful. (Like piece rates: if you work faster, you earn more.)
  - Durability: not very durable, but a bit more so than continuous reinforcement,
    as it takes the learner longer to figure out that reinforcement has stopped.
    Again, the expectation is that the reinforcement will occur regularly, every set
    number of times the behavior is enacted.
- Variable ratio:
  - When: reinforcement is given on a set ratio (number of times the behavior is
    performed), but it's never clear exactly which of the behaviors will get the
    reinforcement. (Ex.: now my neighbor goes out every so often, counts up the
    weeds, divides by five, and gives her child a nickel for each set of five.)
  - Power: the person will still earn more by working faster (the basic return for
    effort stays the same), but the uncertainty usually results, especially in young
    children or animals, in a slower rate of effort.
  - Durability: because of the uncertainty factor, this schedule is more durable:
    the person who is not sure when he/she is going to be reinforced is also not
    sure when reinforcement has stopped, and will keep at it longer. (Another
    example: slot machines are set to return a ratio of their earnings to the
    players, but you never know which pull of the lever will pay off.)
- Fixed intervals:
  - When: reinforcement is given for the first correct response after a set time
    interval, regardless of how many behaviors the learner performs in that time
    period. (Ex.: now that my neighbor's daughter is older and a pretty good worker,
    she gets paid $5 for each hour spent weeding the yard and garden. Counting all
    those weeds was a drag!)
  - Power: the amount of work done is less than in fixed or variable ratio
    schedules, and it varies over time; there is no payoff, in terms of
    reinforcement, for getting much done at the start of the time period. (Ex.: if
    an assignment is due every two weeks, many people don't do as much work on it
    the first week; they wait until it's almost due!)
  - Durability: extinguishes more quickly than variable reinforcement.
    Predictability again plays a role in how quickly a behavior is extinguished with
    fixed intervals. (You are more likely to quit and look for another job when your
    expected paycheck doesn't come a couple of times!)
- Variable intervals:
  - When: reinforcement is given for the first correct response after a variable
    period of time. (Ex.: if my neighbor sees that her daughter is slacking off, she
    can check up on her periodically, and if the daughter is not weeding at the
    moment her mother looks, she will not get paid for that hour at all!)
  - Power: because you never know when you are going to be reinforced, you tend to
    work at a steady rate: you want to be 'caught' doing the right thing, but you
    don't get extra credit for the work you did that wasn't noticed!
  - Durability: this schedule of reinforcement is the most durable of all; if you
    can't predict when the payoff will come, neither can you predict when it won't.
    (As ads for the state lottery say, "You can't win if you don't play!", and the
    possibility of a payoff, remote as it is, keeps people shelling out their money
    for that ticket week after week.)
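To see the schedules side by side, here is a small Python simulation (my own illustration; the ratio of 5 and the 10-tick interval are arbitrary assumptions) showing which responses would earn reinforcement under each delivery rule.

```python
import random

# Toy comparison of reinforcement schedules. A 'response' happens every
# 3 time ticks; the ratio (5) and interval (10 ticks) are arbitrary.

random.seed(0)

def fixed_ratio(n, ratio=5):
    # every ratio-th response is reinforced
    return [(i + 1) % ratio == 0 for i in range(n)]

def variable_ratio(n, mean_ratio=5):
    # on average one in mean_ratio responses pays off, but which one is unpredictable
    return [random.random() < 1 / mean_ratio for _ in range(n)]

def fixed_interval(times, interval=10):
    # the first response after each interval boundary is reinforced
    rewarded, next_due = [], interval
    for t in times:
        hit = t >= next_due
        if hit:
            next_due = t + interval
        rewarded.append(hit)
    return rewarded

def variable_interval(times, mean_interval=10):
    # like fixed_interval, but the required wait varies unpredictably
    rewarded, next_due = [], random.expovariate(1 / mean_interval)
    for t in times:
        hit = t >= next_due
        if hit:
            next_due = t + random.expovariate(1 / mean_interval)
        rewarded.append(hit)
    return rewarded

times = list(range(0, 60, 3))  # 20 responses, one every 3 ticks
print("fixed ratio:      ", sum(fixed_ratio(len(times))), "reinforcements")
print("variable ratio:   ", sum(variable_ratio(len(times))), "reinforcements")
print("fixed interval:   ", sum(fixed_interval(times)), "reinforcements")
print("variable interval:", sum(variable_interval(times)), "reinforcements")
```

The simulation only captures the delivery rule; durability (what happens when the reinforcements stop) follows from how predictable each rule makes the payoff.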
Many of the concepts explained under Classical Conditioning
are also true for Operant Conditioning:
- Stimulus generalization: The aspects of the environment
that indicate whether the conditions are right for a behavior to be
effective in producing a consequence are referred to as the
environmental stimuli. Once a young child learns that putting a coin
in a slot produces candy, he may try to put coins into parking meters,
etc.
- Stimulus discrimination: However, the child will soon learn to
discriminate between a parking meter and a candy machine, even though
they may look a lot alike.
- Stimulus control: the learner has to learn to recognize when the correct conditions
exist for his/her behavior to produce the desired result (so, in a sense, the stimuli in
the environment control the behavioral responses). (Example: if all the lights go out
during a violent storm, you will not bother to flip the light switch
on; you know, based on the
stimuli available in the environment, that the switch won't work.)
- Extinction: In classical conditioning, this is when the conditioned stimulus is no
longer paired with the unconditioned stimulus, so, after a while, the conditioned response
no longer occurs. In operant conditioning, this is when a behavior is no longer reinforced
(or punished) and the conditioning eventually becomes 'extinct'.
- Spontaneous recovery: the behavior can be extinguished in one set of trials, but
may recur spontaneously at a later time without further reinforcement. The subject is
'testing' to see if conditions have changed and the behavior is again
effective in getting the desired result.
- Vicarious or observational learning: Because we are, again, a social species that
has little innate survival knowledge, we must learn about many things that are
dangerous and potentially deadly from our elders, benefiting from their wisdom and
knowledge. While we learn emotional reactions to various stimuli through vicarious
conditioning (observing our parents, for instance, in how they react to situations that
are new to us and picking up their 'vibes'), we also learn behaviors - what to do and
what not to do, as well as HOW to do a number of things - by observation. This learning
is dependent upon our paying attention to the ABCs, remembering them, and
then imitating them. Until the learner demonstrates that he/she has
learned a new behavior, the learning is considered latent (hidden), that is, not yet put
into practice.
5. OTHER KINDS OF LEARNING: Not all
learning is 'conditioning'. Internal 'thinking' processes bring about some kinds of
learning. Understanding, anticipating, and figuring things out are all cognitive
processes, and the reinforcement for these cognitive activities can be just the
knowledge itself. The knowledge may be immediately useful, but curiosity is a
powerful drive in and of itself for many mammals, and especially for humans. It's as if
many organisms, whose other needs and drives are
satisfied, have the urge to explore the world just in case the information should be needed
in the future.
For example, consider the concept of cognitive maps. These are
internal, detailed layouts of the world built from experience. If your usual route to work is
blocked, you know, from driving around town doing other things, basically how the town is
laid out, and so, even though you may not have taken that particular route before, you can
follow it to your workplace. Even rats and bees have internal or cognitive maps. (If you
capture a bee, put it in a dark box, and carry it someplace new, it can still find its
hive and the flowers it has been using as a food source. A rat, placed in a maze and left
to wander around, can later learn to find its way to a food source faster than a rat which has
never been in the maze before. This is due to the cognitive map of the maze it
has constructed in its wanderings.)
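As a loose computational analogy (my own sketch, not something from the lecture), a cognitive map works like a stored graph of places and connections: once the layout is known, a brand-new route can be planned without ever having traveled it end to end. The maze below is invented for illustration.

```python
from collections import deque

# A cognitive map as a stored graph of places and connections.
# The maze layout here is invented for illustration.

maze = {
    "start": ["hall"],
    "hall": ["start", "left turn", "right turn"],
    "left turn": ["hall", "dead end"],
    "right turn": ["hall", "food"],
    "dead end": ["left turn"],
    "food": ["right turn"],
}

def plan_route(graph, origin, goal):
    """Breadth-first search: plan a path using only the stored map."""
    frontier = deque([[origin]])
    visited = {origin}
    while frontier:
        path = frontier.popleft()
        if path[-1] == goal:
            return path
        for nxt in graph[path[-1]]:
            if nxt not in visited:
                visited.add(nxt)
                frontier.append(path + [nxt])
    return None

# Like the rat that has already wandered the maze, we can plan a route
# we may never have run in one go before.
print(plan_route(maze, "start", "food"))
# -> ['start', 'hall', 'right turn', 'food']
```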
Other kinds of cognitive learning:
- Vicarious and latent learning demonstrate one kind of cognitive learning. Because
they are based on the learner acquiring conditioned responses, they are discussed in
the sections on conditioning, but until they are exhibited in actual behavior, they fall
outside the scope of behaviorism, which is based on measurable behaviors.
- Rote learning (repetition until something is memorized) is an efficient way to acquire
'facts' (the 'times tables', for instance). The 'facts' don't have to have meaning; even a
parrot can learn to say the names of numbers in the right sequence, but does it understand
the concept of numbers? Does it really know how to count?
- Discovery learning is when exploration of a problem or issue leads to understanding of
the underlying principles. A child who has played with blocks that come in lengths that are
multiples of each other intuitively learns that 2 half-length blocks make one long block, but it
takes 4 quarter-length blocks to make a full-size block. This type of block play makes an
understanding of the underlying principles of addition, multiplication, and fractions possible
in a way that mere rote memorization cannot. This type of learning is much more flexible, as
it can be applied to a range of circumstances.