Lexicon of operant conditioning terms

A brief lexicon of behavioral terms used under this training method:

TERM	DEFINITION
Reinforcement	Anything that strengthens a behavior
Punishment	Anything that suppresses a behavior
Positive	Something added to influence a behavior
Negative	Something taken away to influence a behavior

Application of these terms:

TRAINING APPLICATION	WHAT HAPPENS	To do this:
Positive Reinforcement R+	Something is added that strengthens a behavior	the something is good or desired by the subject
Negative Reinforcement R-	Something is taken away that strengthens a behavior	the something is bad or aversive to the subject
Positive Punishment P+	Something is added that suppresses a behavior	the something is bad or aversive to the subject
Negative Punishment P-	Something is taken away that suppresses a behavior	the something is good or desired by the subject
Extinction E	A behavior goes away because it is not reinforced	nothing happens to the subject, good or bad

Summary of dog training realms:

TERM	DEFINITION
"Traditional" training	Works primarily in the R-, P+ realm
Positive traditional training*	Mixes rewards and lures with R-, P+
So-called "clicker" training	Works primarily in the R+, P-, E realm
Crossover trainer	PTT or a traditional trainer who switched to clicker training

Examples -- applying operant training:

	Training the sit	Training the recall
Positive reinforcement R+	Dog sits (either on cue or in free shaping with no cue) Trainer then gives the dog a treat	Dog comes (cue or no cue) Trainer then gives the dog a treat
R+ with event marker	The dog sits (cue or no cue). The instant the dog's fanny hits the ground, the trainer clicks. Trainer then gives the dog a treat.	Trainer cues the dog to come. The instant the dog turns toward the trainer, the trainer clicks.** When the dog comes in, it gets a treat.
Luring (non-operant)	The trainer pulls out a treat and holds it high above the dog's head. In trying to reach the treat, the dog rocks back and sits. The dog then gets to eat the treat	The dog is running away or chasing a cat. The trainer gets out a treat and yells "Lassie, Cookie!!" Dog comes in and gets the treat (you hope).
Negative punishment P-	Trainer cues the dog to sit, but the dog stays standing. Trainer leaves the room with treats in hand. Dog loses the opportunity to be rewarded.	Trainer cues the dog to come. Dog still chases cat. Trainer catches dog and removes the dog from the presence of the cat and removes its ability to run and play.
Negative reinforcement* R-/P+	Trainer commands the dog to sit. If the dog does not quickly comply, the trainer jerks sharply and upward on the leash. The choke or pinch collar tightens and the dog feels uncomfortable. The force of the jerk upward makes sitting an easy choice for the dog. The pressure is then relieved.	Dog is running away or chasing a cat. Trainer commands the dog to come. If the dog does not comply the trainer closes the button on the flexilead or steps on the long line. The dog experiences a sharp jerk and turns toward the handler, relieving the pressure from the collar and leash. The handler then praises the dog as it comes in.**
Physical "modelling" (non-operant)	Trainer pushes dog's behind downward. Under this force, the dog sits. Trainer rewards or praises dog.	Dog is out running after a cat. Trainer reels the dog in with the long line, saying "come" as she does so, then rewards or praises.

*In practice there is no distinction between negative reinforcement and positive punishment, which is not the case for R+ and P-. In another example, the dog is pulling on the leash. As the dog pulls, the trainer pops the leash sharply and repeatedly for as long as the dog is pulling, causing the dog discomfort while it is pulling. As soon as the dog stops pulling and thus starts to walk on a loose leash, the trainer stops the jerking. Positive punishment happened when the discomfort of the jerking and tightening the collar suppressed the pulling. Negative reinforcement happened when taking away the jerking strengthened the behavior of walking on a loose leash.

** In each of these examples, there is a secondary or conditioned reinforcer. In the R+ example, the clicker (or other event marker) is a conditioned reinforcer meaning that food or play is coming. In the R- example, the praise is a conditioned reinforcer meaning that no discomfort will be applied while the praise is occurring.

Other notes:

1. The word "positive" is unfortunately used in two completely distinct senses in animal training. The first use comes directly from plain English usage, that is, "positive" means something good or desirable. The second use is a formal, strict definition in operant behavior analysis and means something "added to". If that something added is meant to suppress a behavior, as in positive punishment, then it is anything but good or desirable

2. "Positive" dog training can therefore mean a variety of things. Just because someone says they are "positive" does not mean that aversives are not also used. In fact, "positive" traditional training is a particular mix of the use of "rewards" or things that the dog desires as lures or mood changers (and not as operant conditioned reinforcers) and corrections. Traditional dog training does use operant conditioning, but specifically that using aversives or corrections, such as leash pops, verbal reprimand, intimidation, or electronic stimuli. In other words, traditional dog training is a mixture of operant and non-operant methods, and of positive and aversive elements.

If food or play is used in a non-operant fashion, i.e., as a lure, it is extremely difficult to fade out. For this reason, the mix of reward and punishment is ensured, because the operant conditioning methods of R-/P+ are required to strengthen desired behaviors.

However, the price that this blend of methods exacts is high. The dog may perform well and reliably, but the dog's attitude is often not the same as a dog trained with R+/P-. In fact, many breeds of dogs and many individuals in any breed maybe passed off as untrainable with R-/P+ methods, written off as "stubborn", "stupid", "willful", "has own agenda", "soft", and so on.

Karen Pryor has written eloquently on this topic. Her article, Poisoning the Cue [Teaching Dogs, 2002, vol.1(1)], is the only place I know of where the concepts of mixing positive and negative "discriminative stimuli" are explained.