- Location
- Andorra and Spain
I've been thinking about this a bit recently, and wanted to bring it up here for discussion.
The matching law states that if there is a choice between two behaviours, the relative proportion of responses matches the relative proportion of reinforcers that are earned by those behaviours.
In practical terms, that means that, say there are two pots in front of the dog and the dog gets rewarded for touching them. If the dog earns the same piece of cheese for touching either pot, then the dog will touch each pot 50% of the time (everything else being equal).
If the dog gets reinforced for hitting the pot on the left only half the time that he gets reinforced for hitting the pot on the right, then he will be half as likely to touch the pot on the left. But he will still touch it.
In that example, I'm only talking about quantity of the rewards, but the same can be said for desirability of the reinforcer: if behaviour X is reinforced with a reinforcer that is twice as desirable than that used for behaviour Y, then behaviour X will occur twice as often as behaviour Y.
Let's consider how this affects something like walking at heel. The environmental rewards for pulling are obvious and large: getting to move forwards, sniffing a lamppost, whatever it is. Compare that reinforcer to the piece of cheese in your treat pouch. What is the ratio of desirability of the two reinforcers? I think it's pretty obvious that the environmental reinforcers trump our cheese, and by a large amount. So, what do we have to do to even up the balance? Easy! Reinforce more with our pitiful cheese.
It basically boils down to some simple maths:
(Value of reinforcer A x Frequency of reinforcement A) : (Value of reinforcer B x Frequency of reinforcement
That is the ratio with which Fido will choose behaviour A over behaviour B. So even though the value of reinforcer B is super high, if we restrict the frequency of the dog being able to get that reinforcement, then that side of the ratio tends to zero. Meanwhile, we work with the highest reinforcer we can on "our" side of the ratio, and make the rate of reinforcement as high as possible, then we are massively tipping the balance of the ratio in our favour.
Dogs do what pays best. If your dog is disengaging, then you have to find a way to up your rate of reinforcement.
The matching law states that if there is a choice between two behaviours, the relative proportion of responses matches the relative proportion of reinforcers that are earned by those behaviours.
In practical terms, that means that, say there are two pots in front of the dog and the dog gets rewarded for touching them. If the dog earns the same piece of cheese for touching either pot, then the dog will touch each pot 50% of the time (everything else being equal).
If the dog gets reinforced for hitting the pot on the left only half the time that he gets reinforced for hitting the pot on the right, then he will be half as likely to touch the pot on the left. But he will still touch it.
In that example, I'm only talking about quantity of the rewards, but the same can be said for desirability of the reinforcer: if behaviour X is reinforced with a reinforcer that is twice as desirable than that used for behaviour Y, then behaviour X will occur twice as often as behaviour Y.
Let's consider how this affects something like walking at heel. The environmental rewards for pulling are obvious and large: getting to move forwards, sniffing a lamppost, whatever it is. Compare that reinforcer to the piece of cheese in your treat pouch. What is the ratio of desirability of the two reinforcers? I think it's pretty obvious that the environmental reinforcers trump our cheese, and by a large amount. So, what do we have to do to even up the balance? Easy! Reinforce more with our pitiful cheese.
It basically boils down to some simple maths:
(Value of reinforcer A x Frequency of reinforcement A) : (Value of reinforcer B x Frequency of reinforcement

That is the ratio with which Fido will choose behaviour A over behaviour B. So even though the value of reinforcer B is super high, if we restrict the frequency of the dog being able to get that reinforcement, then that side of the ratio tends to zero. Meanwhile, we work with the highest reinforcer we can on "our" side of the ratio, and make the rate of reinforcement as high as possible, then we are massively tipping the balance of the ratio in our favour.
Dogs do what pays best. If your dog is disengaging, then you have to find a way to up your rate of reinforcement.