If you studied psychology in the twentieth century, you may remember B. F. Skinner and his work on operant conditioning. Skinner studied whether behavior increased or decreased based on how often, and in what manner, a reinforcement (reward) was given.
You provide a food pellet after a certain interval of time has passed, for example, five minutes. The rat gets a food pellet the first time he presses the bar after five minutes is up.
Reinforcement is based on time and the time interval is always the same.
Reinforcement is based on time. The amount of time varies, but it averages to a particular time
Instead of basing the reinforcement on time, you base it on the number of bar presses. The rat gets a food pellet after every 10 bar presses.
Reinforcement is based on the number of bar presses, and the number is always the same
Reinforcement is based on the number of bar presses. The number varies, but it averages to a particular ratio
For operant conditioning to work, the reinforcement (reward) must be something that
particular audience wants. Hungry rats want food pellets. What does your particular audience really
Think about the pattern of behavior you’re looking for, and then adjust the schedule of rewards to fit
that schedule. Use a variable ratio schedule for the maximum behavior repetition.