Wednesday, May 28, 2008

Juicing the bases early, part II

As a followup to my last post, I am doing another study with a few changes. First, I added 1997 and 1998 to the sample. I would have added 1999 too but RetroSheet apparently doesn't have it for some reason. Second, I fixed some minor query problems that were causing me to double-count PA's with balks and pickoffs. Third, I altered my SQL procedure to output the results broken down by innings and score differential.

So, just to recap:

We are looking for plate appearances in completed games played between 1997-2007 for which there is pitch data, where:

- There are runners on second and third, one out.
- Prior to the seventh inning.
- The batter is in one of the top-six spots of the order.
- The batter is not Barry Bonds.

We are seperating PA's into two types: PA's where there is an obvious intentional walk (four straight intentional balls,) and PA's where there is not. Then, we are looking at how often teams wind up winning the game when they walk the batter vs. when they let him hit.

The following is the results:
















































































































































































































































































































































































































































































































Inning
Score Diff
No-IBB Record
IBB Record
No-IBB W%
IBB W%
Diff
IBB Gain
1-50-00-00.0000.0000.0000.000
2-52-210-40.0870.000-0.087-0.348
3-52-90-00.1810.000-0.1810.000
4-52-430-70.4440.000-0.444-0.311
5-52-260-60.0710.000-0.071-0.429
6-50-440-140.0000.0000.0000.000
1-40-10-00.0000.0000.0000.000
2-44-451-70.0820.1250.043+0.347
3-44-200-40.1670.000-0.167-0.667
4-44-630-80.0600.000-0.060-0.478
5-42-571-100.0340.0910.058+0.627
6-41-690-90.0140.000-0.014-0.129
1-30-00-00.0000.0000.0000.000
2-319-770-110.1980.000-0.198-2.177
3-311-601-170.1550.056-0.994-1.789
4-313-902-30.1260.4000.274+1.369
5-34-745-330.0510.1320.080+3.051
6-36-660-300.0830.000-0.083-2.500
1-232-1092-30.2270.4000.173+0.865
2-231-1041-30.2300.2500.020+0.081
3-223-1043-190.1810.136-0.045-0.984
4-213-931-80.1230.111-0.012-0.104
5-220-1078-280.1570.2220.065+2.330
6-27-671-250.0950.038-0.056-1.459
1-196-2196-100.3050.3750.070+1.12
2-142-921-30.3130.250-0.063-0.254
3-172-1636-150.3060.286-0.021-0.434
4-123-1092-70.1740.2220.048+0.432
5-135-9410-150.2710.4000.129+3.217
6-122-736-240.2320.200-0.032-0.947
1Tied288-41612-200.4090.375-0.034-1.090
2Tied29-410-20.4140.000-0.414-0.829
3Tied138-2286-150.3770.286-0.091-1.918
4Tied54-946-130.3650.312-0.049-0.932
5Tied42-11412-160.2690.4290.159+4.462
6Tied34-7411-220.3150.3330.019+0.611
1+134-461-20.4250.333-0.092-0.275
2+116-200-00.4440.000-0.444-0.000
3+1106-1092-10.4930.6660.174+0.521
4+146-524-30.4690.5710.102+0.714
5+166-574-70.5370.364-0.173-1.902
6+158-609-110.4920.450-0.042-0.831
1+228-230-10.5490.000-0.549-0.549
2+210-70-00.5880.000-0.588-0.000
3+268-381-00.6421.0000.358+0.358
4+247-351-00.5731.0000.427+0.427
5+271-311-20.6960.333-0.363-1.089
6+254-386-20.5870.7500.163+1.304




Phew! Sorry about that monstrosity. We can probably ignore the first 18 lines I guess. Frankly, when teams are behind by 5 runs in the 3rd inning, and are facing this situation, there is going to be a lot of losing regardless of what they do, and not many oIBB's are issued. It's in the middle of the table where it starts to get interesting.

In my previous study, the "-1" situation was the outlier. When down by a run, teams actually seemed to gain wins when IBBing the batter early. Here we see it broken down further. Recall that I am using a different (larger) sample size this time around, but the effect is still there. Almost all of the difference comes in the 5th inning, where teams won 10 of 25 games after issuing an IBB, which is more than 3 full wins better than the non-IBB sample. The rest of the "down by a run" scenarios more or less even out. Is this a sample size fluke, or something else I haven't considered?

Since nobody really enjoys looking at a 54-row table, let's do some grouping. I will group the six innings into three groups of 2. Let's also group the scores into: Tied, ahead by 1 or 2, and behind by 1 or 2.

Here are the Wins gained by IBB's in these situations:
























































Innings
Score Diff
IBB Gain
1-2Behind by 1 or 2+1.817
1-2Tied-1.919
1-2Ahead by 1 or 2-0.824
3-4Behind by 1 or 2-1.090
3-4Tied-2.850
3-4Ahead by 1 or 2+2.021
5-6Behind by 1 or 2+3.140
5-6Tied+5.073
5-6Ahead by 1 or 2-2.517




My findings seem to actually be in line with both MGL and Peter. MGL asserted that intentional walks in this situation was not a good idea, and my findings seems to show that to be mostly true. Peter's assertion was that the results were too small to say much, and, frankly, that seems to be true as well. However, look at with innings 5 and 6, with the team tied or behind by a run, a total of 8 wins were gained with intentional walks.

Indeed, in innings 5 and 6 with the score either tied or the pitching team behind by a run, teams who did not issue the walk went 160-529, for a .232 winning%, whereas teams that took the bat out of the hitter's hands in this situation went 48-130 for a .282 winning%, thus improving their win total fairly substantially. I find this pretty interesting. Innings 1-4 however, seem to be a fairly consistently against issuing the walk.

So, in summary, is it a good idea to walk the bases loaded when it's early in the game and a good hitter is at the plate and first base is open? I have no idea.

Here is the IBB_GAIN broken down by inning. Too lazy to make a fancy table like before so I'll just copy/paste from sql*plus...


INNING SUM(IBB_GAIN)
---------- -------------
5 10.2684563
4 1.11693613
1 .074129052
2 -3.1787919
6 -3.9504488
3 -4.9123056

No comments: