On Millwood
One of the great things about the statistical boom and the internet is that you can have much more detailed arguments about a player.
10 years ago, vis-a-vis Kevin Millwood, the argument would have gone like this:
"Millwood sucks."
"No, he doesn't."
"Yes, he does, look at his ERA."
"Yeah, but he led the league in ERA a few years ago."
"Yo momma."
Nowadays, with all the new information that is readily available, it is more like this:
"Millwood sucks. Look at his ERA last year"
"No, he doesn't. His FIP is more relevant, and his FIP was pretty good last year. He just had an unusually high BABIP, which isn't likely to be repeated."
"His tRA was terrible, though."
"That's because his LB% was higher than it has been historically."
"Exactly. And that's why his BABIP was so high, and why he isn't as good as his FIP indicates, because he's throwing meatballs up there that the other team is driving all over the park."
"Yo momma."
So...recapping the data that's been kicked around this offseason, Kevin Millwood's Fielding Independent Pitching (his FIP, which is based on his peripheral rates) was pretty good last season -- a 4.02.
His ERA was dramatically higher than that because his Batting Average on Balls In Play (BABIP) was .366 -- the highest in the majors last year among qualifying pitchers.
Doctrinaire DIPS/McCrackenian theory says that Millwood can't really control whether balls in play go for hits or not, and thus it is a product of luck and defense.
The counter to that is that Millwood also gave up the highest rate of line drives in the majors last year, and that the high line drive rate was the result of him throwing meatballs up there for batters to hit, and thus the high ERA and high BABIP are to be expected, and are more indicative of his ability. And thus, you should look at his tRA, which incorporates LD% (among other things), and reveals that Millwood's ERA last season was about what should have been expected.
The question then really becomes...how much control does a pitcher really have over his line drive percentage, and is it something that is that variable or tells that much about a pitcher?
Looking at the 10 pitchers with the highest line drive rates last season, Millwood, of course, is #1...but #2 on the list is Josh Beckett, a guy who isn't exactly known for throwing meatballs at the plate, and who had a pretty good year last year. Matt Cain and Johan Santana also crack the top 10, and they are pretty good pitchers...but then you also have guys like Bronson Arroyo, Jeff Suppan, and Paul Byrd, who fall in the meatballing-strike-thrower category, and ciphers like Ian Snell and Barry Zito.
Nevertheless, my feeling is that if Millwood were just throwing BP pitches up there and that's why his line drive rate is so high, he'd be allowing a higher rate of home runs than he has.
Last year Millwood had the highest line drive rate he's had of his career -- his career percentage is 22.6%, and his previous two seasons in Texas his LD% was around 21%.
So what does all this mean? I'm not sure, exactly. Millwood's K and HR rates have been remarkably consistent since coming to Texas, and while his walk rates have fluctuated some, his walk rate last year wasn't that much higher than in 2006 (and was much lower than in 2007).
Bill James, CHONE, and ZiPS all expect him to be good, not great, this season, in large part, I would wager, because of solid peripherals over the past few seasons. The projection systems expect him, in fact, to be worth more in 2009 than he's being paid, strange as that may seem.
But the projection systems also don't expect him to have a BABIP close to .370 again. Now, the BABIP should come down anyway, because of the better defense behind him, but it really is a question of how much.
So whether or not you think Millwood will be productive (aside from any potential health issues) largely depends on whether you think the line drive rate and BABIP last year was an aberration, a random spike that will level out this year, or if you think he's going to allow a 25% line drive rate and a BABIP north of .350 again.
If you think that the line drive rate will come back down to the 21-22% rate, you probably expect he'll perform about like he did in 2006.
If not, you are expecting a repeat of 2008.
36 comments
|
0 recs |
Do you like this story?
Comments
yay...hope
Stability is key, and JD is a Beast.
Jindal - 2012
"AMMIITAAAAAAAAAAAAAAAAAAAAAAAABHH!!!"
It still all ends with 'yo mamma'
Everything went wrong for Rangers pitchers last year, that’s just not likely to happen again. Theodds are they’ll be healthier and have a better defense behind them and better numbers will result.
G G G E-flat_______ F F F D__________....
"One of the great things about the statistical boom and the internet is that you can have much more detailed arguments about a player."
Honest question: why don’t sports wirters know this? They either write lazy arguments with no homework done or they make claims of fact as if they don’t realize people can verify their statements in a few seconds and expose them as idiots at best and liars at worst.
They may know this
but I think part of it is that they are shooting for a larger demographic than the people you find on these types of blogs. They are writing for us as well as the fans that simply look at the box score numbers to determine the relative worth of a player.
I think Grant does the best job that I have seen of trying to incorporate things like FIP and ERA+ into his articles of the people that target a large range audience.
I'm not even talking about using advanced stats.
That’s a separate issue. I’m just talking about having some actual evidence to support your claim. And the worst of them (Wilbon does this all the time) make statistical-type statements of fact, but just make up the facts. They don’t guess or wonder allowed, they say something as fact that is blatantly wrong and it’s easy to prove them wrong.
Because they're too busy readying Summa Theologica or the latest from Jack Higgins, apparently.
She was a grand old lady...
(I'm expecting somewhere in the middle leaning towards 2008, by the way.)
He actually improved his tRA over 2007, by the way, and, unlike 2007, was right inline with his tRA*.
Millwood is gonna suck this year
But he’ll still be one of the best pitchers on the team.
Ok, the optimist in me says that if the Rangers only allow 800 runs this year, their offense can easily score 150 and that will take the division. But that 800 runs allowed is a BIG if.
make that 950
the rangers can easily score 950 runs, not 150 runs. well, they’ll score 150 runs alright.. by the 12th game or so. :)
Yo momma is so fat.....
(put it up on a tee)
"You shut your mouth when you're talking about the Rangers!!"
Yo mamma...
is so easy she has a mattress on her back.
"The only good is knowledge and the only evil is ignorance."-Socrates
Milly will have a pretty good year...
the ole boy whipped it into shape for the love of greenbacks.
Even he admits he’s sucked the last couple seasons.
What's the Nef tally??
Just wondering
From someone who’s seen him more this spring. I saw him on Saturday and his FB seemed to be pretty much high 80s and seemed to be working consistently high. I’m guessing this is going to get him creamed and was wondering if anyone who might have seen him some other time saw this happening.
I saw him the time before..
when he gave up I believe it was 12 hits and yes his fastball or whats left of it was high and in the 80’s. It was getting crushed.
I really had not seen or heard of that much laboring since Octumom kicked out her litter.
What's the Nef tally??
If thats actually what he has left
Hes a #5 starter that you pray can get you 180 innings that doesn’t leave you out of most games. The ballpark is going to be a nightmare for him.
if he stinks up the joint..
there may be more people praying that he does not get 180 innings so that he can be let go. I believe that is the inning criteria that must be met otherwise the Rangers have the option as to whether he stays or goes in ’10.
What's the Nef tally??
I know its reflected in all the stats listed
But really, 220 hits in less than 169 innings? Ridiculous.
the preceding post was a great success.
i like it when adam does this
he’s pretty good at it.
There has to be a tertiary way to evaluate a pitcher, doing a better job of predicting LD% or BABIP. I would think the best proxy for those things would be pitch fx data: movement on pitches and pitch speed.
I’ll get right on it.
"Josh Hamilton doesn't act like he is Josh Hamilton. He acts like a guy who knows a guy who knows a guy named Josh Hamilton. " - jparks
um
will someone point me to a website that logs pitching data
Ceterum censeo Cat esse delendam - Cahill the Elder
pitch f/x data
From MLB
Pick the year, month, day and the game played. Inside you will find a folder called “pbp”, which contains pitch f/x info. You need to player ID number, which is contained in the “players.xml” file in the original folder containing the pbp folder.
Alternatively, here is the Josh Kalk’s pitch f/x tool
The scorers at the BPIA grade line drives at a
a factor 1.18 times the normalized league wide average. It could be park, it could be scorers, it could be hitters, but as a whole, it is dramatically higher than the rest of the league
PARK_ID PARK_NAME First Last PAw LDf
PHI12 Veterans Stadium 2003 2003 4768 1.23**
ARL02 Ballpark Arlington 2003 2008 26850 1.18
TOK01 Tokyo Dome 2004 2008 283 1.13
CIN09 Great American 2003 2008 28827 1.11
DEN02 Coors Field 2003 2008 29158 1.10
STL10 Busch Stadium III 2006 2008 13967 1.09
KAN06 Kauffman Stadium 2003 2008 27530 1.09
WAS11 Nationals Park 2008 2008 4790 1.09
TOR02 Rogers Centre 2003 2008 27513 1.08
SFO03 Phone Co Park 2003 2008 29439 1.07
MON02 Stade Olympique 2003 2004 7684 1.07
STL09 Busch Stadium II 2003 2005 14280 1.06
STP01 Tropicana Field 2003 2008 27830 1.06
DET05 Comerica Park 2003 2008 28008 1.06
PHI13 Citizens Bank Park 2004 2008 24640 1.06
MIL06 Miller Park 2003 2008 29354 1.06
WAS10 RFK Stadium 2005 2007 14885 1.05
OAK01 Oakland Coliseum 2003 2008 26719 1.03
SEA03 Safeco Field 2003 2008 26683 1.01
CHI12 Comiskey Park II 2003 2008 28644 1.00
NYC16 Yankee Stadium 2003 2008 28722 1.00
MIA01 Dolphin Stadium 2003 2008 29849 1.00
CLE08 Jacobs Field 2003 2008 28136 0.99
BAL12 Camden Yards 2003 2008 29103 0.99
PIT08 P.N.C. Park 2003 2008 27652 0.98
PHO01 Bank One Ballpark 2003 2008 28810 0.98
SJU01 Hiram Bithorn 2003 2004 2598 0.98
SAN01 Jack Murphy 2003 2003 4943 0.98
LOS03 Dodger Stadium 2003 2008 29555 0.98
CHI11 Wrigley Field 2003 2008 28663 0.96
SAN02 PetCo Park 2004 2008 24432 0.95
NYC17 Shea Stadium 2003 2008 29299 0.92
BOS07 Fenway Park 2003 2008 28311 0.86
ATL02 Turner Field 2003 2008 29016 0.86
ANA01 Anaheim Stadium 2003 2008 26490 0.86
HOU03 Minute Maid Park 2003 2008 28271 0.82
MIN03 Metrodome 2003 2008 28048 0.80
Good stuff
Here’s another thought (me being a statistical idiot) could the fact that Cain, Santana and Beckett have high LDs be more related to the fact that they miss a lot of bats? It’s not total number of LDs but LD; i.e. the only way they give up hits is if someone hits a LD off of them. So maybe LD% isn’t that telling.
great post.
"Was this really necsarry?" - cowpoke/hurler hurley
LD%
I was doing regressions for pitcher values and for line drive percentages year-to-year (y-t-y), I found a correlation® of 0.238082.
This was only over 49 pitchers, but I think it’s fairly close to the correlation for overall pitchers.
In contrast, both GB% (r=0.814) and FB% (r=0.832) were highly correlated y-t-y. And IFFB% was more correlated (r=0.427).
So major league pitchers show little differentiation between themselves of being able to control LD%. And part of it, from laxtonto’s post, looks like it could be from park effects/scorer bias.
R

by 



































