All the stats are based off of Pitch f/x data from last year, except I included starts from Jeff Niemann and Clay Buchholz this year (prior to May 12th), since their curveballs are among the best stuff-wise in baseball and there was only Futures Game data for the two last year.
The links containing the stats are below. The "Basic Data" gives stats like "contact rates" and "fish rates" for pitches of every single pitcher that had pitch f/x data last year (that's 625 pitchers). It also includes month-by-month pitch selection and pitch selection in certain situations such as "with bases loaded" or "in the 5th and 6th innings". The "Basic Raw Data" gives much of the same information as the basic data except in numbers and not percents, which would be particularly useful when looking at the pitch selection splits. The "Monthly/Locational Splits" gives contact rates and other stats for each month for the pitch and also how the pitch up in the zone or away from the hitter did. All three are .csv files and can be saved and opened in excel or can be opened in Zoho spreadsheets online for free by supplying the web address of the .csv files (the first and third are about 3 mb each so Zoho may crash your web server). The "leading pitches" links give the top 15 or so pitches per pitch type for the category and (for some categories) the worse pitches.
Be aware that the designating of changeups and fastballs and all else is not 100% correct, but it's not too far from it. The Pitch f/x data doesn't cover the whole 2007 season, but about half of the games and just about every game at the end of the season. There are certainly sampling issues with many of the pitchers and their pitches. In the "leading pitches" links I generally used a qualifier of 50 pitches, which is still a small sample. Lastly, stats may be called percents, but are actually left in ratio form in the way that SLG% or OBP are.
Leading pitches in terms of: Speed | Sink/Drop | Rise | Run/Cut | Total Spin | Downward Acceleration | Upward Accel. | Horizontal Accel. | Runs/100 | Contact Rate | BABIP | SLGBIP | Ground Out% | Air Out% | Swinging Strike% | Called Strike% | Strike% | Ball% | In Strike Zone% | Out of Strike Zone% | Fish Rate | Badball/ConOZ% | Swung at% | Command
T: arm the ball is thrown with; P or pitch: type of pitch thrown; %: percent of the time that pitch is thrown; count: number of data points for that pitch; R/100: runs allowed (if positive) or prevented (if negative) above average by the indicated pitch per 100 pitches thrown by indicated pitcher, more detail here; BABIP: batting average when that pitch type is hit into play including home runs; SLGBIP: slugging percentage on balls hit into play; S%: percent of pitches resulting in a strike (anything but a ball) for said pitch; %IZ: percentage of pitch thrown in the strike zone. Con or Con%: contact rate ((fouls + hit into play balls)/(pitches swung at)); Sw%: % swung at; SW%: % of pitches that are result in a swinging strike; B%: % that are balls; CS%: called strike %; F%: % of pitches fouled off; InP%: % hit into play; GO: percent of times groundouts result when that pitch is hit into play; ect., ect. for fly outs (FO), pop outs (PO), line outs (LO), singles (1B), doubles (2B), triples (3B), home runs (HR). Fish: Dan Fox stat, how often the batter swings at the pitch when it's out of the strike zone; BadBall/ConOZ: another Dan Fox stat, percentage of time swings at a ball out of the strike zone make contact, so the lower the better for the pitcher; %SWOZ: percent of swinging strikes for indicated pitch that are out of the strike zone; SO: percent of pitcher's strikeouts that result from this pitch; sSO: percent of pitcher's swinging strikeouts that result from this pitch; cSO: percent of pitcher's looking strikeouts that result from this pitch; K/100: number of strikeouts resulting from indicated pitch per 100 pitches thrown by pitcher, adding up all K/100 values for a pitcher would give his K/100 regardless of pitch selection; S/E: command indicator, the average distance from the closest strike zone edge for strikes and S/C being from the closest corner; B/E: average distance from the closest strike zone edge for balls; Com P/E: average for all pitches; SW B/E: average distance for swinging strike out of the SZ;
Pitch f/x: V0: average speed in mph when pitch is 55 feet from home plate; Vf: speed in mph just before the ball crosses the plate (1.417 feet from plate); vx0: initial horizontal velocity measured 55 feet from feet in feet per second; vy0: initial velocity in direction towards plate; vz0: initial vertical velocity; vxf: final measured horizontal velocity 1.417 feet from plate in feet per second; vyf: final velocity in direction towards plate; vzf: final vertical velocity; time: time it takes the ball to travel 55 feet from home plate to 1.417 feet from the plate; ax: horizontal acceleration in feet per second per second, a pitch accelerating away from a right-handed batter has a positive value; ay: acceleration in direction towards plate; az: vertical acceleration; break: maximum break of the pitch in inches between initial and final measurement; bre_x: horizontal component of break; bre_z: vertical downward component of break; pfxx: horizontal movement or break in inches, difference in final location between that pitch and if that same pitch was thrown with no spin but same speed, a pitch that breaks into a right-handed batter has a negative pfxx value; pfxz: same as pfxx, but vertically with a negative value meaning downward movement; pfx: total length of movement, square-root(pfxx^2+pfxz^2); pfxx-d: like pfxx but removes the effects of air resistance that creates a drag force, go here for more details; pfxz-d: like pfxx-d but in vertical direction; pfxzg: adds the effects of gravity back into pfxz giving a number in inches that more closely resembles the vertical break/movement observers see; spin dir: the direction in degrees the balls spin towards with 0 degrees being a ball with no horizontal spin/movement and at least some downward movement and top spin, 180 degrees would be upward movement and bottom spin (bottom spin like a fastball); rpm: average spin rate of ball in revolutions per minute, formula for "rpm" and "spin theta/dir" is available here, dividing # by 60 and multiplying by "time" would give the # of revolutions while the ball is in flight; angle: angle of the break; x0, z0: initial coordinates in feet 55 feet from plate, could be approximated as the release point. x, z: coordinates of the ball when it crosses the plate in feet... for x, 0 is the center of the plate with edges being roughly 0.8 and -0.8 feet, negative being the left side of the plate from the catcher's perspective... for z the strike zone obviously changes a bit from batter to batter, but roughly between 1.5-3.5 feet above the ground;
Counts: what pitch is selected in the designated count (e.g. 1-2: 1 ball, 2 strike count); Inning: pitch selection in designated inning; #1: pitch selection for the first pitch of an inning and best to be ignored for relievers; Month: pitch selection by month; Outs: pitch selection with indicated # of outs and 0on/out meaning bases empty and no outs; Base Occupied: pitch selection with indicated bases occupied, 0 meaning bases empty; 1 meaning 1st base occupied; 2,3 meaning 2nd and 3rd base occupied; RISP: with runners in scoring position; RISP, 2out: with runners in scoring position and 2 outs; Tied: pitch selection in a tied game; Within 1 after 4: with pitching team is either down by 1 run, up by 1 run, or tied and the inning is the 5th or later; C&L: in Close and Late situations, meaning in the 7th inning or later with the batting team either ahead by one run, tied, or with the potential tying run at least on deck; L: pitch selection when pitching team is losing; W: when pitching team is winning; L BO: when pitching team is losing in a blowout, defined as 4 or more runs; W BO: when pitching team is winning by 4 or more runs;
v.R: pitch selection and outcomes vs. right handed bats; v.L: pitch selection and outcomes vs. left handed bats; RAA: runs allowed above average with T against all batters, v.R against righties and v.L against lefties ucR/100: R/100 uncorrected for the frequency the pitch is thrown, so the metric theorizes a world where only the indicated pitch is thrown, which of course would take away its effectiveness ucK/100: uncorrected like ucR/100 for strikeouts Pitch Speed Histogramish: percent of indicated pitch that falls in indicated speed category, a 99.8 mph pitch falls in the >98 category and not the >100 or >96 category; V.R After: pitch selection against right-handed batter after throwing indicated pitch in the column header; V.L After: pitch selection against lefties after throwing indicated pitch in the column header; stdev: the standard deviation of indicated stat, for instance a relatively large x0 and z0 means an inconsistent release point if you were to actually trust the pitch f/x data; NA: means the stat is not applicable for this pitcher's pitch because there's not enough data to calculate it.
more information on and analysis of the pitch f/x data...
-mlb.com has news and info here on the pitch f/x system and how to interpret the data.
-Dan Fox at bp.com and Joe P. Sheehan at BaseballAnalysts.com have great achived content.
-John Walsh, Mike Fast, and Josh Kalk among others have articles using pitch f/x at Hardball Times.
-Dan Brooks has a great analysis tool here and Josh Kalk has nice plots as well here.
Use any of the content here to your liking. It belongs to those who put out the Pitch f/x data more than me.
If you spot any errors in my data or possible errors, have any suggestions, have hate mail to send me, or anything else, please contact me at email@example.com.