Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
FINAL THESIS.pdf (1.96 MB)
ETD Abstract Container
Abstract Header
An Exploration of the First Pitch in Baseball
Author Info
Spangler, Ashley
ORCID® Identifier
http://orcid.org/0000-0002-2349-9276
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1490300154782369
Abstract Details
Year and Degree
2017, Master of Science (MS), Bowling Green State University, Applied Statistics (Math).
Abstract
Sabermetrics is the statistical analysis of baseball. This research was started in the 1950’s and since then has become increasingly popular. Over the last couple of years, the availability of data within the sport of baseball has exploded. From mainly three sources, we have access to a vast arrange of statistics. This research investigates the importance of count and the first pitch in baseball. The first pitch determines whether the hitter or the pitcher has the advantage in the at-bat and can set the precedence for the rest of the at-bat. Exploratory methods are used to investigate and summarize the relationships between various variables through the use of tables, contour plots, scatterplots, and line graphs. As the pitcher’s thrown first pitch strike percentage increases, the number of innings pitched per game increases, Walks per Hits per Innings Pitched (WHIP) decreases, walk percentage decreases, and strikeout percentage increases. 64% of the first pitches thrown are either four-seam or two-seam fastballs or sliders, which are all fast pitches. Over 50% of the first pitches are in the strike zone. Singles, doubles, triples, and homeruns are more likely to be hit on the first pitch. Pitchers have the highest pitching statistics when the hitter swings and misses compared to putting the ball in play, a called strike, or a ball on the first pitch. When the first pitch is a ball, the hitters have the highest hitting statistics. Generalized Additive Models (GAM) and Logistic Regression Models are used to discover the factors significant in predicting the probability that hitters swing. Logistic models were created for all pitches and then first pitches for all players. Next, four logistic models were created for four different players. In the majority of the models, count type (whether the count favored the pitcher, hitter, or was neutral), the distance in feet of the pitch from the center of the strike zone, and if runners were on base or not were significant in predicting the probability of swinging. Overall results suggest that hitters have different hitting strategies and swinging on the first pitch, or in general, depends on the hitter. More mature hitters tend to not swing on the first pitch.
Committee
James Albert (Advisor)
Christopher Rump (Committee Member)
John Chen (Committee Member)
Pages
143 p.
Subject Headings
Statistics
Keywords
baseball
;
statistics
;
swing
;
pitch count
;
count
;
logistic model
;
first pitch
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Spangler, A. (2017).
An Exploration of the First Pitch in Baseball
[Master's thesis, Bowling Green State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1490300154782369
APA Style (7th edition)
Spangler, Ashley.
An Exploration of the First Pitch in Baseball.
2017. Bowling Green State University, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1490300154782369.
MLA Style (8th edition)
Spangler, Ashley. "An Exploration of the First Pitch in Baseball." Master's thesis, Bowling Green State University, 2017. http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1490300154782369
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
bgsu1490300154782369
Download Count:
2,104
Copyright Info
© 2017, all rights reserved.
This open access ETD is published by Bowling Green State University and OhioLINK.